xc-llm-ascend

Author	SHA1	Message	Date
Li Wang	2967e5e22a	[Benchmark] Correctly kill vllm process in performance benchamrk (#2782 ) ### What this PR does / why we need it? vLLM now names the process with VLLM prefix after https://github.com/vllm-project/vllm/pull/21445, we should kill the correct process name after one iteration benchmark to avoid OOM issue ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? - vLLM version: v0.10.1.1 - vLLM main: `e599e2c65e` --------- Signed-off-by: wangli <wangli858794774@gmail.com>	2025-09-07 10:36:34 +08:00
Li Wang	6db7dc2c85	[Benchmark] Refactor perf script to use benchmark cli (#1524 ) ### What this PR does / why we need it? Since, `vllm bench` cli has optimized enough for production use(support more datasets), we are now do not need to copy vllm codes, now , with vllm installed, we can easily use the benchmark cli ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? CI passed --------- Signed-off-by: wangli <wangli858794774@gmail.com>	2025-06-30 23:42:04 +08:00
Li Wang	dd207cb261	[CI][Benchmark] Add new model and v1 test to perf benchmarks (#1099 ) ### What this PR does / why we need it? - Add qwen2.5-7b-instruct test - Add v1 test --------- Signed-off-by: wangli <wangli858794774@gmail.com>	2025-06-12 10:46:41 +08:00
Li Wang	76dacf3fa0	[CI][Benchmark] Optimize performance benchmark workflow (#1039 ) ### What this PR does / why we need it? This is a post patch of #1014, for some convenience optimization - Set cached dataset path for speed - Use pypi to install escli-tool - Add benchmark results convert script to have a developer-friendly result - Patch the `benchmark_dataset.py` to disable streaming load for internet - Add more trigger ways for different purpose, `pr` for debug, `schedule` for daily test, `dispatch` and `pr-labled` for manual testing of a single(current) commit - Disable latency test for `qwen-2.5-vl`, (This script does not support multi-modal yet) ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? CI passed --------- Signed-off-by: wangli <wangli858794774@gmail.com>	2025-06-03 23:38:34 +08:00
Li Wang	d9fb027068	[CI] Add benchmark workflows (#1014 ) ### What this PR does / why we need it? Add benchmark workflows ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? Run locally --------- Signed-off-by: wangli <wangli858794774@gmail.com>	2025-05-30 22:42:44 +08:00
Li Wang	218f21de21	[Benchmarks] Add qwen2.5-7b test (#763 ) ### What this PR does / why we need it? - Add qwen2.5-7b test - Optimize the documentation to be more developer-friendly Signed-off-by: xuedinge233 <damow890@gmail.com> Co-authored-by: xuedinge233 <damow890@gmail.com>	2025-05-10 09:47:42 +08:00
Li Wang	866ce7168c	[Benchmark] Download model from modelscope (#634 ) ### What this PR does / why we need it? - Run benchmark scripts will Download model from modelscope Signed-off-by: wangli <wangli858794774@gmail.com>	2025-04-24 14:48:24 +08:00
Li Wang	9a175ca0fc	[Doc]Add benchmark scripts (#74 ) ### What this PR does / why we need it? The purpose of this PR is to add benchmark scripts for npu, developers can easily run performance tests on their own machines with one line of code . --------- Signed-off-by: wangli <wangli858794774@gmail.com>	2025-03-21 15:54:34 +08:00

8 Commits