xc-llm-ascend

Author	SHA1	Message	Date
Li Wang	2967e5e22a	[Benchmark] Correctly kill vllm process in performance benchamrk (#2782 ) ### What this PR does / why we need it? vLLM now names the process with VLLM prefix after https://github.com/vllm-project/vllm/pull/21445, we should kill the correct process name after one iteration benchmark to avoid OOM issue ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? - vLLM version: v0.10.1.1 - vLLM main: `e599e2c65e` --------- Signed-off-by: wangli <wangli858794774@gmail.com>	2025-09-07 10:36:34 +08:00
Li Wang	9cd4ac76a1	[CI] Remove benchmark patch and increase the scheduler frequency (#1762 ) ### What this PR does / why we need it? This pr purpose to do the following things: 1. Remove `benchmark_datasets.py` patch 2. Increase the scheduler frequency to 2 times per day, due to the recent large number of daily submissions, we need to increase the default test time(6h) ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? - vLLM version: v0.9.2 - vLLM main: `247102f07f` --------- Signed-off-by: wangli <wangli858794774@gmail.com>	2025-07-13 20:00:35 +08:00
zhangxinyuehfad	4e910186de	[CI/UT] Unify model usage via ModelScope in CI (#1207 ) ### What this PR does / why we need it? Unify Model Usage via ModelScope ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? CI passed Signed-off-by: hfadzxy <starmoon_zhang@163.com>	2025-07-04 10:52:17 +08:00
Li Wang	6db7dc2c85	[Benchmark] Refactor perf script to use benchmark cli (#1524 ) ### What this PR does / why we need it? Since, `vllm bench` cli has optimized enough for production use(support more datasets), we are now do not need to copy vllm codes, now , with vllm installed, we can easily use the benchmark cli ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? CI passed --------- Signed-off-by: wangli <wangli858794774@gmail.com>	2025-06-30 23:42:04 +08:00
Li Wang	c563a08f0a	[CI] Fix nightly benchmark (#1453 ) ### What this PR does / why we need it? Sometimes the performance benchmark workflow may fail. We hope to add a prompt when the operation fails and not upload the dirty data of the failed operation. --------- Signed-off-by: wangli <wangli858794774@gmail.com>	2025-06-26 19:39:18 +08:00
Li Wang	76dacf3fa0	[CI][Benchmark] Optimize performance benchmark workflow (#1039 ) ### What this PR does / why we need it? This is a post patch of #1014, for some convenience optimization - Set cached dataset path for speed - Use pypi to install escli-tool - Add benchmark results convert script to have a developer-friendly result - Patch the `benchmark_dataset.py` to disable streaming load for internet - Add more trigger ways for different purpose, `pr` for debug, `schedule` for daily test, `dispatch` and `pr-labled` for manual testing of a single(current) commit - Disable latency test for `qwen-2.5-vl`, (This script does not support multi-modal yet) ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? CI passed --------- Signed-off-by: wangli <wangli858794774@gmail.com>	2025-06-03 23:38:34 +08:00
Li Wang	d9fb027068	[CI] Add benchmark workflows (#1014 ) ### What this PR does / why we need it? Add benchmark workflows ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? Run locally --------- Signed-off-by: wangli <wangli858794774@gmail.com>	2025-05-30 22:42:44 +08:00
Li Wang	866ce7168c	[Benchmark] Download model from modelscope (#634 ) ### What this PR does / why we need it? - Run benchmark scripts will Download model from modelscope Signed-off-by: wangli <wangli858794774@gmail.com>	2025-04-24 14:48:24 +08:00
Li Wang	9a175ca0fc	[Doc]Add benchmark scripts (#74 ) ### What this PR does / why we need it? The purpose of this PR is to add benchmark scripts for npu, developers can easily run performance tests on their own machines with one line of code . --------- Signed-off-by: wangli <wangli858794774@gmail.com>	2025-03-21 15:54:34 +08:00

9 Commits