xc-llm-ascend

Author	SHA1	Message	Date
Li Wang	595d3484c4	[Nightly] Move ops to the correct path (#5642 ) ### What this PR does / why we need it? Move ops to the correct path where they belong - vLLM version: v0.13.0 - vLLM main: `2f4e6548ef` Signed-off-by: wangli <wangli858794774@gmail.com>	2026-01-09 09:23:36 +08:00
meihanc	6315a31399	[CI] Add triton ascend in nightly CI (#5716 ) ### What this PR does / why we need it? Add triton ascend in nightly ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? - vLLM version: v0.13.0 - vLLM main: `2f4e6548ef` --------- Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com>	2026-01-08 21:17:32 +08:00
meihanc	c1dcddce3f	[CI]update bisheng version (#5621 ) ### What this PR does / why we need it? update bisheng version in 20260105 - vLLM version: v0.13.0 - vLLM main: `8be6432bda` Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com>	2026-01-06 15:22:22 +08:00
meihanc	a034941d06	[CI] update triton-ascend version (#5584 ) ### What this PR does / why we need it? update triton-ascend version to 20260105 - vLLM version: v0.13.0 - vLLM main: `7157596103` --------- Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com>	2026-01-05 20:20:11 +08:00
meihanc	fbb93ad8f2	[bugfix]update bishengir source envs (#5582 ) ### What this PR does / why we need it? Due to the update of the Bisheng version's installation path, the corresponding source path in the environment variables needs to be updated. - vLLM version: v0.13.0 - vLLM main: `7157596103` --------- Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com>	2026-01-05 09:13:40 +08:00
meihanc	8c4e9bb76b	[CI]update triton ascend version (#5392 ) ### What this PR does / why we need it? update triton-ascend version to 1229 and bisheng version in 1225; - vLLM version: release/v0.13.0 - vLLM main: `254f6b9867` --------- Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com>	2025-12-30 09:51:45 +08:00
Li Wang	c2f776b846	[Nightly] Initial logging for nightly multi-node testing (#5362 ) ### What this PR does / why we need it? Currently, our multi-node logs only show the master node's logs (via the Kubernetes API), which is insufficient for effective problem localization if other nodes experience issues. Therefore, this pull request adds the ability to upload logs for other nodes. Next plan: Output structured directory logs, including logs from each node and the polog. ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? - vLLM version: release/v0.13.0 - vLLM main: `bc0a5a0c08` --------- Signed-off-by: wangli <wangli858794774@gmail.com>	2025-12-26 11:39:07 +08:00
Li Wang	2f03a2f4a4	[CI] Skip some failed ops tests (#5309 ) ### What this PR does / why we need it? Skip some failed ops tests - vLLM version: release/v0.13.0 - vLLM main: `5fbfa8d9ef` Signed-off-by: wangli <wangli858794774@gmail.com>	2025-12-24 18:29:34 +08:00
Li Wang	243ab7d720	[CI] Use offline mode for nightly test (#5187 ) ### What this PR does / why we need it? For single node test, the lack of a retry mechanism for accessing ModelScope resulted in an HTTP 400 error sometimes. I recommend using a local offline cache instead. - vLLM version: v0.12.0 - vLLM main: `ad32e3e19c` --------- Signed-off-by: wangli <wangli858794774@gmail.com>	2025-12-19 21:21:42 +08:00
zhangxinyuehfad	cee9b715b5	[Bugfix] install trition for test_custom_op (#5112 ) ### What this PR does / why we need it? 1. install trition for test_custom_op 2. tests/e2e/nightly/ops test timeout, set timeout-minutes let it test over: https://github.com/vllm-project/vllm-ascend/actions/runs/20326482497/job/58392757707?pr=5112 3. ignore test_dispatch_ffn_combine until it is fixed @kiscad ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? - vLLM version: v0.12.0 - vLLM main: `ad32e3e19c` Signed-off-by: hfadzxy <starmoon_zhang@163.com>	2025-12-19 10:40:46 +08:00
whx	cee521bad5	[Nightly][BugFix] Install triton for nightly e2e op test. (#5096 ) ### What this PR does / why we need it? This PR adds triton-ascend installation to nightly e2e single card environment. Signed-off-by: whx-sjtu <2952154980@qq.com>	2025-12-16 21:31:53 +08:00
Li Wang	c6f60e8dd8	[Nightly] Upgrade single node test to latest main (#5101 ) ### What this PR does / why we need it? Sync source code from vllm-ascend on nightly tests Signed-off-by: wangli <wangli858794774@gmail.com>	2025-12-16 21:28:45 +08:00
SILONG ZENG	ab37a7d5ae	[main]Upgrade cann to 8.3rc2 (#4350 ) ### What this PR does / why we need it? Upgrade cann to 8.3rc2 ### Does this PR introduce _any_ user-facing change? Yes, docker image will use 8.3.RC2 - vLLM version: v0.11.2 - vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.2 --------- Signed-off-by: MrZ20 <2609716663@qq.com>	2025-11-28 14:06:01 +08:00
zhangxinyuehfad	67f2b3a031	[Test] Add deepseek v3.2 exp nightly test (#4191 ) ### What this PR does / why we need it? - skip the nightly image build when the github event is pull_request - set imagepullpolicy as alway for multi_node test - move multi_node tests ahead to have some resource clean first - do not relevant nightly image build with nightly tests for tolerance - vLLM version: v0.11.0 - vLLM main: `2918c1b49c` --------- Signed-off-by: hfadzxy <starmoon_zhang@163.com> Signed-off-by: wangli <wangli858794774@gmail.com> Co-authored-by: wangli <wangli858794774@gmail.com>	2025-11-14 15:46:10 +08:00
Li Wang	7294f89e43	[CI] Add daily images build for nightly ci (#3989 ) ### What this PR does / why we need it? Given the current excessively long build time of our nightly-ci, I recommend installing necessary, confirmed versions of packages in the Docker image to reduce the time required for integration testing. Including Mooncake vllm with fixed tags, This is expected to reduce nightly-ci duration by 2 hours. - vLLM version: v0.11.0 - vLLM main: `2918c1b49c` --------- Signed-off-by: wangli <wangli858794774@gmail.com>	2025-11-13 20:10:12 +08:00
zhangxinyuehfad	b77b4f1abf	[Test] Add nightly test for DeepSeek-V3.2-Exp (#3908 ) ### What this PR does / why we need it? Add nightly test for DeepSeek-V3.2-Exp ### How was this patch tested? test action： https://github.com/vllm-project/vllm-ascend/actions/runs/19156153634/job/54757008557?pr=3908 - vLLM version: v0.11.0 - vLLM main: `83f478bb19` --------- Signed-off-by: hfadzxy <starmoon_zhang@163.com>	2025-11-11 10:29:57 +08:00
wangxiyuan	cc2cd42ad3	Upgrade CANN to 8.3.rc1 (#3945 ) ### What this PR does / why we need it? This PR upgrade CANN from 8.2rc1 to 8.3rc1 and remove the CANN version check logic. TODO: we notice that UT runs failed with CANN 8.3 image. So the base image for UT is still 8.2. We'll fix it later. - vLLM version: v0.11.0 - vLLM main: `83f478bb19` Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>	2025-11-03 20:21:07 +08:00
Li Wang	eb0a2ee2d0	[CI] Optimize nightly CI (#3898 ) ### What this PR does / why we need it? This patch mainly fix the the problem of not being able to determine the exit status of the pod's entrypoint script and some other tiny optimizations: 1. Shorten wait for server timeout 2. fix typo 3. fix the issue of ais_bench failing to correctly access the proxy URL in a PD separation scenario. ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? - vLLM version: v0.11.0 - vLLM main: `83f478bb19` --------- Signed-off-by: wangli <wangli858794774@gmail.com>	2025-10-30 23:42:20 +08:00
Li Wang	4a2ab13743	[CI] Optimize nightly CI (#3858 ) ### What this PR does / why we need it? This patch optimize nightly CI: 1. Bug fixes ais_bench get None repo_type error 2. Fix A2 install kubectl error with arm arch 3. Fix the multi_node CI unable to determine whether the job was successful error ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? - vLLM version: v0.11.0rc3 - vLLM main: `83f478bb19` --------- Signed-off-by: wangli <wangli858794774@gmail.com>	2025-10-29 22:30:19 +08:00
Li Wang	60ee4af6d0	[CI] Add custom op to nightly (#3765 ) ### What this PR does / why we need it? 1. Add custom op to nightly tests, fix https://github.com/vllm-project/vllm-ascend/pull/3665 2. Correctly pass github secrets when using workflow_call, see https://docs.github.com/en/actions/how-tos/reuse-automations/reuse-workflows 3. Fix the single node mutual cancellation issue - vLLM version: v0.11.0rc3 - vLLM main: `c9461e05a4` --------- Signed-off-by: wangli <wangli858794774@gmail.com>	2025-10-27 14:07:03 +08:00
Li Wang	7f73c28a24	[CI][Doc] Optimize multi-node CI (#3565 ) ### What this PR does / why we need it? This pull request mainly do the following things: 1. Add a doc for multi-node CI, The main content is the mechanism principle and how to contribute 2. Simplify the config yaml for more developer-friendly 3. Optimized the mooncake installation script to prevent accidental failures during installation 4. Fix the workflow to ensure the kubernetes can be apply correctly 5. Add Qwen3-235B-W8A8 disaggregated_prefill test 6. Add GLM-4.5 multi dp test 7. Add 2p1d 4nodes disaggregated_prefill test 8. Refactor nightly tests ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? - vLLM version: v0.11.0rc3 - vLLM main: `17c540a993` --------- Signed-off-by: wangli <wangli858794774@gmail.com>	2025-10-25 09:23:47 +08:00

21 Commits