EngineX/xc-llm-ascend
761bd3d9d72e72e5558cf9dd7343a34ea8b9c10b
xc-llm-ascend/tests
wangxiyuan b350edae9a [UT] refactor test_expert_load_balancer and fix broken CI (#1293)
Refactor test_expert_load_balancer to match the UT code style.

This PR also fixes the breaking change from
https://github.com/vllm-project/vllm/pull/16188/files#diff-e2942ece30a5c580437694ffb964bfc664b510c59244c08e5921b8f5cefb4280

This is just a quick fix; embedding support on V1 will be added later.

Closes: https://github.com/vllm-project/vllm-ascend/issues/1299

Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>
2025-06-20 01:02:52 +08:00
e2e | [UT] refactor test_expert_load_balancer and fix broken CI (#1293) | 2025-06-20 01:02:52 +08:00
multicard | [DP][V1] Fix rank set in DP scenario & Bump torch-npu version to 2.5.1.post1.dev20250528 (#1235) | 2025-06-16 23:09:53 +08:00
ut | [UT] refactor test_expert_load_balancer and fix broken CI (#1293) | 2025-06-20 01:02:52 +08:00
__init__.py | [SpecDecode] Add spec decode support (#500) | 2025-04-17 20:16:32 +08:00
conftest.py | [CI/UT][Graph] Add ut for torchair graph mode (#1103) | 2025-06-14 16:59:00 +08:00
model_utils.py | [CI] Refactor CI (#952) | 2025-05-28 06:31:35 +08:00
utils.py | [ModelRunner] Support embedding inputs (#916) | 2025-06-06 20:21:13 +08:00