xc-llm-ascend/ut at 18b90b501d6aad1d9426dcdee1ccfbe8139dd47d - xc-llm-ascend - Gitea: Git with a cup of tea

EngineX/xc-llm-ascend

Files

History

wangxiyuan 7f2673ea2d upgrade vLLM to main (#4608 )

1. fix https://github.com/vllm-project/vllm/pull/28542
The model structure modifications we involved in are:
     - Qwen2.5-VL(still exist some patch)
     - Qwen2-VL
     - Qwen2
     - DeepSeek series
     - Qwen-moe series
2. fix https://github.com/vllm-project/vllm/pull/29121
   the output token now  type changed from np to `list[list[int]]`

3. fix https://github.com/vllm-project/vllm/pull/29262
    `xformers` backend for multimodal now has been deprecated
4. fix https://github.com/vllm-project/vllm/pull/29342

5. fix https://github.com/vllm-project/vllm/pull/28579
6. fix https://github.com/vllm-project/vllm/pull/28718
7. fix https://github.com/vllm-project/vllm/issues/28665
8. fix https://github.com/vllm-project/vllm/pull/26847
vllm introduced the `optimization-level`, some default config has been
changed, and the param `--enforce-eager` has been deprecated
9. fix http://github.com/vllm-project/vllm/pull/29223 it retuns tuple
for sampler.
10. fix https://github.com/vllm-project/vllm/pull/29471 we'll remove the
related patch to avoid this kind of error.

Co-authored-by: hfadzxy <starmoon_zhang@163.com>
Co-authored-by: wangli <wangli858794774@gmail.com>


- vLLM version: v0.11.2

---------

Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>
Signed-off-by: wangli <wangli858794774@gmail.com>
Signed-off-by: hfadzxy <starmoon_zhang@163.com>
Co-authored-by: wangli <wangli858794774@gmail.com>
Co-authored-by: hfadzxy <starmoon_zhang@163.com>

2025-12-02 22:10:52 +08:00

..

upgrade vLLM to main (#4608 )

2025-12-02 22:10:52 +08:00

upgrade vLLM to main (#4608 )

2025-12-02 22:10:52 +08:00

upgrade vLLM to main (#4608 )

2025-12-02 22:10:52 +08:00

device_allocator

add ut for device allocator/camem and mutistream/layers (#2037 )

2025-07-31 19:17:27 +08:00

[Bugfix] Fix bug with establishing the flashcomm2 and pp communication domains. (#4458 )

2025-12-01 15:56:22 +08:00

eplb redundant expert bugfix (#4291 )

2025-11-21 14:24:35 +08:00

[CI] Add unit test framework (#1201 )

2025-06-16 18:32:28 +08:00

upgrade vLLM to main (#4608 )

2025-12-02 22:10:52 +08:00

model_loader/netloader

Drop 0.11.0 support (#4377 )

2025-11-24 17:08:20 +08:00

Move mla to ops module (#4575 )

2025-11-29 18:36:55 +08:00

[Feat] shared expert dp for deepseek_mtp (#3811 )

2025-12-01 20:44:11 +08:00

patch/worker/patch_common

[Refactor] refactor patch module (#3555 )

2025-10-21 20:19:46 +08:00

[CI] Drop ascend scheduler from test (#4613 )

2025-12-02 13:18:17 +08:00

[UT] Fix test_sample_recovered_tokens_pytorch_autoregressive (#3434 )

2025-10-24 11:20:57 +08:00

upgrade vLLM to main (#4608 )

2025-12-02 22:10:52 +08:00

upgrade vLLM to main (#4608 )

2025-12-02 22:10:52 +08:00

[bugfix] fix ray start failed: local_world_size cannot little than visible device count error (#4457 )

2025-11-27 21:18:32 +08:00

__init__.py

[2/4][Refactor] Refactor torchair utils (#1892 )

2025-07-21 19:43:30 +08:00

base.py

[Feature]: implement the fusion of allreduce and matmul in prefill phase when tp is enabled (#1926 )

2025-07-28 15:13:37 +08:00

conftest.py

[1/N][CustomOp] Register activation customop instead of overwrite forward_oot (#1841 )

2025-07-18 23:07:14 +08:00

test_ascend_config.py

[CI] Drop ascend scheduler from test (#4613 )

2025-12-02 13:18:17 +08:00

test_envs.py

[Misc] Remove redundant imported envs, using envs_ascend instead (#2193 )

2025-08-14 09:33:39 +08:00

test_platform.py

[CI] Drop ascend scheduler from test (#4613 )

2025-12-02 13:18:17 +08:00

test_utils.py

[CI] Drop ascend scheduler from test (#4613 )

2025-12-02 13:18:17 +08:00