xc-llm-ascend

Author	SHA1	Message	Date
wangxiyuan	c0e12143a3	[CI] Fix UT failure (#2563 ) UT is broken by vLLM commit https://github.com/vllm-project/vllm/pull/23664 This PR mock the related config to recover the CI - vLLM version: v0.10.1.1 - vLLM main: `6dab89b8ec` Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>	2025-08-27 11:24:35 +08:00
wangxiyuan	7e494e94a9	[CI] Fix broken ci (#2530 ) vLLM commit https://github.com/vllm-project/vllm/pull/22711 changed the encode cache entries logic, this PR adapt the same change for vllm ascend to make CI happy. Co-Authored-By: zhoux77899 <zhouxiang100@huawei.com> - vLLM version: v0.10.1.1 - vLLM main: `0ff902f3b4` Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>	2025-08-26 07:42:24 +08:00
linfeng-yuan	4af5b80606	[Scheduler] validate max_num_batched_tokens and max_model_len in AscendSchedulerConfig (#2434 ) ### What this PR does / why we need it? Add configuration check logic for ascend scheduler: if chunked_prefill is disabled, max_num_batched_tokens couldn't be less than max_model_len, following vLLM; ### Does this PR introduce _any_ user-facing change? users cannot set max_num_batched_tokens smaller than max_model_len with ascend scheduler ### How was this patch tested? CI and vllm serving passed - vLLM version: v0.10.0 - vLLM main: `f77a0802b7` Signed-off-by: linfeng-yuan <1102311262@qq.com>	2025-08-23 19:39:44 +08:00
Mengqing Cao	b0403f8d8a	[CI] fix ci (#2464 ) ### What this PR does / why we need it? 1. use action/checkout@v5 instead of v4 2. remove dbo test case because there is issue with it and will be refactored later 3. make vllm-ascend compatible with vllm v0.10.1.1 and add CI for it 4. fix sampler api changes introduced by https://github.com/vllm-project/vllm/pull/22387 6. fix qwen3 moe config changes intruoduced by https://github.com/vllm-project/vllm/pull/20562 7. fix kvcache block changes introduced by https://github.com/vllm-project/vllm/pull/23262 ### Does this PR introduce _any_ user-facing change? N/A ### How was this patch tested? CI passed with existing test. - vLLM version: v0.10.0 - vLLM main: `0c6e40bbaa` --------- Signed-off-by: MengqingCao <cmq0113@163.com>	2025-08-22 07:30:48 +08:00
wangxiyuan	eccfb715f6	[CI] Fix UT (#2452 ) Make UT CI happy - vLLM version: v0.10.0 - vLLM main: `d983769c41` --------- Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com> Signed-off-by: MengqingCao <cmq0113@163.com> Co-authored-by: MengqingCao <cmq0113@163.com>	2025-08-20 16:26:07 +08:00
Mengqing Cao	1327f9be1c	Fix some ci issue and refactor modelrunner (#2445 ) ### What this PR does / why we need it? Fix some ci issue and refactor modelrunner ### Does this PR introduce _any_ user-facing change? N/A ### How was this patch tested? CI passed with existing test. - vLLM version: v0.10.0 - vLLM main: `4d9c61993a` --------- Signed-off-by: wangli <wangli858794774@gmail.com> Signed-off-by: MengqingCao <cmq0113@163.com> Signed-off-by: weiguihua2 <weiguihua2@huawei.com> Co-authored-by: wangli <wangli858794774@gmail.com> Co-authored-by: weiguihua2 <weiguihua2@huawei.com>	2025-08-20 09:01:04 +08:00
Mengqing Cao	61866b8ac6	[Quickfix] update CachedRequestState as NewRequestData changed (#2367 ) ### What this PR does / why we need it? 1. update `CachedRequestState` as `NewRequestData` changed in https://github.com/vllm-project/vllm/pull/22570 2. drop maintenance of vllm v0.10.0 in the branch main ### Does this PR introduce _any_ user-facing change? N/A ### How was this patch tested? CI passed with existing test. - vLLM version: v0.10.0 - vLLM main: `92ff41abea` --------- Signed-off-by: MengqingCao <cmq0113@163.com>	2025-08-15 07:35:27 +08:00
SunnyLee151064	ae560f7131	[Test] Add uts for files in /core (#1957 ) ### What this PR does / why we need it? Add uts for files in folder /core ### Does this PR introduce _any_ user-facing change? No - vLLM version: v0.9.2 - vLLM main: `5a19a6c670` --------- Signed-off-by: lwq <liwenquan5@huawei.com> Co-authored-by: lwq <liwenquan5@huawei.com>	2025-07-25 09:48:19 +08:00
JohnJan	ce4970eee0	[Test] Add unit test for schedule_config.py (#1590 ) What this PR does / why we need it? According to issue https://github.com/vllm-project/vllm-ascend/issues/1298 , this pull request adds unit test code for schedule_config.py. Does this PR introduce any user-facing change? No How was this patch tested? CI passed with new added/existing test. - vLLM version: v0.9.2 - vLLM main: `8d0a01a5f2`	2025-07-22 11:43:25 +08:00

9 Commits