[BugFix] Fix chunked prefill bugs in engine v1 (#844)

### What this PR does / why we need it?
Fix the bugs when run deepseek model in engine v1.

### Does this PR introduce _any_ user-facing change?
No.

### How was this patch tested?
CI passed with new added/existing test.

---------

Signed-off-by: rjg-lyh <1318825571@qq.com>
This commit is contained in:
rjg-lyh
2025-05-22 10:33:50 +08:00
committed by GitHub
parent a73bd6caf4
commit b4d6672d01
2 changed files with 14 additions and 3 deletions

View File

@@ -204,6 +204,9 @@ class NPUPlatform(Platform):
"ascend_scheduler_config", None) is not None:
additional_scheduler_config = additional_config.get(
"ascend_scheduler_config")
if vllm_config.scheduler_config.enable_chunked_prefill:
additional_scheduler_config[
"enable_chunked_prefill"] = True
from vllm_ascend.core.schedule_config import \
AscendSchedulerConfig
ascend_scheduler_config = AscendSchedulerConfig.initialize_from_config(