[BugFix] Fix chunked prefill bugs in engine v1 (#844)
### What this PR does / why we need it? Fix the bugs when run deepseek model in engine v1. ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? CI passed with new added/existing test. --------- Signed-off-by: rjg-lyh <1318825571@qq.com>
This commit is contained in:
@@ -204,6 +204,9 @@ class NPUPlatform(Platform):
|
||||
"ascend_scheduler_config", None) is not None:
|
||||
additional_scheduler_config = additional_config.get(
|
||||
"ascend_scheduler_config")
|
||||
if vllm_config.scheduler_config.enable_chunked_prefill:
|
||||
additional_scheduler_config[
|
||||
"enable_chunked_prefill"] = True
|
||||
from vllm_ascend.core.schedule_config import \
|
||||
AscendSchedulerConfig
|
||||
ascend_scheduler_config = AscendSchedulerConfig.initialize_from_config(
|
||||
|
||||
Reference in New Issue
Block a user