fix error async_scheduler can't be enabled (#3127)

### What this PR does / why we need it?
PR #2894 make ascend_scheduler_config.enabled always be `True` for
non-mla models,when `ascend_scheduler_config.enabled=True `, it will
always initialize `AscendScheduler` which is a subclass of `Scheduler`,
but when we enbale async_scheduling,we need to initialize
`AsyncScheduler` in vllm, this will make async_scheduling can't be
enabled.

### Does this PR introduce _any_ user-facing change?
not-related

### How was this patch tested?
when user set `async_scheduling`, it means user don't want to use
`AscendScheduler`, so we shouldn't set `ascend_scheduler_config.enabled
= True`

- vLLM version: v0.10.2
- vLLM main:
f225ea7dd9

Signed-off-by: Ronald1995 <ronaldautomobile@163.com>
This commit is contained in:
Ronald
2025-09-26 08:51:54 +08:00
committed by GitHub
parent 14497b748d
commit 621aa7d270

View File

@@ -135,7 +135,8 @@ class NPUPlatform(Platform):
else:
structured_outputs_config = vllm_config.structured_outputs_config
if model_config is not None and not model_config.use_mla:
if (model_config is not None and not model_config.use_mla
and not scheduler_config.async_scheduling):
logger.info(
"Non-MLA LLMs forcibly disable the chunked prefill feature,"
"as the performance of operators supporting this feature "