xc-llm-ascend

Files

fan2956 f2d8493221 [BugFix] Fix ascend scheduler assert error (#3191 )

### What this PR does / why we need it?
Running multimodal model with ascend scheduler may cause assert error
【assert (request.num_tokens - request.num_computed_tokens) == 1】

### Does this PR introduce _any_ user-facing change?
No

### How was this patch tested?


- vLLM version: v0.10.2
- vLLM main:
17b4c6685c

---------

Signed-off-by: fan2956 <zhoufan53@huawei.com>

2025-09-28 18:22:08 +08:00

__init__.py

[Scheduler] Add AscendScheduler. (#543 )

2025-04-17 19:31:50 +08:00

schedule_config.py

[CORE] concurrent partial prefills (#2372 )

2025-09-24 17:12:55 +08:00

scheduler.py

[BugFix] Fix ascend scheduler assert error (#3191 )

2025-09-28 18:22:08 +08:00