xc-llm-ascend

Files

whx 5a2c5be229 [BugFix][Cherry-pick] Cherry-pick PR 3675 to v0.11.0-dev (#3732 )

This PR cherry-picks the bugfix related with running multi-modal models
with AscendScheduler to v0.11.0-dev

Signed-off-by: hw_whx <wanghexiang7@huawei.com>
Co-authored-by: hw_whx <wanghexiang7@huawei.com>

2025-10-25 09:41:51 +08:00

__init__.py

[Scheduler] Add AscendScheduler. (#543 )

2025-04-17 19:31:50 +08:00

recompute_schedule_config.py

[Bugfix] Route requests requiring KVC recomputation from the decode instance to the P instance (#3448 )

2025-10-18 15:56:44 +08:00

recompute_scheduler.py

[v0.11.0][BugFix][P/D] Modify the recalculation logic to prevent waiting requests from filling up the D node KVCache (#3686 )

2025-10-25 09:15:42 +08:00

schedule_config.py

[BugFix][Cherry-pick] Cherry-pick PR 3675 to v0.11.0-dev (#3732 )

2025-10-25 09:41:51 +08:00

scheduler.py

[BugFix] Fix ascend scheduler assert error (#3191 )

2025-09-28 18:22:08 +08:00