xc-llm-ascend

Files

hucong 12bc78d252 [v0.11.0][BugFix][P/D] Modify the recalculation logic to prevent waiting requests from filling up the D node KVCache (#3686)

### What this PR does / why we need it?
Modify the recalculation logic to prevent waiting requests from filling
up the D node KVCache

Signed-off-by: underfituu <hzhucong@163.com>

2025-10-25 09:15:42 +08:00

__init__.py

[Scheduler] Add AscendScheduler. (#543)

2025-04-17 19:31:50 +08:00

recompute_schedule_config.py

[Bugfix] Route requests requiring KVC recomputation from the decode instance to the P instance (#3448)

2025-10-18 15:56:44 +08:00

recompute_scheduler.py

[v0.11.0][BugFix][P/D] Modify the recalculation logic to prevent waiting requests from filling up the D node KVCache (#3686)

2025-10-25 09:15:42 +08:00

schedule_config.py

[CORE] concurrent partial prefills (#2372)

2025-09-24 17:12:55 +08:00

scheduler.py

[BugFix] Fix ascend scheduler assert error (#3191)

2025-09-28 18:22:08 +08:00