xc-llm-ascend

Files

whx 5998704c08 [BugFix] Fix ascend scheduler bugs. (#822)

This PR fixes two bugs in AscendScheduler:
1. Under high concurrency, the length of the running queue may exceed
the max_num_seqs limit.
2. When some requests are preempted and recomputation is enabled, the
number of new tokens is computed incorrectly.
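
The scheduler code itself is not shown in this listing, so the following is only a minimal sketch of the two invariants the fixes appear to target, not the actual AscendScheduler implementation. The Request class and the helpers admit_waiting, preempt, and num_new_tokens are hypothetical names introduced for illustration. It assumes that recompute-style preemption discards all cached state, so a resumed request must reprocess its prompt plus every token it had already generated.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class Request:
    # Hypothetical stand-in for a scheduler request (not the real class).
    request_id: str
    prompt_token_ids: list
    output_token_ids: list = field(default_factory=list)
    num_computed_tokens: int = 0

    @property
    def num_tokens(self) -> int:
        # Total tokens the model must have processed: prompt + generated.
        return len(self.prompt_token_ids) + len(self.output_token_ids)


def admit_waiting(waiting: deque, running: list, max_num_seqs: int) -> list:
    """Move requests from waiting to running without exceeding max_num_seqs."""
    admitted = []
    # Re-check the limit before every admission, not once per scheduling step.
    while waiting and len(running) < max_num_seqs:
        req = waiting.popleft()
        running.append(req)
        admitted.append(req)
    return admitted


def preempt(request: Request) -> None:
    """Recompute-style preemption (assumed): drop all cached progress."""
    request.num_computed_tokens = 0


def num_new_tokens(request: Request) -> int:
    """Tokens still to be computed, including outputs generated before preemption."""
    return request.num_tokens - request.num_computed_tokens
```

With this accounting, a request with a 4-token prompt that has generated 2 tokens needs 1 new token per decode step while running, and all 6 tokens again after being preempted for recomputation.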

Signed-off-by: whx-sjtu <2952154980@qq.com>

2025-05-12 21:15:17 +08:00

__init__.py

[Scheduler] Add AscendScheduler. (#543)

2025-04-17 19:31:50 +08:00

schedule_config.py

[MISC] fix format check error (#654)

2025-04-29 11:14:19 +08:00

scheduler.py

[BugFix] Fix ascend scheduler bugs. (#822)

2025-05-12 21:15:17 +08:00