xc-llm-ascend/vllm_ascend
whx 3393d53b36 [Scheduler][MTP] Add support for speculative decoding in AscendScheduler. (#943)
This PR adds support for speculative decoding in AscendScheduler.
It also includes partial support for disaggregated prefill; full support
will be merged in a follow-up PR.

---------

Signed-off-by: whx-sjtu <2952154980@qq.com>
2025-06-11 20:55:44 +08:00