Files
xc-llm-ascend/vllm_ascend
whx f286265791 [BugFix] Address PrefillCacheHit state to fix prefix cache accuracy bug (#1498)
When use AscendScheduler with prefix-cache enabled and chunk-prefill
disabled, there will be accuray problem because there is no branch in
mla_v1 to process this scenario. This PR fixes it.

Signed-off-by: whx-sjtu <2952154980@qq.com>
2025-06-30 16:51:20 +08:00
..
2025-04-22 08:57:25 +08:00
2025-06-28 16:14:49 +08:00
2025-06-23 22:03:38 +08:00
2025-06-27 09:14:43 +08:00