Files
xc-llm-ascend/vllm_ascend
Jade Zheng 40bd602485 [Feature] Use reshape_and_cache fused op (#706)
Replace torch function with reshape_and_cache fused op for better
performance. The `reshape_and_cache` function wasn't working because it
expected torch.int32 tensor, but a torch.int64 tensor was provided.

Signed-off-by: Jade Zheng <zheng.shoujian@outlook.com>
2025-04-28 21:54:42 +08:00
..
2025-04-22 08:57:25 +08:00
2025-04-23 20:56:24 +08:00