xc-llm-ascend/vllm_ascend
Feng Liu 49838d4bec [Perf] vectorize PCP/DCP loops in attention_cp.py (#4944)
### What this PR does / why we need it?
- Add explicit `.contiguous()` after permute/view so the resulting tensors
have a contiguous memory layout
- Replace nested PCP/DCP Python loops with fully vectorized tensor
operations
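
As an illustration of the two changes above, here is a minimal sketch in PyTorch. It is not the code from `attention_cp.py`: the function names, the tensor shapes, and the round-robin token assignment across PCP/DCP ranks are all assumptions made for the example. It shows a nested per-rank Python loop, an equivalent broadcasted version, and a `.contiguous()` call after `permute`.

```python
import torch


def local_lens_loop(seq_lens: torch.Tensor, pcp_size: int, dcp_size: int) -> torch.Tensor:
    """Hypothetical baseline: one Python iteration per (pcp, dcp) rank pair."""
    world = pcp_size * dcp_size
    out = torch.zeros(pcp_size, dcp_size, seq_lens.numel(), dtype=seq_lens.dtype)
    for i in range(pcp_size):
        for j in range(dcp_size):
            rank = i * dcp_size + j
            # Assumed layout: token t belongs to rank (t % world), so this
            # counts the tokens of each sequence that land on `rank`.
            out[i, j] = torch.div(seq_lens - rank + world - 1, world, rounding_mode="floor")
    return out


def local_lens_vectorized(seq_lens: torch.Tensor, pcp_size: int, dcp_size: int) -> torch.Tensor:
    """Same result without Python loops, using broadcasting over a rank grid."""
    world = pcp_size * dcp_size
    # ranks[i, j, 0] == i * dcp_size + j
    ranks = torch.arange(world, dtype=seq_lens.dtype).view(pcp_size, dcp_size, 1)
    return torch.div(seq_lens.view(1, 1, -1) - ranks + world - 1, world, rounding_mode="floor")


def to_batch_major(x: torch.Tensor) -> torch.Tensor:
    """Move the batch dimension first and materialize a contiguous copy.

    permute() only returns a strided view; .contiguous() makes the memory
    layout match the new dimension order.
    """
    return x.permute(2, 0, 1).contiguous()


if __name__ == "__main__":
    lens = torch.tensor([0, 1, 5, 8])
    per_rank = local_lens_vectorized(lens, pcp_size=2, dcp_size=2)
    assert torch.equal(per_rank, local_lens_loop(lens, 2, 2))
    print(to_batch_major(per_rank).shape)
```

The vectorized version replaces `pcp_size * dcp_size` small tensor operations with a single broadcasted one, which tends to matter when each small operation carries its own launch overhead.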

- vLLM version: v0.12.0
- vLLM main: ad32e3e19c

---------

Signed-off-by: F.Liu <liufeng248@huawei.com>
Co-authored-by: F.Liu <liufeng248@huawei.com>
2025-12-22 11:06:19 +08:00