xc-llm-ascend

Files

linfeng-yuan 0ca3f48c90 [2/N][refactor] torchair deepseek mla backend refactor (#2459 )

### What this PR does / why we need it?
This PR move current unified mla backend to torchair folder and remove
torchair-related code in attention/mla_v1.py (1.3k -> 0.9k).

 
### Does this PR introduce _any_ user-facing change?
No.
### How was this patch tested?
Running eager mode with mla backend, and torchair mode with code before
[2445](https://github.com/vllm-project/vllm-ascend/pull/2445)


- vLLM version: v0.10.0
- vLLM main:
f571ff8eb6

Signed-off-by: linfeng-yuan <1102311262@qq.com>

2025-08-21 14:02:30 +08:00

test_attention_mask.py

[Bugfix] Fix accuracy problem caused by mask pollution (#1678 )

2025-07-10 14:06:49 +08:00

test_attention_v1.py

Fix some ci issue and refactor modelrunner (#2445 )

2025-08-20 09:01:04 +08:00

test_mla_v1.py

[2/N][refactor] torchair deepseek mla backend refactor (#2459 )

2025-08-21 14:02:30 +08:00