xc-llm-ascend

Files

Wang Yixuan a7b40b09eb [BugFix]fix deepseek torchair recompile (#3678 )

### What this PR does / why we need it?
The #3624 PR fix the precision of deepseek torchair, but don't consider
the limitation of torch compile which results in the recompile, This PR
fixs this problem

### Does this PR introduce _any_ user-facing change?
No

### How was this patch tested?

- vLLM version: v0.11.0rc3
- vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0

Signed-off-by: hust17yixuan <303660421@qq.com>

2025-10-23 22:53:01 +08:00

models

perf : optimize memory for deepseek mtp (#2713 )

2025-10-23 15:52:17 +08:00

ops

[BugFix]fix deepseek torchair recompile (#3678 )

2025-10-23 22:53:01 +08:00

quantization

[Feat][quantization] Support new version w4a8 dynamic quantization for Linear layers (#3311 )

2025-10-21 20:18:39 +08:00

__init__.py

[1/4][Refactor] Refactor torchair worker (#1885 )