xc-llm-ascend

Files

whx bd11c0054f [BugFix] Fix torchair+mtp bug after deleting deepseek_mtp. (#3590)

This fixes a bug introduced by PR #3561 that had been left unfixed.

- vLLM version: v0.11.0rc3
- vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0

---------

Signed-off-by: whx-sjtu <2952154980@qq.com>

2025-10-21 22:23:52 +08:00

models

[BugFix] Fix torchair+mtp bug after deleting deepseek_mtp. (#3590)

2025-10-21 22:23:52 +08:00

ops

[feat][torchair] support super kernel feat for quantized dsr1 (#3485)

2025-10-20 20:04:37 +08:00

quantization

[Feat][quantization] Support new version w4a8 dynamic quantization for Linear layers (#3311)

2025-10-21 20:18:39 +08:00

__init__.py

[1/4][Refactor] Refactor torchair worker (#1885)

2025-07-21 11:50:46 +08:00

torchair_attention.py

[Feat][Graph] Support FULL_DECODE_ONLY mode for GQA/MHA models (#2128)

2025-09-22 17:14:28 +08:00

torchair_mla.py

[Model][1/N] Delete deepseek v2/v3 modeling codes. (#3189)

2025-10-20 15:31:34 +08:00

torchair_model_runner.py

[BugFix]Fix mtp torchair bug caused by #2719 (#3566)

2025-10-21 22:21:44 +08:00

torchair_sfa.py

[Refactor] Adapt deepseek-v3.2 to vllm 0.11.0 (#3432)

2025-10-15 17:48:58 +08:00

torchair_worker.py

[CI] Upgrade vllm to newest commit (#3182)

2025-09-26 06:18:15 +08:00

utils.py

[feat][torchair] support super kernel feat for quantized dsr1 (#3485)

2025-10-20 20:04:37 +08:00