xc-llm-ascend

Files

whx bd11c0054f [BugFix] Fix torchair+mtp bug after deleting deepseek_mtp. (#3590 )

This is a missing bug fix introduced by PR #3561

- vLLM version: v0.11.0rc3
- vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0

---------

Signed-off-by: whx-sjtu <2952154980@qq.com>

2025-10-21 22:23:52 +08:00

__init__.py

[1/N][refactor] torchair deepseek modeling refactor (#2384 )

2025-08-18 15:00:37 +08:00

qwen2.py

[KVCache][Bugfix] Fix kv cache initialization error of attention layer (#3113 )

2025-09-24 11:32:34 +08:00

qwen3_moe.py

[MoE] [Refactor] Combine common_fused_moe and fused_moe (#3176 )

2025-10-09 14:12:46 +08:00

torchair_deepseek_mtp.py

[BugFix] Fix torchair+mtp bug after deleting deepseek_mtp. (#3590 )

2025-10-21 22:23:52 +08:00

torchair_deepseek_v2.py

[feat][torchair] support super kernel feat for quantized dsr1 (#3485 )

2025-10-20 20:04:37 +08:00

torchair_deepseek_v3.py

[1/N][refactor] torchair deepseek modeling refactor (#2384 )

2025-08-18 15:00:37 +08:00

torchair_pangu_moe.py

[KVCache][Bugfix] Fix kv cache initialization error of attention layer (#3113 )

2025-09-24 11:32:34 +08:00