Files
xc-llm-ascend/vllm_ascend
zouyida2052 ee0a95e47f bugfix for mtp when running torchair in a2 (#3354)
### What this PR does / why we need it?
when ops torchair_fused_experts_with_mc2 is called, we need pass a tp
group, but now it only pass when quantized scenario, we need also pass
it when unquantized.

- vLLM version: v0.11.0rc3
- vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0

Signed-off-by: zouyida2052 <zouyida2002@gmail.com>
2025-10-10 23:07:24 +08:00
..
2025-09-23 10:27:14 +08:00
2025-10-10 14:09:53 +08:00
2025-10-09 10:28:38 +08:00
2025-10-09 19:22:46 +08:00
2025-09-30 03:25:58 +08:00
2025-10-10 14:09:53 +08:00