xc-llm-ascend

Files

zzzzwwjj 052e472453 [bugfix] fix w8a8dynamic fused_moe trans nz (#5199 )

### What this PR does / why we need it?
Currently, `torch_npu.npu_grouped_matmul_swiglu_quant` can only support
weight nz, so we need to trans w13_weight, w2_weight to nz forcely.

### Does this PR introduce _any_ user-facing change?

### How was this patch tested?

- vLLM version: v0.12.0
- vLLM main:
ad32e3e19c

Signed-off-by: zzzzwwjj <1183291235@qq.com>

2025-12-22 17:45:34 +08:00

e2e

[feature] support pcp + mtp in full graph (#4572 )

2025-12-22 16:13:39 +08:00

[bugfix] fix w8a8dynamic fused_moe trans nz (#5199 )

2025-12-22 17:45:34 +08:00

__init__.py

[SpecDecode] Add spec decode support (#500 )

2025-04-17 20:16:32 +08:00