xc-llm-ascend

Files

Icey c929bd1e8d [Fusion] [Graph]Add Matmul Allreduce Rmsnorm fusion Pass (#5034 )

This PR add `MatmulAllreduceRmsnorm` operator and introduces a graph
fusion pass for `matmul_allreduce_rmsnorm` operations. The
implementation includes a new configuration flag, a pattern matching
pass using `torch._inductor.pattern_matcher`.

Co-authored-by: Trunrain [270250579@qq.com](mailto:270250579@qq.com)

- vLLM version: v0.12.0
- vLLM main:
ad32e3e19c

---------

Signed-off-by: wxsIcey <1790571317@qq.com>
Signed-off-by: tongrunze <t00574058@china.huawei.com>

2026-01-19 09:28:07 +08:00

platform

[Feature]: Support 310P device run qwen2.5/3 dense and qwen2.5vl models (#5776 )

2026-01-17 11:49:18 +08:00

worker

[Fusion] [Graph]Add Matmul Allreduce Rmsnorm fusion Pass (#5034 )

2026-01-19 09:28:07 +08:00

__init__.py

[Fusion] [Graph]Add Matmul Allreduce Rmsnorm fusion Pass (#5034 )

2026-01-19 09:28:07 +08:00