xc-llm-ascend

Files

ChenCangtao fa56abea9f [bugfix][npugraph_ex]duplicate pattern issue (#6513 )

### What this PR does / why we need it?
When the draft model also uses vllmbackend for graph compilation, the
fusion pass registration occurs again, resulting in errors due to
duplicate patterns.
### Does this PR introduce _any_ user-facing change?

### How was this patch tested?

- vLLM version: v0.15.0
- vLLM main: https://github.com/vllm-project/vllm/commit/v0.15.0

---------

Signed-off-by: chencangtao <chencangtao@huawei.com>
Co-authored-by: chencangtao <chencangtao@huawei.com>

2026-02-04 08:49:13 +08:00

__init__.py

[Graph][Fusion] Add QKVNormRope and QKVNormRopeWithBias (#5721 )

2026-01-22 17:22:41 +08:00

npugraph_ex_utils_check.py

[bugfix][npugraph_ex]duplicate pattern issue (#6513 )

2026-02-04 08:49:13 +08:00