[BugFix][Fusion] Fix graph fusion failure problem (#5676)
The vLLM pull request
(https://github.com/vllm-project/vllm/pull/24252) broke operator
fusion. The failure was previously worked around by patching the backend.
The root cause has now been identified, and this pull request resolves
it properly.
- vLLM version: v0.13.0
- vLLM main:
2f4e6548ef
---------
Signed-off-by: wxsIcey <1790571317@qq.com>
@@ -20,6 +20,7 @@ import torch._inductor.pattern_matcher as pm
 from torch._inductor.pattern_matcher import PatternMatcherPass
 from vllm.compilation.vllm_inductor_pass import VllmInductorPass
 from vllm.config import VllmConfig
+from vllm.config.compilation import Range
 from vllm.logger import logger
 
 
@@ -308,7 +309,7 @@ class AddRMSNormQuantFusionPass(VllmInductorPass):
         logger.debug("Replaced %s patterns", self.matched_count)
         self.end_and_log()
 
-    def is_applicable(self, runtime_shape: int | None = None) -> bool:
+    def is_applicable_for_range(self, compile_range: Range) -> bool:
         """
         Check if the pass is applicable for the current configuration.
         """
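The diff above renames the pass's applicability hook from a per-shape check (`is_applicable(runtime_shape)`) to a per-range check (`is_applicable_for_range(compile_range)`), matching the interface change in the upstream vLLM PR. A minimal sketch of the new shape of the hook, using a hypothetical stand-in `Range` dataclass (the real class lives in `vllm.config.compilation`) and an illustrative range check rather than the pass's actual logic:

```python
from dataclasses import dataclass


@dataclass
class Range:
    # Hypothetical stand-in for vllm.config.compilation.Range:
    # assumed to carry the bounds of a compiled shape range.
    start: int
    end: int


class FusionPassSketch:
    """Illustrates only the renamed applicability hook, not the real pass."""

    def is_applicable_for_range(self, compile_range: Range) -> bool:
        # Old interface received a single runtime_shape (or None);
        # the new interface receives a whole compile-time shape range,
        # so the pass decides applicability for the range at once.
        return compile_range.start >= 1
```

With this signature the inductor pass framework can ask once per compiled range whether the fusion applies, instead of once per concrete shape.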