[Feat] allow using aclgraph in ray backend (#2589)

### What this PR does / why we need it? Allow using aclgraph in ray backend, for tp + pp + aclgraph in multi machine ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? - vLLM version: v0.10.1.1 - vLLM main: 4ba0c587ba Signed-off-by: withHades <244036962@qq.com>
2025-09-04 11:45:56 +08:00
parent aff5189c87
commit 0c0789be74
2 changed files with 0 additions and 36 deletions
--- a/vllm_ascend/platform.py
+++ b/vllm_ascend/platform.py
@@ -185,12 +185,6 @@ class NPUPlatform(Platform):
                    "and use_cached_kv_cache_bytes in torchair_graph_config.")
                delete_torchair_cache_file()

-        if parallel_config.distributed_executor_backend == "ray":
-            logger.warning(
-                "Ray distributed executor backend is not compatible with ACL Graph mode "
-                "right now. Setting CUDAGraphMode to NONE")
-            compilation_config.cudagraph_mode = CUDAGraphMode.NONE
-
        # set cudaprah sizes before extending `compilation_config.splitting_ops`
        vllm_config._set_cudagraph_sizes()