[CI] optimize lint term (#5986)

### What this PR does / why we need it?
This patch aims to optimize the lint check. The main idea is to reduce unnecessary installation time.
1. Installing vllm is not required; appending the path of the vllm source to `PYTHONPATH` is sufficient.
2. Installing `requirements-dev.txt` is not required; we have a pre-built image `quay.io/ascend-ci/vllm-ascend:lint` with all the requirements installed in advance. **NOTE**: image builds are triggered by: 1) a daily scheduled build; 2) a build when requirements are modified; 3) a manual build. This keeps the dependencies in our image as up to date as possible.
3. `mypy` was separated from the `pre-commit` hook for performance reasons; we found that integrating `mypy` into the `pre-commit` hook resulted in poor performance.
4. Reduce CPU core consumption from 16 to 8.

### Does this PR introduce _any_ user-facing change?
The end-to-end lint time was reduced from 20 min per PR to 8 min per PR.

### How was this patch tested?
- vLLM version: v0.13.0
- vLLM main: 2c24bc6996

---------

Signed-off-by: wangli <wangli858794774@gmail.com>
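
The idea behind item 1 (making a source checkout importable through `PYTHONPATH` instead of installing it) can be illustrated with a minimal sketch. This is not the CI workflow from this PR; `demo_pkg`, `env_with_source_path`, and `can_import` are hypothetical names used only for illustration, and a temporary directory stands in for the vllm source tree.

```python
import os
import subprocess
import sys
import tempfile
from pathlib import Path


def env_with_source_path(src_dir: str) -> dict:
    """Return a copy of the environment with src_dir prepended to PYTHONPATH."""
    env = dict(os.environ)
    existing = env.get("PYTHONPATH")
    env["PYTHONPATH"] = src_dir if not existing else src_dir + os.pathsep + existing
    return env


def can_import(module: str, src_dir: str) -> bool:
    """Check, in a fresh interpreter, whether `module` imports with src_dir on PYTHONPATH."""
    result = subprocess.run(
        [sys.executable, "-c", f"import {module}"],
        env=env_with_source_path(src_dir),
        capture_output=True,
    )
    return result.returncode == 0


# Hypothetical stand-in for a source checkout: a package that was never installed.
checkout = tempfile.mkdtemp()
pkg = Path(checkout) / "demo_pkg"
pkg.mkdir()
(pkg / "__init__.py").write_text("VALUE = 1\n")

importable = can_import("demo_pkg", checkout)
```

Tools that only need to resolve imports from source, such as a type checker, can work this way without a full install step, which is presumably where the time saving comes from.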
2026-01-22 15:46:59 +08:00
parent 9bba0a2a68
commit 484e7c59dc
10 changed files with 196 additions and 77 deletions
--- a/vllm_ascend/attention/attention_v1.py
+++ b/vllm_ascend/attention/attention_v1.py
@@ -46,9 +46,7 @@ from vllm_ascend.device.device_op import DeviceOperator
 from vllm_ascend.ops.flashcomm2_oshard_manager import flashcomm2_oshard_manager
 from vllm_ascend.utils import vllm_version_is, weak_ref_tensors

-# isort: off
 if vllm_version_is("0.13.0"):
-    from vllm.v1.attention.backends.utils import AttentionCGSupport, AttentionMetadataBuilder
    from vllm.attention.backends.abstract import (  # type: ignore
        AttentionBackend,
        AttentionImpl,
@@ -59,20 +57,21 @@ if vllm_version_is("0.13.0"):
        AttentionBackendEnum,
        register_backend,
    )
+    from vllm.v1.attention.backends.utils import AttentionCGSupport, AttentionMetadataBuilder
 else:
    from vllm.v1.attention.backend import (  # type: ignore
        AttentionBackend,
        AttentionCGSupport,
        AttentionImpl,
        AttentionLayer,
-        AttentionType,
        AttentionMetadataBuilder,
+        AttentionType,
    )
    from vllm.v1.attention.backends.registry import (  # type: ignore
        AttentionBackendEnum,
        register_backend,
    )
-# isort: on
+

 # default max value of sliding window size
 SWA_INT_MAX = 2147483647