[Test] Temporarily skip flaky ACL graph test (#3577)
### What this PR does / why we need it? Disables `FULL_DECODE_ONLY` end-to-end test that fails intermittently. This prevents CI blockages while the root cause of the flakiness is investigated. ### Does this PR introduce _any_ user-facing change? None. ### How was this patch tested? None needed. - vLLM version: v0.11.0rc3 - vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0 Signed-off-by: Yizhou Liu <liu_yizhou@outlook.com>
This commit is contained in:
@@ -100,6 +100,8 @@ def test_models_with_aclgraph(
|
||||
)
|
||||
|
||||
|
||||
@pytest.mark.skip("Skipping this test for now, "
|
||||
"it fails intermittently and needs investigation.")
|
||||
@pytest.mark.parametrize("model", MODELS)
|
||||
@pytest.mark.parametrize("max_tokens", [5])
|
||||
def test_models_with_aclgraph_full_decode_only(
|
||||
|
||||
Reference in New Issue
Block a user