[Test] Add e2e test cases for the Qwen-VL model adaptation to Ascend 310p (#6977)

### What this PR does / why we need it?
Add e2e test cases for the Qwen-VL model adaptation to Ascend 310p, and run the new single-card and multi-card VL tests in the 310p e2e CI steps.

- vLLM version: v0.16.0
- vLLM main: 15d76f74e2

Signed-off-by: gcw_61wqY8cy <wanghengkang1@huawei.com>
wanghengkang
2026-03-06 14:25:10 +08:00
committed by GitHub
parent 620076b76a
commit c49ce18ea5
6 changed files with 141 additions and 2 deletions

@@ -517,7 +517,8 @@ jobs:
PYTORCH_NPU_ALLOC_CONF: max_split_size_mb:256
VLLM_WORKER_MULTIPROC_METHOD: spawn
run: |
-pytest -sv --durations=0 tests/e2e/310p/singlecard/test_dense_model_singlecard.py
+pytest -sv --durations=0 tests/e2e/310p/singlecard/test_dense_model_singlecard.py \
+tests/e2e/310p/singlecard/test_vl_model_singlecard.py
e2e_310p-4cards:
name: 310p multicards 4cards
@@ -577,4 +578,5 @@ jobs:
run: |
pytest -sv --durations=0 \
tests/e2e/310p/multicard/test_dense_model_multicard.py \
-tests/e2e/310p/multicard/test_moe_model_multicard.py
+tests/e2e/310p/multicard/test_moe_model_multicard.py \
+tests/e2e/310p/multicard/test_vl_model_multicard.py