[Test] Add initial multi modal cases of Qwen2.5-VL-7B-Instruct for disaggregated encoder (#5301)
### What this PR does / why we need it? This PR adds disaggregated encoder tests for Qwen2.5-VL-7B-Instruct ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? by running the test by running ci - vLLM version: release/v0.12.0 --------- Signed-off-by: wangyu31577 <wangyu31577@hundsun.com> Signed-off-by: wangyu <53896905+yenuo26@users.noreply.github.com> Co-authored-by: wangyu31577 <wangyu31577@hundsun.com>
This commit is contained in:
@@ -126,6 +126,9 @@ jobs:
|
||||
- name: qwen2-5-vl-7b
|
||||
os: linux-aarch64-a3-4
|
||||
tests: tests/e2e/nightly/single_node/models/test_qwen2_5_vl_7b.py
|
||||
- name: qwen2-5-vl-7b-epd
|
||||
os: linux-aarch64-a3-4
|
||||
tests: tests/e2e/nightly/single_node/models/test_qwen2_5_vl_7b_epd.py
|
||||
- name: qwen2-5-vl-32b
|
||||
os: linux-aarch64-a3-4
|
||||
tests: tests/e2e/nightly/single_node/models/test_qwen2_5_vl_32b.py
|
||||
|
||||
2
.github/workflows/scripts/config.yaml
vendored
2
.github/workflows/scripts/config.yaml
vendored
@@ -126,6 +126,8 @@ e2e-multicard-2-cards:
|
||||
estimated_time: 1050
|
||||
- name: tests/e2e/multicard/2-cards/test_single_request_aclgraph.py
|
||||
estimated_time: 215
|
||||
- name: tests/e2e/multicard/2-cards/test_disaggregated_encoder.py
|
||||
estimated_time: 90
|
||||
|
||||
e2e-multicard-4-cards:
|
||||
# TODO: recover skipped tests
|
||||
|
||||
Reference in New Issue
Block a user