[E2E] add E2E for Prefix Caching cp & Chunked Prefill cp (#5149)
### What this PR does / why we need it?
Add E2E for Prefix Caching cp & Chunked Prefill cp
### Does this PR introduce _any_ user-facing change?
No.
### How was this patch tested?
- vLLM version: v0.15.0
- vLLM main:
ad32e3e19c
---------
Signed-off-by: F.Liu <liufeng248@huawei.com>
Signed-off-by: Feng Liu <46866849+ader47@users.noreply.github.com>
Co-authored-by: F.Liu <liufeng248@huawei.com>
This commit is contained in:
4
.github/workflows/scripts/config.yaml
vendored
4
.github/workflows/scripts/config.yaml
vendored
@@ -139,7 +139,9 @@ e2e-multicard-4-cards:
|
||||
estimated_time: 60
|
||||
- name: tests/e2e/multicard/4-cards/long_sequence/test_basic.py
|
||||
estimated_time: 60
|
||||
- name: tests/e2e/multicard/4-cards/long_sequence/test_chunked_prefill.py
|
||||
- name: tests/e2e/multicard/4-cards/long_sequence/test_chunked_prefill_cp.py
|
||||
estimated_time: 60
|
||||
- name: tests/e2e/multicard/4-cards/long_sequence/test_prefix_caching_cp.py
|
||||
estimated_time: 60
|
||||
- name: tests/e2e/multicard/4-cards/long_sequence/test_mtp.py
|
||||
estimated_time: 60
|
||||
|
||||
Reference in New Issue
Block a user