[feature] support pcp + mtp in full graph (#4572)
1. support pcp + mtp in full graph
2. pcp/dcp related mtp bugfix
3. support pcp + mtpx
- vLLM version: v0.12.0
- vLLM main:
ad32e3e19c
Signed-off-by: zhangsicheng5 <zhangsicheng5@huawei.com>
This commit is contained in:
1
.github/workflows/_e2e_test.yaml
vendored
1
.github/workflows/_e2e_test.yaml
vendored
@@ -269,6 +269,7 @@ jobs:
|
||||
pytest -sv --durations=0 tests/e2e/multicard/test_data_parallel_tp2.py
|
||||
pytest -sv --durations=0 tests/e2e/multicard/long_sequence/test_basic.py
|
||||
pytest -sv --durations=0 tests/e2e/multicard/long_sequence/test_accuracy.py
|
||||
pytest -sv --durations=0 tests/e2e/multicard/long_sequence/test_mtp.py
|
||||
|
||||
- name: Install Ascend toolkit & triton_ascend (for Qwen3-Next-80B-A3B-Instruct)
|
||||
shell: bash -l {0}
|
||||
|
||||
Reference in New Issue
Block a user