### What this PR does / why we need it?
Refactor the e2e testcases.
- tests/e2e/multicard/test_weight_loader.py: Remove the unused code.
- tests/e2e/singlecard/multi-modal/test_internvl.py: Move to accuracy
test.
- tests/e2e/singlecard/test_aclgraph.py: Rename the file.
- tests/e2e/singlecard/test_embedding_aclgraph.py : Combine with
tests/e2e/singlecard/test_bge_model.py
- tests/e2e/singlecard/test_completion_with_prompt_embeds.py: Delete
eager mode and modify model to Qwen3-0.6B
- tests/e2e/singlecard/test_quantization.py: Modify model to
Qwen3-0.6B-W8A8
- tests/e2e/singlecard/test_vlm.py: Modify model to Qwen3-VL-8B
- vLLM version: v0.12.0
- vLLM main:
ad32e3e19c
---------
Signed-off-by: menogrey <1299267905@qq.com>
18 lines
366 B
Plaintext
18 lines
366 B
Plaintext
Qwen3-30B-A3B.yaml
|
|
Qwen3-8B.yaml
|
|
Qwen2-7B.yaml
|
|
Qwen2-Audio-7B-Instruct.yaml
|
|
Qwen3-VL-30B-A3B-Instruct.yaml
|
|
Qwen3-VL-8B-Instruct.yaml
|
|
Qwen2.5-Omni-7B.yaml
|
|
Meta-Llama-3.1-8B-Instruct.yaml
|
|
InternVL2-8B.yaml
|
|
InternVL2_5-8B.yaml
|
|
InternVL3-8B.yaml
|
|
InternVL3_5-8B.yaml
|
|
ERNIE-4.5-21B-A3B-PT.yaml
|
|
gemma-3-4b-it.yaml
|
|
internlm-7b.yaml
|
|
Molmo-7B-D-0924.yaml
|
|
llava-1.5-7b-hf.yaml
|