xc-llm-ascend/Qwen3-VL-8B-Instruct.yaml at fff258bce17e05d787bfc5a225ed9d93db210f24 - xc-llm-ascend - Gitea: Git with a cup of tea

EngineX/xc-llm-ascend

Files

ZengSilong dc1a6cb503 [Test]Add accuracy test for multiple models (#3823 )

### What this PR does / why we need it?
Add accuracy test for multiple models：
- Meta_Llama_3.1_8B_Instruct
- Qwen2.5-Omni-7B
- Qwen3-VL-8B-Instruct

- vLLM version: v0.11.0
- vLLM main:
83f478bb19

---------

Signed-off-by: MrZ20 <2609716663@qq.com>

2025-11-04 14:46:39 +08:00

12 lines

223 B

YAML

Raw Blame History

 model_name: "Qwen/Qwen3-VL-8B-Instruct"
 hardware: "Atlas A2 Series"
 model: "vllm-vlm"
 tasks:
 - name: "mmmu_val"
   metrics:
   - name: "acc,none"
     value: 0.55
 max_model_len: 8192
 batch_size: 32
 gpu_memory_utilization: 0.7