### What this PR does / why we need it?
Adds an accuracy test config for `Qwen/Qwen3-VL-30B-A3B-Instruct` (the `mmmu_val` task, expected `acc,none` of 0.58) on Atlas A2 Series hardware.
### Does this PR introduce _any_ user-facing change?
### How was this patch tested?
- vLLM version: v0.12.0
- vLLM main: ad32e3e19c
---------
Signed-off-by: 李少鹏 <lishaopeng21@huawei.com>
```yaml
model_name: "Qwen/Qwen3-VL-30B-A3B-Instruct"
hardware: "Atlas A2 Series"
model: "vllm-vlm"
tasks:
  - name: "mmmu_val"
    metrics:
      - name: "acc,none"
        value: 0.58
tensor_parallel_size: 2
gpu_memory_utilization: 0.7
enable_expert_parallel: True
```
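A config like this is typically consumed by an accuracy gate that compares a measured benchmark score against the expected `value` within some tolerance. Below is a minimal sketch of that idea in plain Python; the `check_metric` helper, the embedded dict, and the 3% tolerance are illustrative assumptions, not part of the actual test harness.

```python
# Hedged sketch: an accuracy gate over this config's expected value.
# The config is inlined as a dict here for self-containment; a real
# harness would load the YAML file instead.

EXPECTED = {
    "model_name": "Qwen/Qwen3-VL-30B-A3B-Instruct",
    "tasks": [
        {
            "name": "mmmu_val",
            "metrics": [{"name": "acc,none", "value": 0.58}],
        }
    ],
}

RTOL = 0.03  # assumed relative tolerance, not taken from the source


def check_metric(measured: float, expected: float, rtol: float = RTOL) -> bool:
    """Pass if the measured score is within rtol of the expected baseline."""
    return abs(measured - expected) <= rtol * expected


baseline = EXPECTED["tasks"][0]["metrics"][0]["value"]
# A measured MMMU accuracy of 0.57 lands within 3% of the 0.58 baseline.
print(check_metric(0.57, baseline))  # → True
print(check_metric(0.40, baseline))  # → False
```

A relative tolerance (rather than exact equality) is the usual choice here because benchmark scores vary slightly across hardware and runtime versions.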