xc-llm-ascend/gemma-3-4b-it.yaml at 70606e0bb93cc23c1f8d5dfb1b681bd24e66d2ab - xc-llm-ascend - Gitea: Git with a cup of tea

EngineX/xc-llm-ascend

Files

SILONG ZENG 70606e0bb9 [Test]update accuracy test of models (#4911 )

### What this PR does / why we need it?
Delete accuracy tests for models that are no longer retained：
- Meta-Llama-3.1-8B-Instruct
- llava-1.5-7b-hf
- InternVL2-8B.yaml
- InternVL2_5-8B.yaml
- InternVL3-8B.yaml

Add accuracy tests for the new models：
- Llama-3.2-3B-Instruct
- llava-onevision-qwen2-0.5b-ov-hf
- Qwen3-VL-30B-A3B-Instruct

- vLLM version: v0.12.0
- vLLM main:
ad32e3e19c

---------

Signed-off-by: MrZ20 <2609716663@qq.com>

2025-12-15 15:04:20 +08:00

15 lines

331 B

YAML

Raw Blame History

 model_name: "LLM-Research/gemma-3-4b-it"
 hardware: "Atlas A2 Series"
 tasks:
 - name: "gsm8k"
   metrics:
   - name: "exact_match,strict-match"
     value: 0.59
   - name: "exact_match,flexible-extract"
     value: 0.59
 num_fewshot: 5
 apply_chat_template: False
 fewshot_as_multiturn: False
 gpu_memory_utilization: 0.7
 enforce_eager: True