xc-llm-ascend/tests/e2e/singlecard/models/configs/Qwen3-30B-A3B.yaml

model_name: "Qwen/Qwen3-30B-A3B"
tasks:
- name: "gsm8k"
  metrics:
  - name: "exact_match,strict-match"
    value: 0.89
  - name: "exact_match,flexible-extract"
    value: 0.85
- name: "ceval-valid"
  metrics:
  - name: "acc,none"
    value: 0.84
num_fewshot: 5
gpu_memory_utilization: 0.6
enable_expert_parallel: True
tensor_parallel_size: 2
apply_chat_template: False
fewshot_as_multiturn: False
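
The expected metric values above could be checked against a measured run roughly as follows. This is a minimal sketch, not the repository's test code: `check_accuracy`, the `EXPECTED` dict, and the `RTOL` tolerance are hypothetical names and values introduced for illustration. The dict is written inline to keep the example dependency-free; a real harness would presumably load it from the YAML file.

```python
import math

# Expected values copied from the config above. In practice they would be
# read from the YAML file rather than hard-coded (assumption).
EXPECTED = {
    "gsm8k": {
        "exact_match,strict-match": 0.89,
        "exact_match,flexible-extract": 0.85,
    },
    "ceval-valid": {"acc,none": 0.84},
}

# Hypothetical relative tolerance; the source does not state one.
RTOL = 0.03


def check_accuracy(measured, expected=EXPECTED, rtol=RTOL):
    """Return (task, metric, expected, measured) tuples that miss the tolerance.

    A metric absent from `measured` is reported with a measured value of None.
    math.isclose is symmetric, so scores far above the expected value are
    flagged as well as scores far below it.
    """
    failures = []
    for task, metrics in expected.items():
        for metric, want in metrics.items():
            got = measured.get(task, {}).get(metric)
            if got is None or not math.isclose(got, want, rel_tol=rtol):
                failures.append((task, metric, want, got))
    return failures
```

An empty return value means every listed metric landed within the tolerance; any non-empty list can be turned into a failing assertion in a test.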

Enable pytest and yaml style accuracy test (#2073)

### What this PR does / why we need it?

This PR enabled pytest and yaml style accuracy test, users now can enable accuracy test by running:

```bash
cd ~/vllm-ascend
pytest -sv ./tests/e2e/singlecard/models/test_lm_eval_correctness.py \
    --config ./tests/e2e/singlecard/models/configs/Qwen3-8B-Base.yaml \
    --report_output ./benchmarks/accuracy/Qwen3-8B-Base.md

pytest -sv ./tests/e2e/singlecard/models/test_lm_eval_correctness.py \
    --config-list-file ./tests/e2e/singlecard/models/configs/accuracy.txt
```

Closes: https://github.com/vllm-project/vllm-ascend/issues/1970

### Does this PR introduce _any_ user-facing change?

no

### How was this patch tested?

- vLLM version: v0.10.0
- vLLM main: https://github.com/vllm-project/vllm/commit/2836dd73f13015ee386c544760ca0d16888203f3

---------

Signed-off-by: Icey <1790571317@qq.com>

2025-07-31 21:39:13 +08:00