xc-llm-ascend/Qwen3-8B-Base.yaml at de7649492ddcbdb7c818665f0b81cc8fbaaaa4b7

EngineX/xc-llm-ascend

Icey 0bd5ff5299 Fix accuracy test config and add DeepSeek-V2-Lite test (#2261)

### What this PR does / why we need it?
This PR fixes the accuracy test related to
https://github.com/vllm-project/vllm-ascend/pull/2073. Users can now
run accuracy tests on multiple models in a single invocation and
generate different report files by running:

```bash
cd ~/vllm-ascend
pytest -sv ./tests/e2e/models/test_lm_eval_correctness.py \
          --config-list-file ./tests/e2e/models/configs/accuracy.txt
```
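
As a rough illustration of what a config list file could look like to the test harness, the sketch below reads one. The format is an assumption, not taken from this PR: one config filename per line, resolved relative to the list file's directory, with blank lines and `#` comments skipped. The helper name `read_config_list` is hypothetical.

```python
from pathlib import Path


def read_config_list(list_file):
    """Return the config paths named in list_file.

    Assumed format (not confirmed by the repository): one filename per
    line, relative to the directory that contains the list file.
    """
    list_path = Path(list_file)
    configs = []
    for line in list_path.read_text().splitlines():
        entry = line.strip()
        # Skip blank lines and comment lines.
        if not entry or entry.startswith("#"):
            continue
        configs.append(list_path.parent / entry)
    return configs
```

Each returned path could then be used to parametrize one test case per model, which would be one way to produce a separate report for each.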

### Does this PR introduce _any_ user-facing change?
No.

### How was this patch tested?
<img width="1648" height="511" alt="image"
src="https://github.com/user-attachments/assets/1757e3b8-a6b7-44e5-b701-80940dc756cd"
/>


- vLLM version: v0.10.0
- vLLM main: 766bc8162c

---------

Signed-off-by: Icey <1790571317@qq.com>

2025-08-08 11:09:16 +08:00

model_name: "Qwen/Qwen3-8B-Base"
tasks:
- name: "gsm8k"
  metrics:
  - name: "exact_match,strict-match"
    value: 0.82
  - name: "exact_match,flexible-extract"
    value: 0.83
- name: "ceval-valid"
  metrics:
  - name: "acc,none"
    value: 0.82
num_fewshot: 5
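
A minimal sketch of how expected values like these might be compared against measured results. The dict mirrors the file above as a YAML loader would return it. The function name `check_accuracy`, the shape of the `measured` input, and the 3% relative tolerance are all assumptions made for illustration, not the repository's actual test logic.

```python
import math

# The config above, written as the plain dict a YAML loader would return.
CONFIG = {
    "model_name": "Qwen/Qwen3-8B-Base",
    "tasks": [
        {"name": "gsm8k", "metrics": [
            {"name": "exact_match,strict-match", "value": 0.82},
            {"name": "exact_match,flexible-extract", "value": 0.83},
        ]},
        {"name": "ceval-valid", "metrics": [
            {"name": "acc,none", "value": 0.82},
        ]},
    ],
    "num_fewshot": 5,
}

RTOL = 0.03  # hypothetical tolerance, not taken from the repository


def check_accuracy(config, measured, rtol=RTOL):
    """Return (task, metric, expected, got) for every metric outside rtol.

    `measured` is assumed to map task name -> metric name -> score.
    A missing score is reported as a failure with got=None.
    """
    failures = []
    for task in config["tasks"]:
        for metric in task["metrics"]:
            expected = metric["value"]
            got = measured.get(task["name"], {}).get(metric["name"])
            if got is None or not math.isclose(got, expected, rel_tol=rtol):
                failures.append((task["name"], metric["name"], expected, got))
    return failures
```

An empty return value means every listed metric was within tolerance. Returning the failures as a list, rather than raising on the first mismatch, lets a report show all regressions for a model at once.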