xc-llm-ascend/Qwen3-8B-Base.yaml at af04ee9e7a53dfab5ddbcee5535e8645cc84c169 - xc-llm-ascend - Gitea: Git with a cup of tea

EngineX/xc-llm-ascend

Files

Icey 86bdde1ca8 Enable pytest and yaml style accuracy test (#2073 )

### What this PR does / why we need it?

This PR enabled pytest and yaml style accuracy test, users now can
enable accuracy test by running:

```bash
cd ~/vllm-ascend
pytest -sv ./tests/e2e/singlecard/models/test_lm_eval_correctness.py \
          --config ./tests/e2e/singlecard/models/configs/Qwen3-8B-Base.yaml \
          --report_output ./benchmarks/accuracy/Qwen3-8B-Base.md

pytest -sv ./tests/e2e/singlecard/models/test_lm_eval_correctness.py \
          --config-list-file ./tests/e2e/singlecard/models/configs/accuracy.txt
```

Closes: https://github.com/vllm-project/vllm-ascend/issues/1970

### Does this PR introduce _any_ user-facing change?
no

### How was this patch tested?


- vLLM version: v0.10.0
- vLLM main:
2836dd73f1

---------

Signed-off-by: Icey <1790571317@qq.com>

2025-07-31 21:39:13 +08:00

14 lines

262 B

YAML

Raw Blame History

 model_name: "Qwen/Qwen3-8B-Base"
 tasks:
 - name: "gsm8k"
   metrics:
   - name: "exact_match,strict-match"
     value: 0.82
   - name: "exact_match,flexible-extract"
     value: 0.83
 - name: "ceval-valid"
   metrics:
   - name: "acc,none"
     value: 0.82
 num_fewshot: 5