### What this PR does / why we need it?
1. Provide accuracy test reports for the development branch release.
2. Models and datasets for the accuracy test:

| Model | Datasets |
|---------------------------- | --------------------------- |
| Qwen2.5-7B-Instruct | ceval-val, gsm8k, mmlu |
| Qwen3-8B | ceval-val, gsm8k, mmlu |
| Llama-3.1-8B-Instruct | ceval-val, gsm8k, mmlu |
| Qwen2.5-VL-7B-Instruct | mmmu_val |
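
As a rough illustration of the test matrix in the table above, the sketch below enumerates one evaluation run per model and dataset pair and derives a report path for each model. It assumes one Markdown report per model, named after the model, in the report directory named in this PR description. `ACCURACY_MATRIX`, `report_path`, and `iter_runs` are hypothetical names used only for this example and are not part of the PR.

```python
from pathlib import PurePosixPath

# Test matrix copied from the table in this PR description.
ACCURACY_MATRIX = {
    "Qwen2.5-7B-Instruct": ["ceval-val", "gsm8k", "mmlu"],
    "Qwen3-8B": ["ceval-val", "gsm8k", "mmlu"],
    "Llama-3.1-8B-Instruct": ["ceval-val", "gsm8k", "mmlu"],
    "Qwen2.5-VL-7B-Instruct": ["mmmu_val"],
}

# Report directory named in this PR description.
REPORT_DIR = PurePosixPath("docs/source/developer_guide/accuracy_report")


def report_path(model: str) -> PurePosixPath:
    """Hypothetical helper: assume one Markdown report per model."""
    return REPORT_DIR / f"{model}.md"


def iter_runs():
    """Yield one (model, dataset, report path) tuple per evaluation run."""
    for model, datasets in ACCURACY_MATRIX.items():
        for dataset in datasets:
            yield model, dataset, report_path(model)


if __name__ == "__main__":
    for model, dataset, path in iter_runs():
        print(f"{model:<26} {dataset:<10} -> {path}")
```

Keeping the matrix in a single mapping means that adding a model or dataset only requires editing one place; the actual evaluation command is left out because it is not described here.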
### Does this PR introduce _any_ user-facing change?
This PR adds the accuracy test reports for the release version under `docs/source/developer_guide/accuracy_report`:

- Qwen2.5-7B-Instruct.md
- Qwen3-8B.md
- Llama-3.1-8B-Instruct.md
- Qwen2.5-VL-7B-Instruct.md

Signed-off-by: hfadzxy <starmoon_zhang@163.com>
# Evaluation

:::{toctree}
:caption: Accuracy
:maxdepth: 1
using_lm_eval
using_opencompass
using_evalscope
:::

:::{toctree}
:caption: Performance
:maxdepth: 1
performance_benchmark
:::