初始化项目,由ModelHub XC社区提供模型
Model: nv-community/AceMath-7B-Instruct Source: Original Platform
This commit is contained in:
10
evaluation/README.md
Normal file
10
evaluation/README.md
Normal file
@@ -0,0 +1,10 @@
|
||||
|
||||
|
||||
|
||||
## Introduction
|
||||
This is the evaluation script used to reproduce math benchmarks scores for AceMath-1.5B/7B/72B-Instruct models based on their outputs. The benchmark can be downloaded from [Qwen2.5-Math](https://github.com/QwenLM/Qwen2.5-Math/tree/main/evaluation/data).
|
||||
|
||||
## Calculate Scores
|
||||
```console
|
||||
python calculate_scores.py
|
||||
```
|
||||
Reference in New Issue
Block a user