ModelHub XC dc3ae8f5a4 Initialize project; model provided by the ModelHub XC community
Model: nv-community/AceMath-1.5B-Instruct
Source: Original Platform
2026-06-09 22:29:14 +08:00

Introduction

This is the evaluation script used to reproduce the math benchmark scores of the AceMath-1.5B/7B/72B-Instruct models from their saved outputs. The benchmark data can be downloaded from the Qwen2.5-Math repository.

Calculate Scores

python calculate_scores.py
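
For readers who want a feel for what scoring model outputs involves, here is a minimal sketch in Python. It is not the contents of calculate_scores.py: the function names (extract_boxed, normalize, accuracy), the assumption that each output ends with a \boxed{...} answer, and the plain string comparison are all illustrative assumptions. The actual script may use a more robust check for mathematical equivalence.

```python
def extract_boxed(text):
    # Return the contents of the last \boxed{...} in text, or None if absent.
    marker = "\\boxed{"
    start = text.rfind(marker)
    if start == -1:
        return None
    i = start + len(marker)
    depth = 1  # we are already inside the opening brace of \boxed{
    chars = []
    while i < len(text):
        ch = text[i]
        if ch == "{":
            depth += 1
        elif ch == "}":
            depth -= 1
            if depth == 0:
                return "".join(chars)
        chars.append(ch)
        i += 1
    return None  # braces never balanced


def normalize(answer):
    # Hypothetical normalization: drop spaces and a trailing period.
    if answer is None:
        return None
    return answer.replace(" ", "").rstrip(".")


def accuracy(outputs, references):
    # Fraction of outputs whose boxed answer matches the reference exactly.
    if not outputs:
        return 0.0
    correct = 0
    for out, ref in zip(outputs, references):
        pred = normalize(extract_boxed(out))
        if pred is not None and pred == normalize(ref):
            correct += 1
    return correct / len(outputs)


if __name__ == "__main__":
    outputs = [
        "Adding gives \\boxed{4}",
        "So the result is \\boxed{\\frac{1}{2}}",
    ]
    references = ["4", "\\frac{1}{3}"]
    print(accuracy(outputs, references))
```

Exact string matching undercounts answers that are equivalent but written differently (for example 0.5 versus \frac{1}{2}), which is why a real evaluation script would typically compare answers more carefully.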