Files
ModelHub XC 20bd54f55d 初始化项目,由ModelHub XC社区提供模型
Model: Weyaxi/Airboros2.1-Platypus2-13B-QLora-0.80-epoch
Source: Original Platform
2026-06-01 18:11:16 +08:00

898 B

Buy Me A Coffee

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 51.17
ARC (25-shot) 58.96
HellaSwag (10-shot) 82.46
MMLU (5-shot) 54.62
TruthfulQA (0-shot) 47.71
Winogrande (5-shot) 75.14
GSM8K (5-shot) 0.0
DROP (3-shot) 39.32