License: cc-by-nc-sa-4.0

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

| Metric              | Value |
|---------------------|-------|
| Avg.                | 25.35 |
| ARC (25-shot)       | 22.27 |
| HellaSwag (10-shot) | 28.99 |
| MMLU (5-shot)       | 26.62 |
| TruthfulQA (0-shot) | 41.71 |
| Winogrande (5-shot) | 52.72 |
| GSM8K (5-shot)      | 0.23  |
| DROP (3-shot)       | 4.93  |
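
The Avg. figure is consistent with an unweighted mean of the seven benchmark scores listed above. The following is a minimal sketch for checking that, assuming the leaderboard averages all seven scores with equal weight; the variable names are illustrative and not part of any leaderboard tooling.

```python
# Benchmark scores copied from the results listed above.
scores = {
    "ARC (25-shot)": 22.27,
    "HellaSwag (10-shot)": 28.99,
    "MMLU (5-shot)": 26.62,
    "TruthfulQA (0-shot)": 41.71,
    "Winogrande (5-shot)": 52.72,
    "GSM8K (5-shot)": 0.23,
    "DROP (3-shot)": 4.93,
}

# Assumption: "Avg." is the plain arithmetic mean of all seven scores.
average = sum(scores.values()) / len(scores)

print(f"Avg. {average:.2f}")
```

Under that assumption the computed mean rounds to the reported 25.35.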
Description
Model synced from source: Corianas/256_5epoch