license, tags, model-index
| license |
tags |
model-index |
| apache-2.0 |
| merge |
| mergekit |
| lazymergekit |
| dvilasuero/DistilabelBeagle14-7B |
| teknium/OpenHermes-2.5-Mistral-7B |
|
| name |
results |
| DistilabelCerberus-7B-slerp |
| task |
dataset |
metrics |
source |
| type |
name |
| text-generation |
Text Generation |
|
| name |
type |
config |
split |
args |
| AI2 Reasoning Challenge (25-Shot) |
ai2_arc |
ARC-Challenge |
test |
|
|
| type |
value |
name |
| acc_norm |
68.17 |
normalized accuracy |
|
|
|
|
| task |
dataset |
metrics |
source |
| type |
name |
| text-generation |
Text Generation |
|
| name |
type |
split |
args |
| HellaSwag (10-Shot) |
hellaswag |
validation |
|
|
| type |
value |
name |
| acc_norm |
86.78 |
normalized accuracy |
|
|
|
|
| task |
dataset |
metrics |
source |
| type |
name |
| text-generation |
Text Generation |
|
| name |
type |
config |
split |
args |
| MMLU (5-Shot) |
cais/mmlu |
all |
test |
|
|
| type |
value |
name |
| acc |
64.2 |
accuracy |
|
|
|
|
| task |
dataset |
metrics |
source |
| type |
name |
| text-generation |
Text Generation |
|
| name |
type |
config |
split |
args |
| TruthfulQA (0-shot) |
truthful_qa |
multiple_choice |
validation |
|
|
|
|
|
| task |
dataset |
metrics |
source |
| type |
name |
| text-generation |
Text Generation |
|
| name |
type |
config |
split |
args |
| Winogrande (5-shot) |
winogrande |
winogrande_xl |
validation |
|
|
| type |
value |
name |
| acc |
79.48 |
accuracy |
|
|
|
|
| task |
dataset |
metrics |
source |
| type |
name |
| text-generation |
Text Generation |
|
| name |
type |
config |
split |
args |
| GSM8k (5-shot) |
gsm8k |
main |
test |
|
|
| type |
value |
name |
| acc |
69.83 |
accuracy |
|
|
|
|
|
|
|
DistilabelCerberus-7B-slerp
DistilabelCerberus-7B-slerp is a merge of the following models using mergekit:
🧩 Configuration
Results
|
ARC-C |
Hellaswag |
ThruthfulQA |
Winogrande |
GSM8K |
|
|
| OpenHermes-2.5-Mistral-7B |
61.26 |
65.22 |
52.24 |
78.06 |
26.08 |
|
|
| DistilabelBeagle14-7B |
? |
? |
71.66 |
? |
? |
|
|
| DistilabelCerberus-7B-slerp |
65.44 |
69.29 |
60.93 |
79.48 |
69.82 |
|
|
Detailed results can be found here
| Metric |
Value |
| Avg. |
71.56 |
| AI2 Reasoning Challenge (25-Shot) |
68.17 |
| HellaSwag (10-Shot) |
86.78 |
| MMLU (5-Shot) |
64.20 |
| TruthfulQA (0-shot) |
60.93 |
| Winogrande (5-shot) |
79.48 |
| GSM8k (5-shot) |
69.83 |