language, license, tags, base_model, model-index
| language |
license |
tags |
base_model |
model-index |
|
|
cc-by-nc-4.0 |
| merge |
| lazymergekit |
| LDCC/LDCC-SOLAR-10.7B |
| upstage/SOLAR-10.7B-Instruct-v1.0 |
|
| LDCC/LDCC-SOLAR-10.7B |
| upstage/SOLAR-10.7B-Instruct-v1.0 |
|
| name |
results |
| SOLAR-10.7B-slerp |
| task |
dataset |
metrics |
source |
| type |
name |
| text-generation |
Text Generation |
|
| name |
type |
config |
split |
args |
| AI2 Reasoning Challenge (25-Shot) |
ai2_arc |
ARC-Challenge |
test |
|
|
| type |
value |
name |
| acc_norm |
68.17 |
normalized accuracy |
|
|
|
|
| task |
dataset |
metrics |
source |
| type |
name |
| text-generation |
Text Generation |
|
| name |
type |
split |
args |
| HellaSwag (10-Shot) |
hellaswag |
validation |
|
|
| type |
value |
name |
| acc_norm |
86.91 |
normalized accuracy |
|
|
|
|
| task |
dataset |
metrics |
source |
| type |
name |
| text-generation |
Text Generation |
|
| name |
type |
config |
split |
args |
| MMLU (5-Shot) |
cais/mmlu |
all |
test |
|
|
| type |
value |
name |
| acc |
66.73 |
accuracy |
|
|
|
|
| task |
dataset |
metrics |
source |
| type |
name |
| text-generation |
Text Generation |
|
| name |
type |
config |
split |
args |
| TruthfulQA (0-shot) |
truthful_qa |
multiple_choice |
validation |
|
|
|
|
|
| task |
dataset |
metrics |
source |
| type |
name |
| text-generation |
Text Generation |
|
| name |
type |
config |
split |
args |
| Winogrande (5-shot) |
winogrande |
winogrande_xl |
validation |
|
|
| type |
value |
name |
| acc |
84.06 |
accuracy |
|
|
|
|
| task |
dataset |
metrics |
source |
| type |
name |
| text-generation |
Text Generation |
|
| name |
type |
config |
split |
args |
| GSM8k (5-shot) |
gsm8k |
main |
test |
|
|
| type |
value |
name |
| acc |
62.17 |
accuracy |
|
|
|
|
|
|
|
SOLAR-10.7B-slerp
SOLAR-10.7B-slerp is a merge of the following models using mergekit:
Github
https://github.com/sunjin7725/SOLAR-10.7b-slerp
Benchmark
Open-Ko-LLM-Leaderboard
| Average |
Ko-ARC |
Ko-HellaSwag |
Ko-MMLU |
Ko-TruthfulQA |
Ko-CommonGen V2 |
| 56.93 |
53.58 |
62.03 |
53.31 |
57.16 |
58.56 |
How to use
🧩 Configuration
Detailed results can be found here
| Metric |
Value |
| Avg. |
72.58 |
| AI2 Reasoning Challenge (25-Shot) |
68.17 |
| HellaSwag (10-Shot) |
86.91 |
| MMLU (5-Shot) |
66.73 |
| TruthfulQA (0-shot) |
67.42 |
| Winogrande (5-shot) |
84.06 |
| GSM8k (5-shot) |
62.17 |