Go to file

ModelHub XC 8940283e4d 初始化项目，由ModelHub XC社区提供模型

Model: Weyaxi/SauerkrautLM-UNA-SOLAR-Instruct
Source: Original Platform

2026-05-09 19:35:25 +08:00

.gitattributes

初始化项目，由ModelHub XC社区提供模型

2026-05-09 19:35:25 +08:00

config.json

初始化项目，由ModelHub XC社区提供模型

2026-05-09 19:35:25 +08:00

configuration.json

初始化项目，由ModelHub XC社区提供模型

2026-05-09 19:35:25 +08:00

model-00001-of-00003.safetensors

初始化项目，由ModelHub XC社区提供模型

2026-05-09 19:35:25 +08:00

model-00002-of-00003.safetensors

初始化项目，由ModelHub XC社区提供模型

2026-05-09 19:35:25 +08:00

model-00003-of-00003.safetensors

初始化项目，由ModelHub XC社区提供模型

2026-05-09 19:35:25 +08:00

model.safetensors.index.json

初始化项目，由ModelHub XC社区提供模型

2026-05-09 19:35:25 +08:00

README.md

初始化项目，由ModelHub XC社区提供模型

2026-05-09 19:35:25 +08:00

special_tokens_map.json

初始化项目，由ModelHub XC社区提供模型

2026-05-09 19:35:25 +08:00

tokenizer_config.json

初始化项目，由ModelHub XC社区提供模型

2026-05-09 19:35:25 +08:00

tokenizer.json

初始化项目，由ModelHub XC社区提供模型

2026-05-09 19:35:25 +08:00

tokenizer.model

初始化项目，由ModelHub XC社区提供模型

2026-05-09 19:35:25 +08:00

README.md

license, tags, model-index

license

tags

model-index

cc-by-nc-4.0

merge

name

results

SauerkrautLM-UNA-SOLAR-Instruct

task

dataset

metrics

source

type	name
text-generation	Text Generation

name

type

config

split

args

AI2 Reasoning Challenge (25-Shot)

ai2_arc

ARC-Challenge

test

num_few_shot
25

type	value	name
acc_norm	70.9	normalized accuracy

url	name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Weyaxi/SauerkrautLM-UNA-SOLAR-Instruct	Open LLM Leaderboard

task

dataset

metrics

source

type	name
text-generation	Text Generation

name

type

split

args

HellaSwag (10-Shot)

hellaswag

validation

num_few_shot
10

type	value	name
acc_norm	88.3	normalized accuracy

url	name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Weyaxi/SauerkrautLM-UNA-SOLAR-Instruct	Open LLM Leaderboard

task

dataset

metrics

source

type	name
text-generation	Text Generation

name

type

config

split

args

MMLU (5-Shot)

cais/mmlu

all

test

num_few_shot
5

type	value	name
acc	66.15	accuracy

url	name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Weyaxi/SauerkrautLM-UNA-SOLAR-Instruct	Open LLM Leaderboard

task

dataset

metrics

source

type	name
text-generation	Text Generation

name

type

config

split

args

TruthfulQA (0-shot)

truthful_qa

multiple_choice

validation

num_few_shot
0

type	value
mc2	71.8

url	name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Weyaxi/SauerkrautLM-UNA-SOLAR-Instruct	Open LLM Leaderboard

task

dataset

metrics

source

type	name
text-generation	Text Generation

name

type

config

split

args

Winogrande (5-shot)

winogrande

winogrande_xl

validation

num_few_shot
5

type	value	name
acc	83.74	accuracy

url	name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Weyaxi/SauerkrautLM-UNA-SOLAR-Instruct	Open LLM Leaderboard

task

dataset

metrics

source

type	name
text-generation	Text Generation

name

type

config

split

args

GSM8k (5-shot)

gsm8k

main

test

num_few_shot
5

type	value	name
acc	64.67	accuracy

url	name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Weyaxi/SauerkrautLM-UNA-SOLAR-Instruct	Open LLM Leaderboard

SauerkrautLM-UNA-SOLAR-Instruct

This is the model for SauerkrautLM-UNA-SOLAR-Instruct. I used mergekit to merge models.

🥳 As of December 24 2023, this model holds the first place position on the Open LLM Leaderboard.

Screenshot

Prompt Template(s)

### User:
{user}

### Assistant:
{asistant}

Yaml Config to reproduce

slices:
  - sources:
      - model: VAGOsolutions/SauerkrautLM-SOLAR-Instruct
        layer_range: [0, 48]
      - model: fblgit/UNA-SOLAR-10.7B-Instruct-v1.0
        layer_range: [0, 48]

merge_method: slerp
base_model: upstage/SOLAR-10.7B-Instruct-v1.0

parameters:
  t:
    - filter: self_attn
      value: [0, 0.5, 0.3, 0.7, 1]
    - filter: mlp
      value: [1, 0.5, 0.7, 0.3, 0]
    - value: 0.5 # fallback for rest of tensors
tokenizer_source: union

dtype: bfloat16

Quantizationed versions

Quantizationed versions of this model is available thanks to TheBloke.

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	74.26
AI2 Reasoning Challenge (25-Shot)	70.90
HellaSwag (10-Shot)	88.30
MMLU (5-Shot)	66.15
TruthfulQA (0-shot)	71.80
Winogrande (5-shot)	83.74
GSM8k (5-shot)	64.67

If you would like to support me:

☕ Buy Me a Coffee

README.md

SauerkrautLM-UNA-SOLAR-Instruct

Screenshot

Prompt Template(s)

Yaml Config to reproduce

Quantizationed versions

GPTQ

GGUF

AWQ

Open LLM Leaderboard Evaluation Results