ModelHub XC c5ceab1eb2 Initialize project; model provided by the ModelHub XC community

Model: AbacusResearch/haLLawa4-7b
Source: Original Platform

2026-05-14 09:50:43 +08:00


license: apache-2.0

tags: merge, mergekit, lazymergekit, mlabonne/Monarch-7B, paulml/OGNO-7B, AbacusResearch/haLLAwa3

model-index: haLLawa4-7b (task type: text-generation, task name: Text Generation)

Dataset name	Dataset type	Config	Split	num_few_shot	Metric type	Metric name	Value
AI2 Reasoning Challenge (25-Shot)	ai2_arc	ARC-Challenge	test	25	acc_norm	normalized accuracy	71.5
HellaSwag (10-Shot)	hellaswag	-	validation	10	acc_norm	normalized accuracy	88.36
MMLU (5-Shot)	cais/mmlu	all	test	5	acc	accuracy	64.49
TruthfulQA (0-shot)	truthful_qa	multiple_choice	validation	0	mc2	-	74.27
Winogrande (5-shot)	winogrande	winogrande_xl	validation	5	acc	accuracy	82.4
GSM8k (5-shot)	gsm8k	main	test	5	acc	accuracy	70.51

Source for all results: Open LLM Leaderboard, https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=AbacusResearch/haLLawa4-7b

haLLawa4-7b

haLLawa4-7b is a merge of the following models using mergekit:

* mlabonne/Monarch-7B
* paulml/OGNO-7B
* AbacusResearch/haLLAwa3

🧩 Configuration

```yaml
models:
  - model: eren23/ogno-monarch-jaskier-merge-7b
    # No parameters necessary for base model
  - model: mlabonne/Monarch-7B
    # Emphasize the beginning of Vicuna format models
    parameters:
      weight: 0.5
      density: 0.59
  - model: paulml/OGNO-7B
    parameters:
      weight: 0.2
      density: 0.55
  # Vicuna format
  - model: AbacusResearch/haLLAwa3
    parameters:
      weight: 0.3
      density: 0.55
merge_method: dare_ties
base_model: eren23/ogno-monarch-jaskier-merge-7b
parameters:
  int8_mask: true
dtype: bfloat16
random_seed: 0
```
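To make the roles of `weight` and `density` in the configuration more concrete, here is a simplified Python sketch of the general DARE-TIES idea on flat lists of numbers. It is not mergekit's implementation: the function names `dare_prune` and `dare_ties_merge` are hypothetical, tensors are replaced by plain lists, no weight normalisation is applied, and options such as `int8_mask` and `dtype` are ignored. Each model's difference from the base is randomly pruned (entries kept with probability `density` and rescaled by `1/density`), scaled by `weight`, and only the contributions that agree with the elected sign are added back to the base.

```python
import random


def dare_prune(delta, density, rng):
    """Keep each entry with probability `density`; rescale survivors by 1/density."""
    return [d / density if rng.random() < density else 0.0 for d in delta]


def dare_ties_merge(base, models, seed=0):
    """Merge toy parameter lists.

    base   -- list of floats for the base model
    models -- list of (params, weight, density) tuples (hypothetical layout)
    """
    rng = random.Random(seed)

    # Build one weighted, pruned task vector (difference from base) per model.
    weighted_deltas = []
    for params, weight, density in models:
        delta = [p - b for p, b in zip(params, base)]
        pruned = dare_prune(delta, density, rng)
        weighted_deltas.append([weight * d for d in pruned])

    merged = []
    for i, b in enumerate(base):
        # Sign election: the sign of the summed weighted deltas wins.
        total = sum(d[i] for d in weighted_deltas)
        sign = 1.0 if total >= 0 else -1.0
        # Only contributions that agree with the elected sign are kept.
        agreeing = sum(d[i] for d in weighted_deltas if d[i] * sign > 0)
        merged.append(b + agreeing)
    return merged


if __name__ == "__main__":
    base = [0.0, 1.0, -2.0]
    # Toy stand-ins for the three merged models, using the weights and
    # densities from the configuration above.
    models = [
        ([0.4, 1.2, -2.5], 0.5, 0.59),
        ([0.1, 0.8, -2.1], 0.2, 0.55),
        ([0.3, 1.1, -1.9], 0.3, 0.55),
    ]
    print(dare_ties_merge(base, models, seed=0))
```

Because pruning is random, the result depends on the seed. Fixing it, as `random_seed: 0` does in the configuration, keeps repeated runs of this sketch identical.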

Open LLM Leaderboard Evaluation Results

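Detailed results can be found on the Open LLM Leaderboard: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=AbacusResearch/haLLawa4-7b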

Metric	Value
Avg.	75.25
AI2 Reasoning Challenge (25-Shot)	71.50
HellaSwag (10-Shot)	88.36
MMLU (5-Shot)	64.49
TruthfulQA (0-shot)	74.27
Winogrande (5-shot)	82.40
GSM8k (5-shot)	70.51
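The `Avg.` row appears to be the unweighted mean of the six benchmark scores. A short check, using only the values from the table above (the variable names are illustrative):

```python
# Scores copied from the results table above.
scores = {
    "AI2 Reasoning Challenge (25-Shot)": 71.50,
    "HellaSwag (10-Shot)": 88.36,
    "MMLU (5-Shot)": 64.49,
    "TruthfulQA (0-shot)": 74.27,
    "Winogrande (5-shot)": 82.40,
    "GSM8k (5-shot)": 70.51,
}

# Unweighted mean over the six benchmarks.
average = sum(scores.values()) / len(scores)
print(f"Average: {average:.3f}")
```

The mean of the rounded table values lands within 0.01 of the reported 75.25. Any small difference would be consistent with the leaderboard averaging unrounded scores, though that is an assumption.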
