ModelHub XC 45cf3cca4c Initialize project; model provided by the ModelHub XC community

Model: andrijdavid/macaroni-7b
Source: Original Platform

2026-06-15 02:54:17 +08:00


Field	Value
language	(none listed)
license	apache-2.0
tags	mistral, merge
model-index name	macaroni-7b
task	text-generation (Text Generation)
results source	Open LLM Leaderboard (https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=andrijdavid/macaroni-7b)

Benchmark	Dataset type	Config	Split	num_few_shot	Metric type	Metric name	Value
AI2 Reasoning Challenge (25-Shot)	ai2_arc	ARC-Challenge	test	25	acc_norm	normalized accuracy	73.12
HellaSwag (10-Shot)	hellaswag	-	validation	10	acc_norm	normalized accuracy	88.17
MMLU (5-Shot)	cais/mmlu	all	test	5	acc	accuracy	64.58
TruthfulQA (0-shot)	truthful_qa	multiple_choice	validation	0	mc2	-	68.76
Winogrande (5-shot)	winogrande	winogrande_xl	validation	5	acc	accuracy	84.37
GSM8k (5-shot)	gsm8k	main	test	5	acc	accuracy	68.61

Macaroni 7B

This is an experimental merge of pre-trained Mistral language models with fblgit/UNA-TheBeagle-7b-v1.

Disclaimer

No Warranty: The Model is provided on an "AS IS" basis, without warranty of any kind. The entire risk as to the quality, performance and use of The Model is with the user.
Limitation of Liability: In no event shall the creator(s) of The Model be liable for any claim, damages, or other liability, whether in an action of contract, tort or otherwise, arising from, out of, or in connection with The Model or the use or other dealings in The Model.
Accuracy and Risks: The creator(s) do not warrant that The Model is free from errors or inaccuracies and disclaim any responsibility for any harm resulting from the use of The Model.
Use at Your Own Risk: Users are solely responsible for any consequences resulting from the use of The Model, including but not limited to any changes made to The Model by the user or the results produced by The Model.
Compliance with Laws: Users are solely responsible for ensuring that their use of The Model complies with all applicable laws, regulations, and policies.
Ethical Use: Users are encouraged to use The Model ethically and responsibly. The creator(s) disclaim any responsibility for misuse or unethical use of The Model.
Modifications: Any modifications made to The Model by third parties are the sole responsibility of the party making the modifications. The original creator(s) of The Model shall not be responsible for any modifications made by third parties.

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	74.60
AI2 Reasoning Challenge (25-Shot)	73.12
HellaSwag (10-Shot)	88.17
MMLU (5-Shot)	64.58
TruthfulQA (0-shot)	68.76
Winogrande (5-shot)	84.37
GSM8k (5-shot)	68.61
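
As a quick sanity check, the "Avg." row appears to be the unweighted mean of the six benchmark scores. The sketch below assumes that interpretation (the card does not state how the average is computed); the `scores` dictionary is an illustrative name, with values copied from the table above.

```python
from statistics import mean

# Benchmark scores (percent) copied from the results table.
scores = {
    "AI2 Reasoning Challenge (25-Shot)": 73.12,
    "HellaSwag (10-Shot)": 88.17,
    "MMLU (5-Shot)": 64.58,
    "TruthfulQA (0-shot)": 68.76,
    "Winogrande (5-shot)": 84.37,
    "GSM8k (5-shot)": 68.61,
}

# Assumption: "Avg." is the plain arithmetic mean, rounded to two decimals.
average = round(mean(scores.values()), 2)
print(f"Avg. {average:.2f}")
```

Under that assumption the computed value matches the 74.60 reported in the table.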
