Files
macaroni-7b/README.md
ModelHub XC 45cf3cca4c 初始化项目,由ModelHub XC社区提供模型
Model: andrijdavid/macaroni-7b
Source: Original Platform
2026-06-15 02:54:17 +08:00

4.8 KiB

language, license, tags, model-index
language license tags model-index
en
apache-2.0
mistral
merge
name results
macaroni-7b
task dataset metrics source
type name
text-generation Text Generation
name type config split args
AI2 Reasoning Challenge (25-Shot) ai2_arc ARC-Challenge test
num_few_shot
25
type value name
acc_norm 73.12 normalized accuracy
url name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=andrijdavid/macaroni-7b Open LLM Leaderboard
task dataset metrics source
type name
text-generation Text Generation
name type split args
HellaSwag (10-Shot) hellaswag validation
num_few_shot
10
type value name
acc_norm 88.17 normalized accuracy
url name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=andrijdavid/macaroni-7b Open LLM Leaderboard
task dataset metrics source
type name
text-generation Text Generation
name type config split args
MMLU (5-Shot) cais/mmlu all test
num_few_shot
5
type value name
acc 64.58 accuracy
url name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=andrijdavid/macaroni-7b Open LLM Leaderboard
task dataset metrics source
type name
text-generation Text Generation
name type config split args
TruthfulQA (0-shot) truthful_qa multiple_choice validation
num_few_shot
0
type value
mc2 68.76
url name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=andrijdavid/macaroni-7b Open LLM Leaderboard
task dataset metrics source
type name
text-generation Text Generation
name type config split args
Winogrande (5-shot) winogrande winogrande_xl validation
num_few_shot
5
type value name
acc 84.37 accuracy
url name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=andrijdavid/macaroni-7b Open LLM Leaderboard
task dataset metrics source
type name
text-generation Text Generation
name type config split args
GSM8k (5-shot) gsm8k main test
num_few_shot
5
type value name
acc 68.61 accuracy
url name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=andrijdavid/macaroni-7b Open LLM Leaderboard

Macaroni 7B

This is an experimental merge of pre-trained mistral language models with fblgit/UNA-TheBeagle-7b-v1.

Disclaimer

  • No Warranty: The Model is provided on an "AS IS" basis, without warranty of any kind. The entire risk as to the quality, performance and use of The Model is with the user.

  • Limitation of Liability: In no event shall the creator(s) of The Model be liable for any claim, damages, or other liability, whether in an action of contract, tort or otherwise, arising from, out of, or in connection with The Model or the use or other dealings in The Model.

  • Accuracy and Risks: The creator(s) do not warrant that The Model is free from errors or inaccuracies and disclaim any responsibility for any harm resulting from the use of The Model.

  • Use at Your Own Risk: Users are solely responsible for any consequences resulting from the use of The Model, including but not limited to any changes made to The Model by the user or the results produced by The Model.

  • Compliance with Laws: Users are solely responsible for ensuring that their use of The Model complies with all applicable laws, regulations, and policies.

  • Ethical Use: Users are encouraged to use The Model ethically and responsibly. The creator(s) disclaim any responsibility for misuse or unethical use of The Model.

  • Modifications: Any modifications made to The Model by third parties are the sole responsibility of the party making the modifications. The original creator(s) of The Model shall not be responsible for any modifications made by third parties.

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 74.60
AI2 Reasoning Challenge (25-Shot) 73.12
HellaSwag (10-Shot) 88.17
MMLU (5-Shot) 64.58
TruthfulQA (0-shot) 68.76
Winogrande (5-shot) 84.37
GSM8k (5-shot) 68.61