ModelHub XC ecc3d1edb6 初始化项目,由ModelHub XC社区提供模型
Model: dhanushreddy29/BrokenKeyboard
Source: Original Platform
2026-04-13 09:40:59 +08:00

language, license, datasets, base_model, model-index
language license datasets base_model model-index
en
cc-by-nc-4.0
argilla/distilabel-intel-orca-dpo-pairs
upstage/SOLAR-10.7B-Instruct-v1.0
name results
BrokenKeyboard
task dataset metrics source
type name
text-generation Text Generation
name type config split args
AI2 Reasoning Challenge (25-Shot) ai2_arc ARC-Challenge test
num_few_shot
25
type value name
acc_norm 71.25 normalized accuracy
url name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=dhanushreddy29/BrokenKeyboard Open LLM Leaderboard
task dataset metrics source
type name
text-generation Text Generation
name type split args
HellaSwag (10-Shot) hellaswag validation
num_few_shot
10
type value name
acc_norm 88.34 normalized accuracy
url name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=dhanushreddy29/BrokenKeyboard Open LLM Leaderboard
task dataset metrics source
type name
text-generation Text Generation
name type config split args
MMLU (5-Shot) cais/mmlu all test
num_few_shot
5
type value name
acc 66.04 accuracy
url name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=dhanushreddy29/BrokenKeyboard Open LLM Leaderboard
task dataset metrics source
type name
text-generation Text Generation
name type config split args
TruthfulQA (0-shot) truthful_qa multiple_choice validation
num_few_shot
0
type value
mc2 71.36
url name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=dhanushreddy29/BrokenKeyboard Open LLM Leaderboard
task dataset metrics source
type name
text-generation Text Generation
name type config split args
Winogrande (5-shot) winogrande winogrande_xl validation
num_few_shot
5
type value name
acc 83.19 accuracy
url name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=dhanushreddy29/BrokenKeyboard Open LLM Leaderboard
task dataset metrics source
type name
text-generation Text Generation
name type config split args
GSM8k (5-shot) gsm8k main test
num_few_shot
5
type value name
acc 64.29 accuracy
url name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=dhanushreddy29/BrokenKeyboard Open LLM Leaderboard

Model Card for Model ID

Just testing out LLM Finetuning. Finetuned on upstage/SOLAR-10.7B-Instruct-v1.0 using argilla/distilabel-intel-orca-dpo-pairs. Followed the Google Colab mentioned in this article: https://towardsdatascience.com/fine-tune-a-mistral-7b-model-with-direct-preference-optimization-708042745aac

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 74.08
AI2 Reasoning Challenge (25-Shot) 71.25
HellaSwag (10-Shot) 88.34
MMLU (5-Shot) 66.04
TruthfulQA (0-shot) 71.36
Winogrande (5-shot) 83.19
GSM8k (5-shot) 64.29
Description
Model synced from source: dhanushreddy29/BrokenKeyboard
Readme 564 KiB