Files
zephyr-7b-truthy/README.md
ModelHub XC ffcd16fde5 初始化项目,由ModelHub XC社区提供模型
Model: vicgalle/zephyr-7b-truthy
Source: Original Platform
2026-04-11 11:22:58 +08:00

3.3 KiB

license, library_name, datasets, model-index
license library_name datasets model-index
apache-2.0 transformers
jondurbin/truthy-dpo-v0.1
name results
zephyr-7b-truthy
task dataset metrics source
type name
text-generation Text Generation
name type config split args
AI2 Reasoning Challenge (25-Shot) ai2_arc ARC-Challenge test
num_few_shot
25
type value name
acc_norm 60.75 normalized accuracy
url name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=vicgalle/zephyr-7b-truthy Open LLM Leaderboard
task dataset metrics source
type name
text-generation Text Generation
name type split args
HellaSwag (10-Shot) hellaswag validation
num_few_shot
10
type value name
acc_norm 84.64 normalized accuracy
url name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=vicgalle/zephyr-7b-truthy Open LLM Leaderboard
task dataset metrics source
type name
text-generation Text Generation
name type config split args
MMLU (5-Shot) cais/mmlu all test
num_few_shot
5
type value name
acc 59.53 accuracy
url name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=vicgalle/zephyr-7b-truthy Open LLM Leaderboard
task dataset metrics source
type name
text-generation Text Generation
name type config split args
TruthfulQA (0-shot) truthful_qa multiple_choice validation
num_few_shot
0
type value
mc2 63.31
url name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=vicgalle/zephyr-7b-truthy Open LLM Leaderboard
task dataset metrics source
type name
text-generation Text Generation
name type config split args
Winogrande (5-shot) winogrande winogrande_xl validation
num_few_shot
5
type value name
acc 77.9 accuracy
url name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=vicgalle/zephyr-7b-truthy Open LLM Leaderboard
task dataset metrics source
type name
text-generation Text Generation
name type config split args
GSM8k (5-shot) gsm8k main test
num_few_shot
5
type value name
acc 25.47 accuracy
url name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=vicgalle/zephyr-7b-truthy Open LLM Leaderboard

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 61.93
AI2 Reasoning Challenge (25-Shot) 60.75
HellaSwag (10-Shot) 84.64
MMLU (5-Shot) 59.53
TruthfulQA (0-shot) 63.31
Winogrande (5-shot) 77.90
GSM8k (5-shot) 25.47