ModelHub XC 62cf9e933e Initialize project; model provided by the ModelHub XC community
Model: KnutJaegersberg/Deita-4b
Source: Original Platform
2026-06-01 17:48:23 +08:00

---
license: other
datasets:
- KnutJaegersberg/Deita-6k
license_name: qwen
license_link: LICENSE
model-index:
- name: Deita-4b
  results:
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: AI2 Reasoning Challenge (25-Shot)
      type: ai2_arc
      config: ARC-Challenge
      split: test
      args:
        num_few_shot: 25
    metrics:
    - type: acc_norm
      value: 46.08
      name: normalized accuracy
    source:
      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=KnutJaegersberg/Deita-4b
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: HellaSwag (10-Shot)
      type: hellaswag
      split: validation
      args:
        num_few_shot: 10
    metrics:
    - type: acc_norm
      value: 71.81
      name: normalized accuracy
    source:
      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=KnutJaegersberg/Deita-4b
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: MMLU (5-Shot)
      type: cais/mmlu
      config: all
      split: test
      args:
        num_few_shot: 5
    metrics:
    - type: acc
      value: 55.46
      name: accuracy
    source:
      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=KnutJaegersberg/Deita-4b
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: TruthfulQA (0-shot)
      type: truthful_qa
      config: multiple_choice
      split: validation
      args:
        num_few_shot: 0
    metrics:
    - type: mc2
      value: 50.23
    source:
      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=KnutJaegersberg/Deita-4b
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: Winogrande (5-shot)
      type: winogrande
      config: winogrande_xl
      split: validation
      args:
        num_few_shot: 5
    metrics:
    - type: acc
      value: 66.14
      name: accuracy
    source:
      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=KnutJaegersberg/Deita-4b
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: GSM8k (5-shot)
      type: gsm8k
      config: main
      split: test
      args:
        num_few_shot: 5
    metrics:
    - type: acc
      value: 48.9
      name: accuracy
    source:
      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=KnutJaegersberg/Deita-4b
      name: Open LLM Leaderboard
---

Prompt Example:

### System:
You are an AI assistant. User will give you a task. Your goal is to complete the task as faithfully as you can. While performing the task think step-by-step and justify your steps.
### User: 
How do you fine tune a large language model? 
### Assistant:
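
The prompt layout above can be assembled programmatically. The following is a minimal sketch, not the author's own tooling: `build_prompt` and `DEFAULT_SYSTEM` are hypothetical names introduced here, and the exact newline placement between the sections is an assumption inferred from the example.

```python
# Hypothetical helper that reproduces the "### System / ### User / ### Assistant"
# layout shown in the prompt example. Whitespace handling is an assumption.

DEFAULT_SYSTEM = (
    "You are an AI assistant. User will give you a task. "
    "Your goal is to complete the task as faithfully as you can. "
    "While performing the task think step-by-step and justify your steps."
)


def build_prompt(user_message: str, system_message: str = DEFAULT_SYSTEM) -> str:
    """Return a single prompt string ending with an open Assistant section."""
    return (
        f"### System:\n{system_message.strip()}\n"
        f"### User:\n{user_message.strip()}\n"
        "### Assistant:\n"
    )


prompt = build_prompt("How do you fine tune a large language model?")
print(prompt)
```

The resulting string ends with the open `### Assistant:` header, so the model's completion would follow it directly.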

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 56.43
AI2 Reasoning Challenge (25-Shot) 46.08
HellaSwag (10-Shot) 71.81
MMLU (5-Shot) 55.46
TruthfulQA (0-shot) 50.23
Winogrande (5-shot) 66.14
GSM8k (5-shot) 48.90
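
The Avg. row is the arithmetic mean of the six benchmark scores. The short sketch below recomputes it from the rounded values listed in the table. It yields roughly 56.44 rather than the listed 56.43; the small gap presumably comes from the leaderboard averaging unrounded scores, which is an assumption and not something the card states.

```python
# Recompute the leaderboard average from the rounded per-benchmark scores
# in the table. The dictionary keys simply mirror the table's row labels.
scores = {
    "AI2 Reasoning Challenge (25-Shot)": 46.08,
    "HellaSwag (10-Shot)": 71.81,
    "MMLU (5-Shot)": 55.46,
    "TruthfulQA (0-shot)": 50.23,
    "Winogrande (5-shot)": 66.14,
    "GSM8k (5-shot)": 48.90,
}

average = sum(scores.values()) / len(scores)
print(f"Average over {len(scores)} benchmarks: {average:.2f}")
```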