Model: KnutJaegersberg/Deita-1_8B
Source: Original Platform
2026-06-08 12:27:43 +08:00

License: other (license_name: qwen, license_link: LICENSE)

Model index: Deita-1_8B. All results are for the text-generation (Text Generation) task and are sourced from the Open LLM Leaderboard: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=KnutJaegersberg/Deita-1_8B

| Benchmark | Dataset type | Config | Split | num_few_shot | Metric type | Metric name | Value |
|---|---|---|---|---|---|---|---|
| AI2 Reasoning Challenge (25-Shot) | ai2_arc | ARC-Challenge | test | 25 | acc_norm | normalized accuracy | 36.52 |
| HellaSwag (10-Shot) | hellaswag | (not given) | validation | 10 | acc_norm | normalized accuracy | 60.63 |
| MMLU (5-Shot) | cais/mmlu | all | test | 5 | acc | accuracy | 45.62 |
| TruthfulQA (0-shot) | truthful_qa | multiple_choice | validation | 0 | mc2 | (not given) | 40.02 |
| Winogrande (5-shot) | winogrande | winogrande_xl | validation | 5 | acc | accuracy | 59.35 |
| GSM8k (5-shot) | gsm8k | main | test | 5 | acc | accuracy | 15.62 |

The noncommercial Qwen license applies (see LICENSE).

Prompt Example:

### System:
You are an AI assistant. User will give you a task. Your goal is to complete the task as faithfully as you can. While performing the task think step-by-step and justify your steps.
### User: 
How do you fine tune a large language model? 
### Assistant:
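
A minimal sketch of how the prompt layout above could be assembled programmatically. The helper name `build_prompt` and the constant `DEFAULT_SYSTEM` are hypothetical, and the exact whitespace around each header (single newlines, no trailing spaces) is an assumption rather than something the model card specifies.

```python
# Hypothetical helper: assembles a prompt in the
# "### System / ### User / ### Assistant" layout shown above.
# Newline placement is an assumption, not confirmed by the model card.

DEFAULT_SYSTEM = (
    "You are an AI assistant. User will give you a task. "
    "Your goal is to complete the task as faithfully as you can. "
    "While performing the task think step-by-step and justify your steps."
)


def build_prompt(user_message: str, system_message: str = DEFAULT_SYSTEM) -> str:
    """Return the prompt text, ending right after the assistant header."""
    return (
        f"### System:\n{system_message.strip()}\n"
        f"### User:\n{user_message.strip()}\n"
        "### Assistant:\n"
    )


prompt = build_prompt("How do you fine tune a large language model?")
print(prompt)
```

The string ends right after the assistant header, so whatever the model generates next is the reply.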

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

| Metric | Value |
|---|---|
| Avg. | 42.96 |
| AI2 Reasoning Challenge (25-Shot) | 36.52 |
| HellaSwag (10-Shot) | 60.63 |
| MMLU (5-Shot) | 45.62 |
| TruthfulQA (0-shot) | 40.02 |
| Winogrande (5-shot) | 59.35 |
| GSM8k (5-shot) | 15.62 |
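
The reported average appears to be the unweighted mean of the six benchmark scores. The short check below reproduces it under that assumption; the variable names are illustrative only.

```python
# Scores copied from the results listed above.
scores = {
    "AI2 Reasoning Challenge (25-Shot)": 36.52,
    "HellaSwag (10-Shot)": 60.63,
    "MMLU (5-Shot)": 45.62,
    "TruthfulQA (0-shot)": 40.02,
    "Winogrande (5-shot)": 59.35,
    "GSM8k (5-shot)": 15.62,
}

# Assumption: "Avg." is the plain arithmetic mean, with no weighting.
average = sum(scores.values()) / len(scores)
print(f"Avg. {average:.2f}")  # prints "Avg. 42.96"
```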