ModelHub XC 32352afeac 初始化项目,由ModelHub XC社区提供模型
Model: LTC-AI-Labs/L2-7b-Hermes-Synthia
Source: Original Platform
2026-05-18 02:55:31 +08:00

language, license, tags, datasets, pipeline_tag, model-index
language license tags datasets pipeline_tag model-index
en
llama2
roleplay
conversational
migtissera/Synthia-v1.3
open-llm-leaderboard/details_NousResearch__Nous-Hermes-llama-2-7b
text-generation
name results
L2-7b-Hermes-Synthia
task dataset metrics source
type name
text-generation Text Generation
name type config split args
AI2 Reasoning Challenge (25-Shot) ai2_arc ARC-Challenge test
num_few_shot
25
type value name
acc_norm 51.02 normalized accuracy
url name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=LTC-AI-Labs/L2-7b-Hermes-Synthia Open LLM Leaderboard
task dataset metrics source
type name
text-generation Text Generation
name type split args
HellaSwag (10-Shot) hellaswag validation
num_few_shot
10
type value name
acc_norm 79.12 normalized accuracy
url name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=LTC-AI-Labs/L2-7b-Hermes-Synthia Open LLM Leaderboard
task dataset metrics source
type name
text-generation Text Generation
name type config split args
MMLU (5-Shot) cais/mmlu all test
num_few_shot
5
type value name
acc 47.88 accuracy
url name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=LTC-AI-Labs/L2-7b-Hermes-Synthia Open LLM Leaderboard
task dataset metrics source
type name
text-generation Text Generation
name type config split args
TruthfulQA (0-shot) truthful_qa multiple_choice validation
num_few_shot
0
type value
mc2 46.77
url name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=LTC-AI-Labs/L2-7b-Hermes-Synthia Open LLM Leaderboard
task dataset metrics source
type name
text-generation Text Generation
name type config split args
Winogrande (5-shot) winogrande winogrande_xl validation
num_few_shot
5
type value name
acc 74.51 accuracy
url name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=LTC-AI-Labs/L2-7b-Hermes-Synthia Open LLM Leaderboard
task dataset metrics source
type name
text-generation Text Generation
name type config split args
GSM8k (5-shot) gsm8k main test
num_few_shot
5
type value name
acc 13.95 accuracy
url name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=LTC-AI-Labs/L2-7b-Hermes-Synthia Open LLM Leaderboard

Fine-tuned the synthia dataset on the hermes2 7b model

In my opinion it's probably the best model I fine-tuned in-terms of role-playing (tested on LavernAI)

Future plans:

  • I'll probably do more test in other areas

  • Will add other languages (Potentially japanese and chinese)

  • Finetune it on mistral models?

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 52.21
AI2 Reasoning Challenge (25-Shot) 51.02
HellaSwag (10-Shot) 79.12
MMLU (5-Shot) 47.88
TruthfulQA (0-shot) 46.77
Winogrande (5-shot) 74.51
GSM8k (5-shot) 13.95
Description
Model synced from source: LTC-AI-Labs/L2-7b-Hermes-Synthia
Readme 583 KiB