Go to file

ModelHub XC 32352afeac 初始化项目，由ModelHub XC社区提供模型

Model: LTC-AI-Labs/L2-7b-Hermes-Synthia
Source: Original Platform

2026-05-18 02:55:31 +08:00

.gitattributes

初始化项目，由ModelHub XC社区提供模型

2026-05-18 02:55:31 +08:00

added_tokens.json

初始化项目，由ModelHub XC社区提供模型

2026-05-18 02:55:31 +08:00

config.json

初始化项目，由ModelHub XC社区提供模型

2026-05-18 02:55:31 +08:00

generation_config.json

初始化项目，由ModelHub XC社区提供模型

2026-05-18 02:55:31 +08:00

model-00001-of-00002.safetensors

初始化项目，由ModelHub XC社区提供模型

2026-05-18 02:55:31 +08:00

model-00002-of-00002.safetensors

初始化项目，由ModelHub XC社区提供模型

2026-05-18 02:55:31 +08:00

model.safetensors.index.json

初始化项目，由ModelHub XC社区提供模型

2026-05-18 02:55:31 +08:00

pytorch_model-00001-of-00002.bin

初始化项目，由ModelHub XC社区提供模型

2026-05-18 02:55:31 +08:00

pytorch_model-00002-of-00002.bin

初始化项目，由ModelHub XC社区提供模型

2026-05-18 02:55:31 +08:00

pytorch_model.bin.index.json

初始化项目，由ModelHub XC社区提供模型

2026-05-18 02:55:31 +08:00

README.md

初始化项目，由ModelHub XC社区提供模型

2026-05-18 02:55:31 +08:00

special_tokens_map.json

初始化项目，由ModelHub XC社区提供模型

2026-05-18 02:55:31 +08:00

tokenizer_config.json

初始化项目，由ModelHub XC社区提供模型

2026-05-18 02:55:31 +08:00

tokenizer.json

初始化项目，由ModelHub XC社区提供模型

2026-05-18 02:55:31 +08:00

README.md

language, license, tags, datasets, pipeline_tag, model-index

language

license

tags

datasets

pipeline_tag

model-index

llama2

roleplay

conversational

migtissera/Synthia-v1.3

open-llm-leaderboard/details_NousResearch__Nous-Hermes-llama-2-7b

text-generation

name

results

L2-7b-Hermes-Synthia

task

dataset

metrics

source

type	name
text-generation	Text Generation

name

type

config

split

args

AI2 Reasoning Challenge (25-Shot)

ai2_arc

ARC-Challenge

test

num_few_shot
25

type	value	name
acc_norm	51.02	normalized accuracy

url	name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=LTC-AI-Labs/L2-7b-Hermes-Synthia	Open LLM Leaderboard

task

dataset

metrics

source

type	name
text-generation	Text Generation

name

type

split

args

HellaSwag (10-Shot)

hellaswag

validation

num_few_shot
10

type	value	name
acc_norm	79.12	normalized accuracy

url	name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=LTC-AI-Labs/L2-7b-Hermes-Synthia	Open LLM Leaderboard

task

dataset

metrics

source

type	name
text-generation	Text Generation

name

type

config

split

args

MMLU (5-Shot)

cais/mmlu

all

test

num_few_shot
5

type	value	name
acc	47.88	accuracy

url	name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=LTC-AI-Labs/L2-7b-Hermes-Synthia	Open LLM Leaderboard

task

dataset

metrics

source

type	name
text-generation	Text Generation

name

type

config

split

args

TruthfulQA (0-shot)

truthful_qa

multiple_choice

validation

num_few_shot
0

type	value
mc2	46.77

url	name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=LTC-AI-Labs/L2-7b-Hermes-Synthia	Open LLM Leaderboard

task

dataset

metrics

source

type	name
text-generation	Text Generation

name

type

config

split

args

Winogrande (5-shot)

winogrande

winogrande_xl

validation

num_few_shot
5

type	value	name
acc	74.51	accuracy

url	name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=LTC-AI-Labs/L2-7b-Hermes-Synthia	Open LLM Leaderboard

task

dataset

metrics

source

type	name
text-generation	Text Generation

name

type

config

split

args

GSM8k (5-shot)

gsm8k

main

test

num_few_shot
5

type	value	name
acc	13.95	accuracy

url	name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=LTC-AI-Labs/L2-7b-Hermes-Synthia	Open LLM Leaderboard

Fine-tuned the synthia dataset on the hermes2 7b model

In my opinion it's probably the best model I fine-tuned in-terms of role-playing (tested on LavernAI)

Future plans:

I'll probably do more test in other areas
Will add other languages (Potentially japanese and chinese)
Finetune it on mistral models?

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	52.21
AI2 Reasoning Challenge (25-Shot)	51.02
HellaSwag (10-Shot)	79.12
MMLU (5-Shot)	47.88
TruthfulQA (0-shot)	46.77
Winogrande (5-shot)	74.51
GSM8k (5-shot)	13.95