ModelHub XC 7a44a58fa3 Initialize project; model provided by the ModelHub XC community

Model: TeeZee/DarkSapling-7B-v1.1
Source: Original Platform

2026-05-03 22:07:02 +08:00

Files (each added in the same initial commit, 2026-05-03 22:07:02 +08:00):
.gitattributes
config.json
DarkSapling-7B-v1.1.jpg
mergekit-config.yml
model-00001-of-00002.safetensors
model-00002-of-00002.safetensors
model.safetensors.index.json
README.md
special_tokens_map.json
tokenizer_config.json
tokenizer.json
tokenizer.model

README.md

language, license, tags, pipeline_tag, inference, model-index

license: apache-2.0
tags: mistral, not-for-all-audiences, merge
pipeline_tag: text-generation
inference: false
model-index:
  name: DarkSapling-7B-v1.1
  results: one entry per benchmark, listed in the table below
  task (every entry): type text-generation, name Text Generation
  source (every entry): Open LLM Leaderboard, https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=TeeZee/DarkSapling-7B-v1.1

dataset name	dataset type	config	split	num_few_shot	metric type	metric name	value
AI2 Reasoning Challenge (25-Shot)	ai2_arc	ARC-Challenge	test	25	acc_norm	normalized accuracy	63.48
HellaSwag (10-Shot)	hellaswag	-	validation	10	acc_norm	normalized accuracy	85.09
MMLU (5-Shot)	cais/mmlu	all	test	5	acc	accuracy	64.47
TruthfulQA (0-shot)	truthful_qa	multiple_choice	validation	0	mc2	-	52.04
Winogrande (5-shot)	winogrande	winogrande_xl	validation	5	acc	accuracy	78.53
GSM8k (5-shot)	gsm8k	main	test	5	acc	accuracy	45.19
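Each benchmark entry in the metadata above has the same four parts: task, dataset, metrics, and source. As a minimal sketch of that layout, the ARC entry can be written as a Python dict and flattened into summary rows. The values are copied from the metadata; the helper name `summarize_result` is hypothetical and not part of any library.

```python
# One model-index result entry, with values copied from the metadata above.
arc_result = {
    "task": {"type": "text-generation", "name": "Text Generation"},
    "dataset": {
        "name": "AI2 Reasoning Challenge (25-Shot)",
        "type": "ai2_arc",
        "config": "ARC-Challenge",
        "split": "test",
        "args": {"num_few_shot": 25},
    },
    "metrics": [
        {"type": "acc_norm", "value": 63.48, "name": "normalized accuracy"},
    ],
    "source": {
        "url": "https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=TeeZee/DarkSapling-7B-v1.1",
        "name": "Open LLM Leaderboard",
    },
}


def summarize_result(result: dict) -> list[tuple[str, str, float]]:
    """Hypothetical helper: flatten one entry into (dataset, metric type, value) rows."""
    dataset_name = result["dataset"]["name"]
    return [(dataset_name, m["type"], m["value"]) for m in result["metrics"]]


for row in summarize_result(arc_result):
    print(row)
```

The other five entries would follow the same shape, differing only in the dataset fields and the metric type and value.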

DarkSapling-7B-v1.1

Model Details

The result of merging 4 models.
Models used for the merge:
cognitivecomputations/dolphin-2.6-mistral-7b-dpo-laser
KoboldAI/Mistral-7B-Holodeck-1
KoboldAI/Mistral-7B-Erebus-v3
cognitivecomputations/samantha-mistral-7b
See mergekit-config.yml for details on the merge method used.

Warning: This model can produce NSFW content!

Results

Slightly different from v1.0: more romantic and empathetic.
Best suited to one-on-one ERP.
Produces SFW and NSFW content without issues and switches context seamlessly.
Sticks to the character card.
Pretty smart thanks to Mistral, empathetic thanks to Samantha, and sometimes produces dark scenarios (Erebus).
Storytelling is satisfactory thanks to Holodeck.
Good at following instructions.

All comments are greatly appreciated. Download, test, and if you appreciate my work, consider buying me my fuel:

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	64.80
AI2 Reasoning Challenge (25-Shot)	63.48
HellaSwag (10-Shot)	85.09
MMLU (5-Shot)	64.47
TruthfulQA (0-shot)	52.04
Winogrande (5-shot)	78.53
GSM8k (5-shot)	45.19
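The Avg. row is consistent with an unweighted mean of the six benchmark scores. The short check below assumes that is how the average is computed; the scores are copied from the table above.

```python
# Benchmark scores copied from the table above.
scores = {
    "AI2 Reasoning Challenge (25-Shot)": 63.48,
    "HellaSwag (10-Shot)": 85.09,
    "MMLU (5-Shot)": 64.47,
    "TruthfulQA (0-shot)": 52.04,
    "Winogrande (5-shot)": 78.53,
    "GSM8k (5-shot)": 45.19,
}

# Assumption: Avg. is the plain arithmetic mean over all six benchmarks.
average = sum(scores.values()) / len(scores)
print(f"Avg. {average:.2f}")
```

Under that assumption the result rounds to the 64.80 reported in the table.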