Files

ModelHub XC 14c05e1d6d 初始化项目，由ModelHub XC社区提供模型

Model: TeeZee/DarkSapling-7B-v2.0
Source: Original Platform

2026-05-26 01:10:17 +08:00

5.0 KiB

Raw Permalink Blame History

language, license, tags, pipeline_tag, inference, model-index

language

license

tags

pipeline_tag

inference

model-index

apache-2.0

mistral

not-for-all-audiences

merge

text-generation

false

name

results

DarkSapling-7B-v2.0

task

dataset

metrics

source

type	name
text-generation	Text Generation

name

type

config

split

args

AI2 Reasoning Challenge (25-Shot)

ai2_arc

ARC-Challenge

test

num_few_shot
25

type	value	name
acc_norm	64.16	normalized accuracy

url	name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=TeeZee/DarkSapling-7B-v2.0	Open LLM Leaderboard

task

dataset

metrics

source

type	name
text-generation	Text Generation

name

type

split

args

HellaSwag (10-Shot)

hellaswag

validation

num_few_shot
10

type	value	name
acc_norm	85.1	normalized accuracy

url	name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=TeeZee/DarkSapling-7B-v2.0	Open LLM Leaderboard

task

dataset

metrics

source

type	name
text-generation	Text Generation

name

type

config

split

args

MMLU (5-Shot)

cais/mmlu

all

test

num_few_shot
5

type	value	name
acc	64.37	accuracy

url	name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=TeeZee/DarkSapling-7B-v2.0	Open LLM Leaderboard

task

dataset

metrics

source

type	name
text-generation	Text Generation

name

type

config

split

args

TruthfulQA (0-shot)

truthful_qa

multiple_choice

validation

num_few_shot
0

type	value
mc2	52.21

url	name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=TeeZee/DarkSapling-7B-v2.0	Open LLM Leaderboard

task

dataset

metrics

source

type	name
text-generation	Text Generation

name

type

config

split

args

Winogrande (5-shot)

winogrande

winogrande_xl

validation

num_few_shot
5

type	value	name
acc	78.61	accuracy

url	name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=TeeZee/DarkSapling-7B-v2.0	Open LLM Leaderboard

task

dataset

metrics

source

type	name
text-generation	Text Generation

name

type

config

split

args

GSM8k (5-shot)

gsm8k

main

test

num_few_shot
5

type	value	name
acc	45.41	accuracy

url	name
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=TeeZee/DarkSapling-7B-v2.0	Open LLM Leaderboard

DarkSapling-7B-v2.0

Model Details

A result of 4 models merge. DARE TIES method was used this time so resulting model better preserves characteristics of all included models than v1.x.
models used for merge: cognitivecomputations/dolphin-2.6-mistral-7b-dpo-laser KoboldAI/Mistral-7B-Holodeck-1 KoboldAI/Mistral-7B-Erebus-v3 cognitivecomputations/samantha-mistral-7b
See mergekit-config.yml for details on the merge method used.

Warning: This model can produce NSFW content!

Results

a little different than version v1.0, more romantic and empathetic.
smarter than versions 1.0 and 1.1.
best for one-on-one ERP.
produces SFW nad NSFW content without issues, switches context seamlessly.
sticks to character card
pretty smart due to mistral, empathetic after Samantha and sometimes produces dark scenarions - Erebus.
storytelling is satisfactory due to Holodeck
good at following instructions

All comments are greatly appreciated, download, test and if you appreciate my work, consider buying me my fuel:

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	64.98
AI2 Reasoning Challenge (25-Shot)	64.16
HellaSwag (10-Shot)	85.10
MMLU (5-Shot)	64.37
TruthfulQA (0-shot)	52.21
Winogrande (5-shot)	78.61
GSM8k (5-shot)	45.41

5.0 KiB Raw Permalink Blame History

DarkSapling-7B-v2.0

Model Details

Results

Open LLM Leaderboard Evaluation Results

5.0 KiB

Raw Permalink Blame History