language, license, tags, pipeline_tag, inference, model-index
| language |
license |
tags |
pipeline_tag |
inference |
model-index |
|
|
apache-2.0 |
| mistral |
| not-for-all-audiences |
| merge |
|
text-generation |
false |
| name |
results |
| DarkSapling-7B-v2.0 |
| task |
dataset |
metrics |
source |
| type |
name |
| text-generation |
Text Generation |
|
| name |
type |
config |
split |
args |
| AI2 Reasoning Challenge (25-Shot) |
ai2_arc |
ARC-Challenge |
test |
|
|
| type |
value |
name |
| acc_norm |
64.16 |
normalized accuracy |
|
|
|
|
| task |
dataset |
metrics |
source |
| type |
name |
| text-generation |
Text Generation |
|
| name |
type |
split |
args |
| HellaSwag (10-Shot) |
hellaswag |
validation |
|
|
| type |
value |
name |
| acc_norm |
85.1 |
normalized accuracy |
|
|
|
|
| task |
dataset |
metrics |
source |
| type |
name |
| text-generation |
Text Generation |
|
| name |
type |
config |
split |
args |
| MMLU (5-Shot) |
cais/mmlu |
all |
test |
|
|
| type |
value |
name |
| acc |
64.37 |
accuracy |
|
|
|
|
| task |
dataset |
metrics |
source |
| type |
name |
| text-generation |
Text Generation |
|
| name |
type |
config |
split |
args |
| TruthfulQA (0-shot) |
truthful_qa |
multiple_choice |
validation |
|
|
|
|
|
| task |
dataset |
metrics |
source |
| type |
name |
| text-generation |
Text Generation |
|
| name |
type |
config |
split |
args |
| Winogrande (5-shot) |
winogrande |
winogrande_xl |
validation |
|
|
| type |
value |
name |
| acc |
78.61 |
accuracy |
|
|
|
|
| task |
dataset |
metrics |
source |
| type |
name |
| text-generation |
Text Generation |
|
| name |
type |
config |
split |
args |
| GSM8k (5-shot) |
gsm8k |
main |
test |
|
|
| type |
value |
name |
| acc |
45.41 |
accuracy |
|
|
|
|
|
|
|
DarkSapling-7B-v2.0

Model Details
Warning: This model can produce NSFW content!
Results
- a little different than version v1.0, more romantic and empathetic.
- smarter than versions 1.0 and 1.1.
- best for one-on-one ERP.
- produces SFW nad NSFW content without issues, switches context seamlessly.
- sticks to character card
- pretty smart due to mistral, empathetic after Samantha and sometimes produces dark scenarions - Erebus.
- storytelling is satisfactory due to Holodeck
- good at following instructions
All comments are greatly appreciated, download, test and if you appreciate my work, consider buying me my fuel:

Detailed results can be found here
| Metric |
Value |
| Avg. |
64.98 |
| AI2 Reasoning Challenge (25-Shot) |
64.16 |
| HellaSwag (10-Shot) |
85.10 |
| MMLU (5-Shot) |
64.37 |
| TruthfulQA (0-shot) |
52.21 |
| Winogrande (5-shot) |
78.61 |
| GSM8k (5-shot) |
45.41 |