language, license, tags, pipeline_tag, inference, model-index
| language |
license |
tags |
pipeline_tag |
inference |
model-index |
|
|
apache-2.0 |
| mistral |
| not-for-all-audiences |
| merge |
|
text-generation |
false |
| name |
results |
| DarkSapling-7B-v1.0 |
| task |
dataset |
metrics |
source |
| type |
name |
| text-generation |
Text Generation |
|
| name |
type |
config |
split |
args |
| AI2 Reasoning Challenge (25-Shot) |
ai2_arc |
ARC-Challenge |
test |
|
|
| type |
value |
name |
| acc_norm |
61.6 |
normalized accuracy |
|
|
|
|
| task |
dataset |
metrics |
source |
| type |
name |
| text-generation |
Text Generation |
|
| name |
type |
split |
args |
| HellaSwag (10-Shot) |
hellaswag |
validation |
|
|
| type |
value |
name |
| acc_norm |
82.59 |
normalized accuracy |
|
|
|
|
| task |
dataset |
metrics |
source |
| type |
name |
| text-generation |
Text Generation |
|
| name |
type |
config |
split |
args |
| MMLU (5-Shot) |
cais/mmlu |
all |
test |
|
|
| type |
value |
name |
| acc |
62.46 |
accuracy |
|
|
|
|
| task |
dataset |
metrics |
source |
| type |
name |
| text-generation |
Text Generation |
|
| name |
type |
config |
split |
args |
| TruthfulQA (0-shot) |
truthful_qa |
multiple_choice |
validation |
|
|
|
|
|
| task |
dataset |
metrics |
source |
| type |
name |
| text-generation |
Text Generation |
|
| name |
type |
config |
split |
args |
| Winogrande (5-shot) |
winogrande |
winogrande_xl |
validation |
|
|
| type |
value |
name |
| acc |
77.19 |
accuracy |
|
|
|
|
| task |
dataset |
metrics |
source |
| type |
name |
| text-generation |
Text Generation |
|
| name |
type |
config |
split |
args |
| GSM8k (5-shot) |
gsm8k |
main |
test |
|
|
| type |
value |
name |
| acc |
40.18 |
accuracy |
|
|
|
|
|
|
|
DarkSapling-7B-v1.0

Model Details
Warning: This model can produce NSFW content!
Results
- produces SFW nad NSFW content without issues, switches context seamlessly.
- sticks to character card
- pretty smart due to mistral, empathetic after Samantha and sometimes produces dark scenarions - Erebus.
- storytelling is satisfactory due to Holodeck
- good at following instructions
All comments are greatly appreciated, download, test and if you appreciate my work, consider buying me my fuel:

Detailed results can be found here
| Metric |
Value |
| Avg. |
61.52 |
| AI2 Reasoning Challenge (25-Shot) |
61.60 |
| HellaSwag (10-Shot) |
82.59 |
| MMLU (5-Shot) |
62.46 |
| TruthfulQA (0-shot) |
45.09 |
| Winogrande (5-shot) |
77.19 |
| GSM8k (5-shot) |
40.18 |