初始化项目,由ModelHub XC社区提供模型
Model: FallenMerick/MN-Violet-Lotus-12B Source: Original Platform
This commit is contained in:
36
.gitattributes
vendored
Normal file
36
.gitattributes
vendored
Normal file
@@ -0,0 +1,36 @@
|
||||
*.7z filter=lfs diff=lfs merge=lfs -text
|
||||
*.arrow filter=lfs diff=lfs merge=lfs -text
|
||||
*.bin filter=lfs diff=lfs merge=lfs -text
|
||||
*.bz2 filter=lfs diff=lfs merge=lfs -text
|
||||
*.ckpt filter=lfs diff=lfs merge=lfs -text
|
||||
*.ftz filter=lfs diff=lfs merge=lfs -text
|
||||
*.gz filter=lfs diff=lfs merge=lfs -text
|
||||
*.h5 filter=lfs diff=lfs merge=lfs -text
|
||||
*.joblib filter=lfs diff=lfs merge=lfs -text
|
||||
*.lfs.* filter=lfs diff=lfs merge=lfs -text
|
||||
*.mlmodel filter=lfs diff=lfs merge=lfs -text
|
||||
*.model filter=lfs diff=lfs merge=lfs -text
|
||||
*.msgpack filter=lfs diff=lfs merge=lfs -text
|
||||
*.npy filter=lfs diff=lfs merge=lfs -text
|
||||
*.npz filter=lfs diff=lfs merge=lfs -text
|
||||
*.onnx filter=lfs diff=lfs merge=lfs -text
|
||||
*.ot filter=lfs diff=lfs merge=lfs -text
|
||||
*.parquet filter=lfs diff=lfs merge=lfs -text
|
||||
*.pb filter=lfs diff=lfs merge=lfs -text
|
||||
*.pickle filter=lfs diff=lfs merge=lfs -text
|
||||
*.pkl filter=lfs diff=lfs merge=lfs -text
|
||||
*.pt filter=lfs diff=lfs merge=lfs -text
|
||||
*.pth filter=lfs diff=lfs merge=lfs -text
|
||||
*.rar filter=lfs diff=lfs merge=lfs -text
|
||||
*.safetensors filter=lfs diff=lfs merge=lfs -text
|
||||
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
||||
*.tar.* filter=lfs diff=lfs merge=lfs -text
|
||||
*.tar filter=lfs diff=lfs merge=lfs -text
|
||||
*.tflite filter=lfs diff=lfs merge=lfs -text
|
||||
*.tgz filter=lfs diff=lfs merge=lfs -text
|
||||
*.wasm filter=lfs diff=lfs merge=lfs -text
|
||||
*.xz filter=lfs diff=lfs merge=lfs -text
|
||||
*.zip filter=lfs diff=lfs merge=lfs -text
|
||||
*.zst filter=lfs diff=lfs merge=lfs -text
|
||||
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
||||
tokenizer.json filter=lfs diff=lfs merge=lfs -text
|
||||
81
README.md
Normal file
81
README.md
Normal file
@@ -0,0 +1,81 @@
|
||||
---
|
||||
license: cc-by-4.0
|
||||
language:
|
||||
- en
|
||||
base_model:
|
||||
- mistralai/Mistral-Nemo-Instruct-2407
|
||||
- Epiculous/Violet_Twilight-v0.2
|
||||
- NeverSleep/Lumimaid-v0.2-12B
|
||||
- flammenai/Mahou-1.5-mistral-nemo-12B
|
||||
- Sao10K/MN-12B-Lyra-v4
|
||||
library_name: transformers
|
||||
tags:
|
||||
- storywriting
|
||||
- text adventure
|
||||
- creative
|
||||
- story
|
||||
- writing
|
||||
- fiction
|
||||
- roleplaying
|
||||
- rp
|
||||
- mergekit
|
||||
- merge
|
||||
|
||||
---
|
||||
|
||||

|
||||
|
||||
# MN-Violet-Lotus-12B
|
||||
|
||||
This is the model I was trying to create when [Chunky-Lotus](https://huggingface.co/FallenMerick/MN-Chunky-Lotus-12B) emerged. Not only does this model score higher on my local EQ benchmarks (80.00 w/ 100% parsed @ 8-bit), but it does an even better job at roleplaying and producing creative outputs while still adhering to wide ranges of character personalities. The high levels of emotional intelligence are really quite noticeable as well.
|
||||
|
||||

|
||||
|
||||
Once again, models tend to score higher on my local tests when compared to their posted scores, but this has become the new high score for models I've personally tested.
|
||||
|
||||
I really like the way this model writes, and I hope you'll enjoy using it as well!
|
||||
|
||||
GGUF Quants:
|
||||
* https://huggingface.co/backyardai/MN-Violet-Lotus-12B-GGUF
|
||||
* https://huggingface.co/mradermacher/MN-Violet-Lotus-12B-GGUF
|
||||
* https://huggingface.co/mradermacher/MN-Violet-Lotus-12B-i1-GGUF
|
||||
|
||||
## Recommended ST Settings
|
||||
|
||||
Special thanks to [@Zeldazachman](https://huggingface.co/Zeldazackman) for these amazing ST settings that I now wholeheartedly recommend!
|
||||
|
||||

|
||||
|
||||
## Merge Details
|
||||
|
||||
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|
||||
|
||||
### Merge Method
|
||||
|
||||
This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method.
|
||||
|
||||
### Models Merged
|
||||
|
||||
The following models were included in the merge:
|
||||
* [Epiculous/Violet_Twilight-v0.2](https://huggingface.co/Epiculous/Violet_Twilight-v0.2)
|
||||
* [NeverSleep/Lumimaid-v0.2-12B](https://huggingface.co/NeverSleep/Lumimaid-v0.2-12B)
|
||||
* [flammenai/Mahou-1.5-mistral-nemo-12B](https://huggingface.co/flammenai/Mahou-1.5-mistral-nemo-12B)
|
||||
* [Sao10K/MN-12B-Lyra-v4](https://huggingface.co/Sao10K/MN-12B-Lyra-v4)
|
||||
|
||||
### Configuration
|
||||
|
||||
The following YAML configuration was used to produce this model:
|
||||
|
||||
```yaml
|
||||
models:
|
||||
- model: FallenMerick/MN-Twilight-Maid-SLERP-12B #(unreleased)
|
||||
- model: Sao10K/MN-12B-Lyra-v4
|
||||
- model: flammenai/Mahou-1.5-mistral-nemo-12B
|
||||
merge_method: model_stock
|
||||
base_model: mistralai/Mistral-Nemo-Instruct-2407
|
||||
parameters:
|
||||
normalize: true
|
||||
dtype: bfloat16
|
||||
```
|
||||
|
||||
In this recipe, Violet Twilight and Lumimaid were first blended using the SLERP method to create a strong roleplaying foundation. Lyra v4 is then added to the mix for its great creativity and roleplaying performance, along with Mahou to once again curtail the outputs and prevent the resulting model from becoming too wordy. Model Stock was used for the final merge in order to really push the resulting weights in the proper direction while using Nemo Instruct as a strong anchor point.
|
||||
BIN
ST-Preset.png
Normal file
BIN
ST-Preset.png
Normal file
Binary file not shown.
|
After Width: | Height: | Size: 80 KiB |
27
config.json
Normal file
27
config.json
Normal file
@@ -0,0 +1,27 @@
|
||||
{
|
||||
"_name_or_path": "FallenMerick/MN-Violet-Lotus-12B",
|
||||
"architectures": [
|
||||
"MistralForCausalLM"
|
||||
],
|
||||
"attention_dropout": 0.0,
|
||||
"bos_token_id": 1,
|
||||
"eos_token_id": 2,
|
||||
"head_dim": 128,
|
||||
"hidden_act": "silu",
|
||||
"hidden_size": 5120,
|
||||
"initializer_range": 0.02,
|
||||
"intermediate_size": 14336,
|
||||
"max_position_embeddings": 131072,
|
||||
"model_type": "mistral",
|
||||
"num_attention_heads": 32,
|
||||
"num_hidden_layers": 40,
|
||||
"num_key_value_heads": 8,
|
||||
"rms_norm_eps": 1e-05,
|
||||
"rope_theta": 1000000.0,
|
||||
"sliding_window": null,
|
||||
"tie_word_embeddings": false,
|
||||
"torch_dtype": "bfloat16",
|
||||
"transformers_version": "4.46.0",
|
||||
"use_cache": true,
|
||||
"vocab_size": 131072
|
||||
}
|
||||
9
mergekit_config.yml
Normal file
9
mergekit_config.yml
Normal file
@@ -0,0 +1,9 @@
|
||||
models:
|
||||
- model: FallenMerick/MN-Twilight-Maid-SLERP-12B
|
||||
- model: Sao10K/MN-12B-Lyra-v4
|
||||
- model: flammenai/Mahou-1.5-mistral-nemo-12B
|
||||
merge_method: model_stock
|
||||
base_model: mistralai/Mistral-Nemo-Instruct-2407
|
||||
parameters:
|
||||
normalize: true
|
||||
dtype: bfloat16
|
||||
3
model-00001-of-00005.safetensors
Normal file
3
model-00001-of-00005.safetensors
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:d4cf1c3f2cc2815435738de21667b8b485660faaa2f922137e542e160ab918ce
|
||||
size 4865489336
|
||||
3
model-00002-of-00005.safetensors
Normal file
3
model-00002-of-00005.safetensors
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:0d8bcb0f48d7a89a20116e6408e502fa4ea65f30d9a746c20d6f72b694edea04
|
||||
size 4907529456
|
||||
3
model-00003-of-00005.safetensors
Normal file
3
model-00003-of-00005.safetensors
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:03ba65bf47833fa67e6f30f4e2f7bc2da0578a72170dba2fdb52fa9db59eef5b
|
||||
size 4907529464
|
||||
3
model-00004-of-00005.safetensors
Normal file
3
model-00004-of-00005.safetensors
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:8c4f949e3764d8531c284070fefa192ba95f7968bace9e7231bfb95ed5641090
|
||||
size 4907529456
|
||||
3
model-00005-of-00005.safetensors
Normal file
3
model-00005-of-00005.safetensors
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:3354dc6982ce70bcc8d9e23bacd8ce78cb1db38c75afbbff0534495c6c0975ed
|
||||
size 4907529392
|
||||
1
model.safetensors.index.json
Normal file
1
model.safetensors.index.json
Normal file
File diff suppressed because one or more lines are too long
23
special_tokens_map.json
Normal file
23
special_tokens_map.json
Normal file
@@ -0,0 +1,23 @@
|
||||
{
|
||||
"bos_token": {
|
||||
"content": "<s>",
|
||||
"lstrip": false,
|
||||
"normalized": false,
|
||||
"rstrip": false,
|
||||
"single_word": false
|
||||
},
|
||||
"eos_token": {
|
||||
"content": "</s>",
|
||||
"lstrip": false,
|
||||
"normalized": false,
|
||||
"rstrip": false,
|
||||
"single_word": false
|
||||
},
|
||||
"unk_token": {
|
||||
"content": "<unk>",
|
||||
"lstrip": false,
|
||||
"normalized": false,
|
||||
"rstrip": false,
|
||||
"single_word": false
|
||||
}
|
||||
}
|
||||
3
tokenizer.json
Normal file
3
tokenizer.json
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:b0240ce510f08e6c2041724e9043e33be9d251d1e4a4d94eb68cd47b954b61d2
|
||||
size 17078292
|
||||
8018
tokenizer_config.json
Normal file
8018
tokenizer_config.json
Normal file
File diff suppressed because it is too large
Load Diff
BIN
violet-lotus.jpg
Normal file
BIN
violet-lotus.jpg
Normal file
Binary file not shown.
|
After Width: | Height: | Size: 871 KiB |
Reference in New Issue
Block a user