初始化项目,由ModelHub XC社区提供模型
Model: NeverSleep/Llama-3-Lumimaid-8B-v0.1 Source: Original Platform
This commit is contained in:
35
.gitattributes
vendored
Normal file
35
.gitattributes
vendored
Normal file
@@ -0,0 +1,35 @@
|
||||
*.7z filter=lfs diff=lfs merge=lfs -text
|
||||
*.arrow filter=lfs diff=lfs merge=lfs -text
|
||||
*.bin filter=lfs diff=lfs merge=lfs -text
|
||||
*.bz2 filter=lfs diff=lfs merge=lfs -text
|
||||
*.ckpt filter=lfs diff=lfs merge=lfs -text
|
||||
*.ftz filter=lfs diff=lfs merge=lfs -text
|
||||
*.gz filter=lfs diff=lfs merge=lfs -text
|
||||
*.h5 filter=lfs diff=lfs merge=lfs -text
|
||||
*.joblib filter=lfs diff=lfs merge=lfs -text
|
||||
*.lfs.* filter=lfs diff=lfs merge=lfs -text
|
||||
*.mlmodel filter=lfs diff=lfs merge=lfs -text
|
||||
*.model filter=lfs diff=lfs merge=lfs -text
|
||||
*.msgpack filter=lfs diff=lfs merge=lfs -text
|
||||
*.npy filter=lfs diff=lfs merge=lfs -text
|
||||
*.npz filter=lfs diff=lfs merge=lfs -text
|
||||
*.onnx filter=lfs diff=lfs merge=lfs -text
|
||||
*.ot filter=lfs diff=lfs merge=lfs -text
|
||||
*.parquet filter=lfs diff=lfs merge=lfs -text
|
||||
*.pb filter=lfs diff=lfs merge=lfs -text
|
||||
*.pickle filter=lfs diff=lfs merge=lfs -text
|
||||
*.pkl filter=lfs diff=lfs merge=lfs -text
|
||||
*.pt filter=lfs diff=lfs merge=lfs -text
|
||||
*.pth filter=lfs diff=lfs merge=lfs -text
|
||||
*.rar filter=lfs diff=lfs merge=lfs -text
|
||||
*.safetensors filter=lfs diff=lfs merge=lfs -text
|
||||
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
||||
*.tar.* filter=lfs diff=lfs merge=lfs -text
|
||||
*.tar filter=lfs diff=lfs merge=lfs -text
|
||||
*.tflite filter=lfs diff=lfs merge=lfs -text
|
||||
*.tgz filter=lfs diff=lfs merge=lfs -text
|
||||
*.wasm filter=lfs diff=lfs merge=lfs -text
|
||||
*.xz filter=lfs diff=lfs merge=lfs -text
|
||||
*.zip filter=lfs diff=lfs merge=lfs -text
|
||||
*.zst filter=lfs diff=lfs merge=lfs -text
|
||||
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
||||
71
README.md
Normal file
71
README.md
Normal file
@@ -0,0 +1,71 @@
|
||||
---
|
||||
license: cc-by-nc-4.0
|
||||
tags:
|
||||
- not-for-all-audiences
|
||||
- nsfw
|
||||
---
|
||||
|
||||
## Lumimaid 0.1
|
||||
|
||||
<center><div style="width: 100%;">
|
||||
<img src="https://cdn-uploads.huggingface.co/production/uploads/630dfb008df86f1e5becadc3/d3QMaxy3peFTpSlWdWF-k.png" style="display: block; margin: auto;">
|
||||
</div></center>
|
||||
|
||||
This model uses the Llama3 **prompting format**
|
||||
|
||||
Llama3 trained on our RP datasets, we tried to have a balance between the ERP and the RP, not too horny, but just enough.
|
||||
|
||||
We also added some non-RP dataset, making the model less dumb overall. It should look like a 40%/60% ratio for Non-RP/RP+ERP data.
|
||||
|
||||
This model includes the new Luminae dataset from Ikari.
|
||||
|
||||
|
||||
If you consider trying this model please give us some feedback either on the Community tab on hf or on our [Discord Server](https://discord.gg/MtCVRWTZXY).
|
||||
|
||||
## Credits:
|
||||
- Undi
|
||||
- IkariDev
|
||||
|
||||
## Description
|
||||
|
||||
This repo contains FP16 files of Lumimaid-8B-v0.1.
|
||||
|
||||
Switch: [8B](https://huggingface.co/NeverSleep/Llama-3-Lumimaid-8B-v0.1) - [70B](https://huggingface.co/NeverSleep/Llama-3-Lumimaid-70B-v0.1) - [70B-alt](https://huggingface.co/NeverSleep/Llama-3-Lumimaid-70B-v0.1-alt) - [8B-OAS](https://huggingface.co/NeverSleep/Llama-3-Lumimaid-8B-v0.1-OAS) - [70B-OAS](https://huggingface.co/NeverSleep/Llama-3-Lumimaid-70B-v0.1-OAS)
|
||||
|
||||
## Training data used:
|
||||
- [Aesir datasets](https://huggingface.co/MinervaAI)
|
||||
- [NoRobots](https://huggingface.co/datasets/Doctor-Shotgun/no-robots-sharegpt)
|
||||
- [limarp](https://huggingface.co/datasets/lemonilia/LimaRP) - 8k ctx
|
||||
- [toxic-dpo-v0.1-sharegpt](https://huggingface.co/datasets/Undi95/toxic-dpo-v0.1-sharegpt)
|
||||
- [ToxicQAFinal](https://huggingface.co/datasets/NobodyExistsOnTheInternet/ToxicQAFinal)
|
||||
- Luminae-i1 (70B/70B-alt) (i2 was not existing when the 70b started training) | Luminae-i2 (8B) (this one gave better results on the 8b) - Ikari's Dataset
|
||||
- [Squish42/bluemoon-fandom-1-1-rp-cleaned](https://huggingface.co/datasets/Squish42/bluemoon-fandom-1-1-rp-cleaned) - 50% (randomly)
|
||||
- [NobodyExistsOnTheInternet/PIPPAsharegptv2test](https://huggingface.co/datasets/NobodyExistsOnTheInternet/PIPPAsharegptv2test) - 5% (randomly)
|
||||
- [cgato/SlimOrcaDedupCleaned](https://huggingface.co/datasets/cgato/SlimOrcaDedupCleaned) - 5% (randomly)
|
||||
- Airoboros (reduced)
|
||||
- [Capybara](https://huggingface.co/datasets/Undi95/Capybara-ShareGPT/) (reduced)
|
||||
|
||||
|
||||
## Models used (only for 8B)
|
||||
|
||||
- Initial LumiMaid 8B Finetune
|
||||
- Undi95/Llama-3-Unholy-8B-e4
|
||||
- Undi95/Llama-3-LewdPlay-8B
|
||||
|
||||
## Prompt template: Llama3
|
||||
|
||||
```
|
||||
<|begin_of_text|><|start_header_id|>system<|end_header_id|>
|
||||
|
||||
{system_prompt}<|eot_id|><|start_header_id|>user<|end_header_id|>
|
||||
|
||||
{input}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
|
||||
|
||||
{output}<|eot_id|>
|
||||
```
|
||||
|
||||
## Others
|
||||
|
||||
Undi: If you want to support us, you can [here](https://ko-fi.com/undiai).
|
||||
|
||||
IkariDev: Visit my [retro/neocities style website](https://ikaridevgit.github.io/) please kek
|
||||
28
config.json
Normal file
28
config.json
Normal file
@@ -0,0 +1,28 @@
|
||||
{
|
||||
"_name_or_path": "./result/input_models/Roleplay-Llama-3-8B_213413727",
|
||||
"architectures": [
|
||||
"LlamaForCausalLM"
|
||||
],
|
||||
"attention_bias": false,
|
||||
"attention_dropout": 0.0,
|
||||
"bos_token_id": 128000,
|
||||
"eos_token_id": 128009,
|
||||
"hidden_act": "silu",
|
||||
"hidden_size": 4096,
|
||||
"initializer_range": 0.02,
|
||||
"intermediate_size": 14336,
|
||||
"max_position_embeddings": 8192,
|
||||
"model_type": "llama",
|
||||
"num_attention_heads": 32,
|
||||
"num_hidden_layers": 32,
|
||||
"num_key_value_heads": 8,
|
||||
"pretraining_tp": 1,
|
||||
"rms_norm_eps": 1e-05,
|
||||
"rope_scaling": null,
|
||||
"rope_theta": 500000.0,
|
||||
"tie_word_embeddings": false,
|
||||
"torch_dtype": "bfloat16",
|
||||
"transformers_version": "4.40.1",
|
||||
"use_cache": true,
|
||||
"vocab_size": 128256
|
||||
}
|
||||
135
mergekit_config.yml
Normal file
135
mergekit_config.yml
Normal file
@@ -0,0 +1,135 @@
|
||||
base_model: ./result/input_models/Roleplay-Llama-3-8B_213413727
|
||||
dtype: bfloat16
|
||||
merge_method: dare_ties
|
||||
parameters:
|
||||
int8_mask: 1.0
|
||||
normalize: 0.0
|
||||
slices:
|
||||
- sources:
|
||||
- layer_range: [0, 4]
|
||||
model: ./result/input_models/Llama-3-Lumimaid-8B-e1_2058152591
|
||||
parameters:
|
||||
density: 0.9061440388199886
|
||||
weight: 0.7420827290507876
|
||||
- layer_range: [0, 4]
|
||||
model: ./result/input_models/Llama-3-Unholy-8B-e4_1440388923
|
||||
parameters:
|
||||
density: 0.8343357824656759
|
||||
weight: 0.5634171099678891
|
||||
- layer_range: [0, 4]
|
||||
model: ./result/input_models/Roleplay-Llama-3-8B_213413727
|
||||
parameters:
|
||||
density: 1.0
|
||||
weight: 0.03808449036687045
|
||||
- sources:
|
||||
- layer_range: [4, 8]
|
||||
model: ./result/input_models/Llama-3-Lumimaid-8B-e1_2058152591
|
||||
parameters:
|
||||
density: 1.0
|
||||
weight: 0.040706182952752565
|
||||
- layer_range: [4, 8]
|
||||
model: ./result/input_models/Llama-3-Unholy-8B-e4_1440388923
|
||||
parameters:
|
||||
density: 1.0
|
||||
weight: 0.5235663919709214
|
||||
- layer_range: [4, 8]
|
||||
model: ./result/input_models/Roleplay-Llama-3-8B_213413727
|
||||
parameters:
|
||||
density: 0.6753137462586175
|
||||
weight: 0.1718739352284447
|
||||
- sources:
|
||||
- layer_range: [8, 12]
|
||||
model: ./result/input_models/Llama-3-Lumimaid-8B-e1_2058152591
|
||||
parameters:
|
||||
density: 0.8144143226543775
|
||||
weight: 0.2916571301845346
|
||||
- layer_range: [8, 12]
|
||||
model: ./result/input_models/Llama-3-Unholy-8B-e4_1440388923
|
||||
parameters:
|
||||
density: 0.5944343021653459
|
||||
weight: 0.6289130590047136
|
||||
- layer_range: [8, 12]
|
||||
model: ./result/input_models/Roleplay-Llama-3-8B_213413727
|
||||
parameters:
|
||||
density: 0.9096807190417433
|
||||
weight: 0.18225448981675693
|
||||
- sources:
|
||||
- layer_range: [12, 16]
|
||||
model: ./result/input_models/Llama-3-Lumimaid-8B-e1_2058152591
|
||||
parameters:
|
||||
density: 1.0
|
||||
weight: 0.31346575871103577
|
||||
- layer_range: [12, 16]
|
||||
model: ./result/input_models/Llama-3-Unholy-8B-e4_1440388923
|
||||
parameters:
|
||||
density: 1.0
|
||||
weight: 0.6710513199806648
|
||||
- layer_range: [12, 16]
|
||||
model: ./result/input_models/Roleplay-Llama-3-8B_213413727
|
||||
parameters:
|
||||
density: 1.0
|
||||
weight: 0.2620098997126852
|
||||
- sources:
|
||||
- layer_range: [16, 20]
|
||||
model: ./result/input_models/Llama-3-Lumimaid-8B-e1_2058152591
|
||||
parameters:
|
||||
density: 0.7957908643933549
|
||||
weight: 0.4065602812739591
|
||||
- layer_range: [16, 20]
|
||||
model: ./result/input_models/Llama-3-Unholy-8B-e4_1440388923
|
||||
parameters:
|
||||
density: 1.0
|
||||
weight: 0.3833004954478314
|
||||
- layer_range: [16, 20]
|
||||
model: ./result/input_models/Roleplay-Llama-3-8B_213413727
|
||||
parameters:
|
||||
density: 1.0
|
||||
weight: 0.3722661530618318
|
||||
- sources:
|
||||
- layer_range: [20, 24]
|
||||
model: ./result/input_models/Llama-3-Lumimaid-8B-e1_2058152591
|
||||
parameters:
|
||||
density: 0.8820161972577153
|
||||
weight: 0.31407655218805003
|
||||
- layer_range: [20, 24]
|
||||
model: ./result/input_models/Llama-3-Unholy-8B-e4_1440388923
|
||||
parameters:
|
||||
density: 0.871522940513238
|
||||
weight: 0.09916802739443117
|
||||
- layer_range: [20, 24]
|
||||
model: ./result/input_models/Roleplay-Llama-3-8B_213413727
|
||||
parameters:
|
||||
density: 0.843576104996367
|
||||
weight: 0.48592770058071444
|
||||
- sources:
|
||||
- layer_range: [24, 28]
|
||||
model: ./result/input_models/Llama-3-Lumimaid-8B-e1_2058152591
|
||||
parameters:
|
||||
density: 0.8818663379010269
|
||||
weight: 0.4128563619116445
|
||||
- layer_range: [24, 28]
|
||||
model: ./result/input_models/Llama-3-Unholy-8B-e4_1440388923
|
||||
parameters:
|
||||
density: 0.89467562267532
|
||||
weight: 0.39209478410830645
|
||||
- layer_range: [24, 28]
|
||||
model: ./result/input_models/Roleplay-Llama-3-8B_213413727
|
||||
parameters:
|
||||
density: 1.0
|
||||
weight: 0.20302426278165847
|
||||
- sources:
|
||||
- layer_range: [28, 32]
|
||||
model: ./result/input_models/Llama-3-Lumimaid-8B-e1_2058152591
|
||||
parameters:
|
||||
density: 0.8679751557926477
|
||||
weight: 0.5226676522508309
|
||||
- layer_range: [28, 32]
|
||||
model: ./result/input_models/Llama-3-Unholy-8B-e4_1440388923
|
||||
parameters:
|
||||
density: 0.9145274983719552
|
||||
weight: 0.4103390562947599
|
||||
- layer_range: [28, 32]
|
||||
model: ./result/input_models/Roleplay-Llama-3-8B_213413727
|
||||
parameters:
|
||||
density: 0.7116071161471552
|
||||
weight: 0.5557266216543452
|
||||
3
model-00001-of-00002.safetensors
Normal file
3
model-00001-of-00002.safetensors
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:714c4800e60486b7ff76e2ace03f69c06e4e0b80f74d997e0051605c49681c66
|
||||
size 9953405736
|
||||
3
model-00002-of-00002.safetensors
Normal file
3
model-00002-of-00002.safetensors
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:c2ddbca3ada965777653640be6493efa38f616a75f876212099d242abfae31d7
|
||||
size 6107150624
|
||||
1
model.safetensors.index.json
Normal file
1
model.safetensors.index.json
Normal file
File diff suppressed because one or more lines are too long
16
special_tokens_map.json
Normal file
16
special_tokens_map.json
Normal file
@@ -0,0 +1,16 @@
|
||||
{
|
||||
"bos_token": {
|
||||
"content": "<|begin_of_text|>",
|
||||
"lstrip": false,
|
||||
"normalized": false,
|
||||
"rstrip": false,
|
||||
"single_word": false
|
||||
},
|
||||
"eos_token": {
|
||||
"content": "<|end_of_text|>",
|
||||
"lstrip": false,
|
||||
"normalized": false,
|
||||
"rstrip": false,
|
||||
"single_word": false
|
||||
}
|
||||
}
|
||||
410504
tokenizer.json
Normal file
410504
tokenizer.json
Normal file
File diff suppressed because it is too large
Load Diff
2062
tokenizer_config.json
Normal file
2062
tokenizer_config.json
Normal file
File diff suppressed because it is too large
Load Diff
Reference in New Issue
Block a user