初始化项目,由ModelHub XC社区提供模型

Model: NeverSleep/Llama-3-Lumimaid-8B-v0.1
Source: Original Platform
This commit is contained in:
ModelHub XC
2026-05-18 16:46:33 +08:00
commit e7acd79957
10 changed files with 412858 additions and 0 deletions

35
.gitattributes vendored Normal file
View File

@@ -0,0 +1,35 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text

71
README.md Normal file
View File

@@ -0,0 +1,71 @@
---
license: cc-by-nc-4.0
tags:
- not-for-all-audiences
- nsfw
---
## Lumimaid 0.1
<center><div style="width: 100%;">
<img src="https://cdn-uploads.huggingface.co/production/uploads/630dfb008df86f1e5becadc3/d3QMaxy3peFTpSlWdWF-k.png" style="display: block; margin: auto;">
</div></center>
This model uses the Llama3 **prompting format**
Llama3 trained on our RP datasets, we tried to have a balance between the ERP and the RP, not too horny, but just enough.
We also added some non-RP dataset, making the model less dumb overall. It should look like a 40%/60% ratio for Non-RP/RP+ERP data.
This model includes the new Luminae dataset from Ikari.
If you consider trying this model please give us some feedback either on the Community tab on hf or on our [Discord Server](https://discord.gg/MtCVRWTZXY).
## Credits:
- Undi
- IkariDev
## Description
This repo contains FP16 files of Lumimaid-8B-v0.1.
Switch: [8B](https://huggingface.co/NeverSleep/Llama-3-Lumimaid-8B-v0.1) - [70B](https://huggingface.co/NeverSleep/Llama-3-Lumimaid-70B-v0.1) - [70B-alt](https://huggingface.co/NeverSleep/Llama-3-Lumimaid-70B-v0.1-alt) - [8B-OAS](https://huggingface.co/NeverSleep/Llama-3-Lumimaid-8B-v0.1-OAS) - [70B-OAS](https://huggingface.co/NeverSleep/Llama-3-Lumimaid-70B-v0.1-OAS)
## Training data used:
- [Aesir datasets](https://huggingface.co/MinervaAI)
- [NoRobots](https://huggingface.co/datasets/Doctor-Shotgun/no-robots-sharegpt)
- [limarp](https://huggingface.co/datasets/lemonilia/LimaRP) - 8k ctx
- [toxic-dpo-v0.1-sharegpt](https://huggingface.co/datasets/Undi95/toxic-dpo-v0.1-sharegpt)
- [ToxicQAFinal](https://huggingface.co/datasets/NobodyExistsOnTheInternet/ToxicQAFinal)
- Luminae-i1 (70B/70B-alt) (i2 was not existing when the 70b started training) | Luminae-i2 (8B) (this one gave better results on the 8b) - Ikari's Dataset
- [Squish42/bluemoon-fandom-1-1-rp-cleaned](https://huggingface.co/datasets/Squish42/bluemoon-fandom-1-1-rp-cleaned) - 50% (randomly)
- [NobodyExistsOnTheInternet/PIPPAsharegptv2test](https://huggingface.co/datasets/NobodyExistsOnTheInternet/PIPPAsharegptv2test) - 5% (randomly)
- [cgato/SlimOrcaDedupCleaned](https://huggingface.co/datasets/cgato/SlimOrcaDedupCleaned) - 5% (randomly)
- Airoboros (reduced)
- [Capybara](https://huggingface.co/datasets/Undi95/Capybara-ShareGPT/) (reduced)
## Models used (only for 8B)
- Initial LumiMaid 8B Finetune
- Undi95/Llama-3-Unholy-8B-e4
- Undi95/Llama-3-LewdPlay-8B
## Prompt template: Llama3
```
<|begin_of_text|><|start_header_id|>system<|end_header_id|>
{system_prompt}<|eot_id|><|start_header_id|>user<|end_header_id|>
{input}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
{output}<|eot_id|>
```
## Others
Undi: If you want to support us, you can [here](https://ko-fi.com/undiai).
IkariDev: Visit my [retro/neocities style website](https://ikaridevgit.github.io/) please kek

28
config.json Normal file
View File

@@ -0,0 +1,28 @@
{
"_name_or_path": "./result/input_models/Roleplay-Llama-3-8B_213413727",
"architectures": [
"LlamaForCausalLM"
],
"attention_bias": false,
"attention_dropout": 0.0,
"bos_token_id": 128000,
"eos_token_id": 128009,
"hidden_act": "silu",
"hidden_size": 4096,
"initializer_range": 0.02,
"intermediate_size": 14336,
"max_position_embeddings": 8192,
"model_type": "llama",
"num_attention_heads": 32,
"num_hidden_layers": 32,
"num_key_value_heads": 8,
"pretraining_tp": 1,
"rms_norm_eps": 1e-05,
"rope_scaling": null,
"rope_theta": 500000.0,
"tie_word_embeddings": false,
"torch_dtype": "bfloat16",
"transformers_version": "4.40.1",
"use_cache": true,
"vocab_size": 128256
}

135
mergekit_config.yml Normal file
View File

@@ -0,0 +1,135 @@
base_model: ./result/input_models/Roleplay-Llama-3-8B_213413727
dtype: bfloat16
merge_method: dare_ties
parameters:
int8_mask: 1.0
normalize: 0.0
slices:
- sources:
- layer_range: [0, 4]
model: ./result/input_models/Llama-3-Lumimaid-8B-e1_2058152591
parameters:
density: 0.9061440388199886
weight: 0.7420827290507876
- layer_range: [0, 4]
model: ./result/input_models/Llama-3-Unholy-8B-e4_1440388923
parameters:
density: 0.8343357824656759
weight: 0.5634171099678891
- layer_range: [0, 4]
model: ./result/input_models/Roleplay-Llama-3-8B_213413727
parameters:
density: 1.0
weight: 0.03808449036687045
- sources:
- layer_range: [4, 8]
model: ./result/input_models/Llama-3-Lumimaid-8B-e1_2058152591
parameters:
density: 1.0
weight: 0.040706182952752565
- layer_range: [4, 8]
model: ./result/input_models/Llama-3-Unholy-8B-e4_1440388923
parameters:
density: 1.0
weight: 0.5235663919709214
- layer_range: [4, 8]
model: ./result/input_models/Roleplay-Llama-3-8B_213413727
parameters:
density: 0.6753137462586175
weight: 0.1718739352284447
- sources:
- layer_range: [8, 12]
model: ./result/input_models/Llama-3-Lumimaid-8B-e1_2058152591
parameters:
density: 0.8144143226543775
weight: 0.2916571301845346
- layer_range: [8, 12]
model: ./result/input_models/Llama-3-Unholy-8B-e4_1440388923
parameters:
density: 0.5944343021653459
weight: 0.6289130590047136
- layer_range: [8, 12]
model: ./result/input_models/Roleplay-Llama-3-8B_213413727
parameters:
density: 0.9096807190417433
weight: 0.18225448981675693
- sources:
- layer_range: [12, 16]
model: ./result/input_models/Llama-3-Lumimaid-8B-e1_2058152591
parameters:
density: 1.0
weight: 0.31346575871103577
- layer_range: [12, 16]
model: ./result/input_models/Llama-3-Unholy-8B-e4_1440388923
parameters:
density: 1.0
weight: 0.6710513199806648
- layer_range: [12, 16]
model: ./result/input_models/Roleplay-Llama-3-8B_213413727
parameters:
density: 1.0
weight: 0.2620098997126852
- sources:
- layer_range: [16, 20]
model: ./result/input_models/Llama-3-Lumimaid-8B-e1_2058152591
parameters:
density: 0.7957908643933549
weight: 0.4065602812739591
- layer_range: [16, 20]
model: ./result/input_models/Llama-3-Unholy-8B-e4_1440388923
parameters:
density: 1.0
weight: 0.3833004954478314
- layer_range: [16, 20]
model: ./result/input_models/Roleplay-Llama-3-8B_213413727
parameters:
density: 1.0
weight: 0.3722661530618318
- sources:
- layer_range: [20, 24]
model: ./result/input_models/Llama-3-Lumimaid-8B-e1_2058152591
parameters:
density: 0.8820161972577153
weight: 0.31407655218805003
- layer_range: [20, 24]
model: ./result/input_models/Llama-3-Unholy-8B-e4_1440388923
parameters:
density: 0.871522940513238
weight: 0.09916802739443117
- layer_range: [20, 24]
model: ./result/input_models/Roleplay-Llama-3-8B_213413727
parameters:
density: 0.843576104996367
weight: 0.48592770058071444
- sources:
- layer_range: [24, 28]
model: ./result/input_models/Llama-3-Lumimaid-8B-e1_2058152591
parameters:
density: 0.8818663379010269
weight: 0.4128563619116445
- layer_range: [24, 28]
model: ./result/input_models/Llama-3-Unholy-8B-e4_1440388923
parameters:
density: 0.89467562267532
weight: 0.39209478410830645
- layer_range: [24, 28]
model: ./result/input_models/Roleplay-Llama-3-8B_213413727
parameters:
density: 1.0
weight: 0.20302426278165847
- sources:
- layer_range: [28, 32]
model: ./result/input_models/Llama-3-Lumimaid-8B-e1_2058152591
parameters:
density: 0.8679751557926477
weight: 0.5226676522508309
- layer_range: [28, 32]
model: ./result/input_models/Llama-3-Unholy-8B-e4_1440388923
parameters:
density: 0.9145274983719552
weight: 0.4103390562947599
- layer_range: [28, 32]
model: ./result/input_models/Roleplay-Llama-3-8B_213413727
parameters:
density: 0.7116071161471552
weight: 0.5557266216543452

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:714c4800e60486b7ff76e2ace03f69c06e4e0b80f74d997e0051605c49681c66
size 9953405736

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:c2ddbca3ada965777653640be6493efa38f616a75f876212099d242abfae31d7
size 6107150624

File diff suppressed because one or more lines are too long

16
special_tokens_map.json Normal file
View File

@@ -0,0 +1,16 @@
{
"bos_token": {
"content": "<|begin_of_text|>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false
},
"eos_token": {
"content": "<|end_of_text|>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false
}
}

410504
tokenizer.json Normal file

File diff suppressed because it is too large Load Diff

2062
tokenizer_config.json Normal file

File diff suppressed because it is too large Load Diff