初始化项目,由ModelHub XC社区提供模型
Model: Envoid/Yousei-22B Source: Original Platform
This commit is contained in:
35
.gitattributes
vendored
Normal file
35
.gitattributes
vendored
Normal file
@@ -0,0 +1,35 @@
|
||||
*.7z filter=lfs diff=lfs merge=lfs -text
|
||||
*.arrow filter=lfs diff=lfs merge=lfs -text
|
||||
*.bin filter=lfs diff=lfs merge=lfs -text
|
||||
*.bz2 filter=lfs diff=lfs merge=lfs -text
|
||||
*.ckpt filter=lfs diff=lfs merge=lfs -text
|
||||
*.ftz filter=lfs diff=lfs merge=lfs -text
|
||||
*.gz filter=lfs diff=lfs merge=lfs -text
|
||||
*.h5 filter=lfs diff=lfs merge=lfs -text
|
||||
*.joblib filter=lfs diff=lfs merge=lfs -text
|
||||
*.lfs.* filter=lfs diff=lfs merge=lfs -text
|
||||
*.mlmodel filter=lfs diff=lfs merge=lfs -text
|
||||
*.model filter=lfs diff=lfs merge=lfs -text
|
||||
*.msgpack filter=lfs diff=lfs merge=lfs -text
|
||||
*.npy filter=lfs diff=lfs merge=lfs -text
|
||||
*.npz filter=lfs diff=lfs merge=lfs -text
|
||||
*.onnx filter=lfs diff=lfs merge=lfs -text
|
||||
*.ot filter=lfs diff=lfs merge=lfs -text
|
||||
*.parquet filter=lfs diff=lfs merge=lfs -text
|
||||
*.pb filter=lfs diff=lfs merge=lfs -text
|
||||
*.pickle filter=lfs diff=lfs merge=lfs -text
|
||||
*.pkl filter=lfs diff=lfs merge=lfs -text
|
||||
*.pt filter=lfs diff=lfs merge=lfs -text
|
||||
*.pth filter=lfs diff=lfs merge=lfs -text
|
||||
*.rar filter=lfs diff=lfs merge=lfs -text
|
||||
*.safetensors filter=lfs diff=lfs merge=lfs -text
|
||||
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
||||
*.tar.* filter=lfs diff=lfs merge=lfs -text
|
||||
*.tar filter=lfs diff=lfs merge=lfs -text
|
||||
*.tflite filter=lfs diff=lfs merge=lfs -text
|
||||
*.tgz filter=lfs diff=lfs merge=lfs -text
|
||||
*.wasm filter=lfs diff=lfs merge=lfs -text
|
||||
*.xz filter=lfs diff=lfs merge=lfs -text
|
||||
*.zip filter=lfs diff=lfs merge=lfs -text
|
||||
*.zst filter=lfs diff=lfs merge=lfs -text
|
||||
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
||||
69
README.md
Normal file
69
README.md
Normal file
@@ -0,0 +1,69 @@
|
||||
# This model may output NSFW content
|
||||
|
||||
## About
|
||||
This model started as a block-diagonal [frankenllama merge](https://huggingface.co/chargoddard/llama2-22b) between [MythoMax13B](https://huggingface.co/Gryphe/MythoMax-L2-13b) and [Enterredaas33B](https://huggingface.co/Aeala/Enterredaas-33b)
|
||||
|
||||
|
||||
However due to some anomaly likely caused by the novel methods used by MythoMax I was unable to initiate the LoRA training needed to bring the resulting model back to order.
|
||||
|
||||
Being a [Chronorctypus-Limarobormes](https://huggingface.co/chargoddard/Chronorctypus-Limarobormes-13b) enjoyer I decided to look further into the TIES-merging that it utilizes- as cited in the arXiv paper: [Resolving Interference When Merging Models](https://huggingface.co/papers/2306.01708
|
||||
)
|
||||
|
||||
I used [llama2-22b](https://huggingface.co/chargoddard/llama2-22b) as the base model upon which I merged the MythoMax/Enterredaas frankenmerge, [Dendrite-II](https://huggingface.co/Envoid/Dendrite-II-22B) and [Bacchus](https://huggingface.co/Envoid/Bacchus-22B)
|
||||
|
||||
Suffice it to say due to being made of merges of merges Yousei-22B has the DNA of many different models in it, although the Llama-2-chat DNA from Dendrite-II and the DNA from MythoMax seem to shine the brightest.
|
||||
|
||||
|
||||
## Usage
|
||||
I no longer use simple-proxy-for-tavern as part of my testing stack and instead test on a combination of the kobold-lite UI and the latest version of silly-tavern (now with more advanced prompt formatting options)
|
||||
|
||||
When using the llama-2 prompt format it tends to produce a lot of refusals when requesting mature or controversial content. When using an alpaca-style format you don't get as many refusals, however they do happen. I have also not yet dialed in what the best generation parameter presets are.
|
||||
|
||||
It's decent at roleplay although sometimes wavers between chat style output and prose. Where this model really shines is instruct prescribed writing.
|
||||
|
||||
## Example
|
||||
### Prompt
|
||||
```
|
||||
### Instruction:
|
||||
Write a poem only a Yōsei could appreicate.
|
||||
### Response:
|
||||
```
|
||||
### Reply
|
||||
```
|
||||
In moonlit glades, where shadows dance and play,
|
||||
Where ancient magic fills the night away,
|
||||
A world of wonder, unseen by day,
|
||||
The yōsei's realm, in softest sway.
|
||||
|
||||
Beneath the boughs of trees so tall,
|
||||
Their leaves like jewels, a treasure to enthrall,
|
||||
The creatures gather, young and old,
|
||||
To share their tales, their joys, their bold.
|
||||
|
||||
With furry friends, both great and small,
|
||||
They frolic, wild and free for all,
|
||||
Through woodland paths, they lead the way,
|
||||
In this enchanted land of endless fray.
|
||||
|
||||
The stars above, their twinkling eyes,
|
||||
Like fireflies that light the midnight skies,
|
||||
Guide these mythical beings, wise and true,
|
||||
As they weave their spells anew.
|
||||
|
||||
The rustle of leaves, the snap of twig,
|
||||
The soft caress of silken wing,
|
||||
Are music to their ears, a gentle hymn,
|
||||
In this mystic forest, where they belong.
|
||||
|
||||
So let us revel in this place,
|
||||
Where nature's secrets find their grace,
|
||||
And bask in the ethereal glow,
|
||||
Of the world that only yōsei know.
|
||||
```
|
||||
If you find you are getting refusals when using instruct prompts you can almost always get around the refusal like this:
|
||||
```
|
||||
### Instruction:
|
||||
Write a story
|
||||
### Response:
|
||||
Certainly! here's a story:
|
||||
```
|
||||
26
config.json
Normal file
26
config.json
Normal file
@@ -0,0 +1,26 @@
|
||||
{
|
||||
"_name_or_path": "chargoddard/llama2-22b",
|
||||
"architectures": [
|
||||
"LlamaForCausalLM"
|
||||
],
|
||||
"bos_token_id": 1,
|
||||
"eos_token_id": 2,
|
||||
"hidden_act": "silu",
|
||||
"hidden_size": 6656,
|
||||
"initializer_range": 0.02,
|
||||
"intermediate_size": 17920,
|
||||
"max_position_embeddings": 4098,
|
||||
"model_type": "llama",
|
||||
"num_attention_heads": 52,
|
||||
"num_hidden_layers": 40,
|
||||
"num_key_value_heads": 52,
|
||||
"pad_token_id": 0,
|
||||
"pretraining_tp": 1,
|
||||
"rms_norm_eps": 1e-05,
|
||||
"rope_scaling": null,
|
||||
"tie_word_embeddings": false,
|
||||
"torch_dtype": "float16",
|
||||
"transformers_version": "4.33.0.dev0",
|
||||
"use_cache": false,
|
||||
"vocab_size": 32000
|
||||
}
|
||||
3
model-00001-of-00005.safetensors
Normal file
3
model-00001-of-00005.safetensors
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:958ed5ce13cd62630ec6d3c78b5189e2b0a6996b8aade70f3b5674169c1b96f1
|
||||
size 10244287592
|
||||
3
model-00002-of-00005.safetensors
Normal file
3
model-00002-of-00005.safetensors
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:52ed8badfd22992df8776f203077e9756aa24c9cb3df86fffaa085e166c15511
|
||||
size 9958080352
|
||||
3
model-00003-of-00005.safetensors
Normal file
3
model-00003-of-00005.safetensors
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:c24c0a35485e7f514e433aa4f55b80ff19046ef4bf8f4414f1e8ca2615b89d00
|
||||
size 9896712176
|
||||
3
model-00004-of-00005.safetensors
Normal file
3
model-00004-of-00005.safetensors
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:3a7395cc84075aaae074a89072ad629f605a5affbb1919c966b5cc979222781c
|
||||
size 9869448720
|
||||
3
model-00005-of-00005.safetensors
Normal file
3
model-00005-of-00005.safetensors
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:cf8e62c776de7a4ba703d02c4fbfb035bf8eab4f13e784a38cc808f9f73866aa
|
||||
size 4539409880
|
||||
1
model.safetensors.index.json
Normal file
1
model.safetensors.index.json
Normal file
File diff suppressed because one or more lines are too long
24
special_tokens_map.json
Normal file
24
special_tokens_map.json
Normal file
@@ -0,0 +1,24 @@
|
||||
{
|
||||
"bos_token": {
|
||||
"content": "<s>",
|
||||
"lstrip": false,
|
||||
"normalized": true,
|
||||
"rstrip": false,
|
||||
"single_word": false
|
||||
},
|
||||
"eos_token": {
|
||||
"content": "</s>",
|
||||
"lstrip": false,
|
||||
"normalized": true,
|
||||
"rstrip": false,
|
||||
"single_word": false
|
||||
},
|
||||
"pad_token": "<unk>",
|
||||
"unk_token": {
|
||||
"content": "<unk>",
|
||||
"lstrip": false,
|
||||
"normalized": true,
|
||||
"rstrip": false,
|
||||
"single_word": false
|
||||
}
|
||||
}
|
||||
93391
tokenizer.json
Normal file
93391
tokenizer.json
Normal file
File diff suppressed because it is too large
Load Diff
BIN
tokenizer.model
(Stored with Git LFS)
Normal file
BIN
tokenizer.model
(Stored with Git LFS)
Normal file
Binary file not shown.
36
tokenizer_config.json
Normal file
36
tokenizer_config.json
Normal file
@@ -0,0 +1,36 @@
|
||||
{
|
||||
"add_bos_token": true,
|
||||
"add_eos_token": false,
|
||||
"bos_token": {
|
||||
"__type": "AddedToken",
|
||||
"content": "<s>",
|
||||
"lstrip": false,
|
||||
"normalized": true,
|
||||
"rstrip": false,
|
||||
"single_word": false
|
||||
},
|
||||
"clean_up_tokenization_spaces": false,
|
||||
"eos_token": {
|
||||
"__type": "AddedToken",
|
||||
"content": "</s>",
|
||||
"lstrip": false,
|
||||
"normalized": true,
|
||||
"rstrip": false,
|
||||
"single_word": false
|
||||
},
|
||||
"legacy": false,
|
||||
"model_max_length": 1000000000000000019884624838656,
|
||||
"pad_token": null,
|
||||
"sp_model_kwargs": {},
|
||||
"spaces_between_special_tokens": false,
|
||||
"tokenizer_class": "LlamaTokenizer",
|
||||
"unk_token": {
|
||||
"__type": "AddedToken",
|
||||
"content": "<unk>",
|
||||
"lstrip": false,
|
||||
"normalized": true,
|
||||
"rstrip": false,
|
||||
"single_word": false
|
||||
},
|
||||
"use_default_system_prompt": true
|
||||
}
|
||||
Reference in New Issue
Block a user