初始化项目,由ModelHub XC社区提供模型

Model: Aryanne/MixSwap
Source: Original Platform
This commit is contained in:
ModelHub XC
2026-06-01 03:56:19 +08:00
commit 369460dc18
51 changed files with 91539 additions and 0 deletions

37
.gitattributes vendored Normal file
View File

@@ -0,0 +1,37 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
q4_k_s.gguf filter=lfs diff=lfs merge=lfs -text
f16.gguf filter=lfs diff=lfs merge=lfs -text

102
README.md Normal file
View File

@@ -0,0 +1,102 @@
---
base_model:
- cognitivecomputations/dolphin-2.2.1-mistral-7b
- l3utterfly/mistral-7b-v0.1-layla-v4-chatml
- teknium/Mistral-Trismegistus-7B
- Aryanne/Open-StarLake-Swap-7B
library_name: transformers
tags:
- mergekit
- merge
license: apache-2.0
---
# MixSwap
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit), but my branch was used
[here](https://github.com/Ar57m/mergekit/tree/swapping)
## Merge Details
### Merge Method
This model was merged using the task_swapping merge method using [Aryanne/Open-StarLake-Swap-7B](https://huggingface.co/Aryanne/Open-StarLake-Swap-7B) as a base.
### Models Merged
The following models were included in the merge:
* [cognitivecomputations/dolphin-2.2.1-mistral-7b](https://huggingface.co/cognitivecomputations/dolphin-2.2.1-mistral-7b)
* [teknium/Mistral-Trismegistus-7B](https://huggingface.co/teknium/Mistral-Trismegistus-7B)
* [l3utterfly/mistral-7b-v0.1-layla-v4-chatml](https://huggingface.co/l3utterfly/mistral-7b-v0.1-layla-v4-chatml)
### Prompt Format:
I prefer using this way, which seems to work.
### Example using Koboldcpp:
Start Seq.:
```
\nYour_name:
```
End Seq.:
```
\nCharacter_name:
```
In Memory
```
### Instruction:
Character description.
Generate a endless verbose(very descriptive) role-play conversation with Character_name.
### Response:
Your_name: how are you doing babe? *Your_name approaches Character_name and kisses her in the lips*
Character_name: I'm fine, it's been an weird day. *Character_name blushes and hugs Your_name with love*
```
### Configuration
The following YAML configuration was used to produce this model:
```yaml
base_model:
model:
path: Aryanne/Open-StarLake-Swap-7B
dtype: bfloat16
merge_method: task_swapping
slices:
- sources:
- layer_range: [0, 32]
model:
model:
path: l3utterfly/mistral-7b-v0.1-layla-v4-chatml
parameters:
diagonal_offset: 4.0
random_mask: 0.1
random_mask_seed: 1956557.0
weight: 0.4
- layer_range: [0, 32]
model:
model:
path: cognitivecomputations/dolphin-2.2.1-mistral-7b
parameters:
diagonal_offset: 4.0
random_mask: 0.1
random_mask_seed: 18019.0
weight: 0.333
- layer_range: [0, 32]
model:
model:
path: teknium/Mistral-Trismegistus-7B
parameters:
diagonal_offset: 4.0
random_mask: 0.05
random_mask_seed: 666666.0
weight: 0.5
- layer_range: [0, 32]
model:
model:
path: Aryanne/Open-StarLake-Swap-7B
```

28
config.json Normal file
View File

@@ -0,0 +1,28 @@
{
"_name_or_path": "/content/mergekit/test",
"architectures": [
"MistralForCausalLM"
],
"attention_dropout": 0.0,
"bos_token_id": 1,
"eos_token_id": 2,
"hidden_act": "silu",
"hidden_size": 4096,
"initializer_range": 0.02,
"intermediate_size": 14336,
"max_position_embeddings": 32768,
"model_type": "mistral",
"num_attention_heads": 32,
"num_hidden_layers": 32,
"num_key_value_heads": 8,
"pad_token_id": 2,
"rms_norm_eps": 1e-05,
"rope_theta": 10000.0,
"sliding_window": 4096,
"tie_word_embeddings": false,
"torch_dtype": "bfloat16",
"transformers_version": "4.38.2",
"unsloth_version": "2024.1",
"use_cache": true,
"vocab_size": 32000
}

3
f16.gguf Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:0c81a46068dbe22dbfe5a0c8a1b8e9e863a985a1cf9e764f066bf6f0042939ee
size 14484731616

38
mergekit_config.yml Normal file
View File

@@ -0,0 +1,38 @@
base_model:
model:
path: Aryanne/Open-StarLake-Swap-7B
dtype: bfloat16
merge_method: task_swapping
slices:
- sources:
- layer_range: [0, 32]
model:
model:
path: l3utterfly/mistral-7b-v0.1-layla-v4-chatml
parameters:
diagonal_offset: 4.0
random_mask: 0.1
random_mask_seed: 1956557.0
weight: 0.4
- layer_range: [0, 32]
model:
model:
path: cognitivecomputations/dolphin-2.2.1-mistral-7b
parameters:
diagonal_offset: 4.0
random_mask: 0.1
random_mask_seed: 18019.0
weight: 0.333
- layer_range: [0, 32]
model:
model:
path: teknium/Mistral-Trismegistus-7B
parameters:
diagonal_offset: 4.0
random_mask: 0.05
random_mask_seed: 666666.0
weight: 0.5
- layer_range: [0, 32]
model:
model:
path: Aryanne/Open-StarLake-Swap-7B

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:0e6a6aa473deb265ac14dbd25b868cf7d03a5217cd776e5adffb721a5e371463
size 318767832

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:4bbecae2c227fde3ae9492dbb001ef781c41553254bedc8ed68492a94fdf0bd2
size 394273496

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:906e7a0b883a012ec72e28f8d11659839c379340d9ea7780bb94c28bdc54852e
size 360719392

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:b34e13b0f968d3508c75bb50ed9cfb8ac88a0bef1ed37e45cbfd0e58f6fd69fb
size 394273496

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:e4d9391dd9bc2ce2c67ad04e3e8b3db4d824b130e51e1998f90423a2267b979d
size 360736008

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:c4c46dd290dc1aadbd8bcdd1fb0a237ba54e5d9aa92a6a1ceae9e85243a75a5a
size 394273496

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:f3736ca6c3f860e775c50c82d276c55af964408e6ebcc5d740beb76a5deb60f7
size 360735992

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:28e7b6753cc0e68ef99e1eda6d3926efb198b54b87d2992336f014b6429b2492
size 394273496

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:99e974104d0221644025228c13ec88c25725444f200abf9a1fc8992352bf1181
size 394273488

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a5017034db4ca2a70b25be89d6e4df11daf768c0a81a527f25d314ec97fc6cc1
size 394281800

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:eade5de3adf99c674cf2c1a021929844d687443a4490c9b4d5ef3a590ce5cdaf
size 394281792

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:cfb669bc7bc221546ca781074627438abc5169d59b47dafad8cd198c2093b656
size 369108128

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:82012e7f8307affeec0edc58541f3c14cc82794b514792e7f10e1502b67866d4
size 385893072

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:8fe2abbcd3901b5c402ded8d897a39c54f6a540c82369cd5698816b1223c20ff
size 369116432

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:2014b25d1c1e4bced228afda5f170f015b874f43c7dac20f608a926a9e2d5c76
size 385893072

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:1449c15041fc39a470a65ca3fdd20ce6a604e4eb42f5ad0dc2ebc77904a43ac4
size 369116416

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:779797e4db9712945dd9e21f4a16938a6ea6f72ea3e0bd020f4ba32ec1c477e2
size 385893072

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:df4e8772d22a94622864fc30a5a64678e0c7e5494a94fe4ea5ae5bc6758fe5bf
size 394273488

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:d31e91c9823560b9db76a8dc799aafa3371eb89627102425bfa24ea097b407a8
size 318776128

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:0d6f91ef2c06efeab124ce58defb5bff93f691361505222e59d6f7da07583108
size 394281792

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:0ca7fa8c48033ba81125af31e0232dbcd7a6a4c20774f841bc7f599aef2e923a
size 318784440

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:ef17bdd8a52f66c7994e6480bf1c0f5b421c82a9a51e4502aa3abbe772112441
size 318776128

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:1a0873901e3a4e088bf79c6077aea45f7677fb70e94343ee8e61bf8c5fc23dbb
size 318784440

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:2cd3c0839cf8d12e44b36ffcbbdb0e578103680486a36d95b42bbe213d9affd3
size 318784424

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a0bf5d630f90dc83a19e8e4b6cae1bac2e1f5ab1ae38e07075f747d17c924c48
size 394273496

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:bc2c5f9b3e5ef04e9a31c2a5d87d86ef5493ed595a2d5c362897c7470271004f
size 360736000

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:87fb1d7683f312a15725d6b61cb389a8e7c5475936973d789dcfd422b0b3a15c
size 379601376

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:3c37180f833c3970fcf598f79d0d1c5d53f3b0b43489542a54d2b5d50468b76e
size 379593032

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:81355484bbe3b393c21823a5d35f14f75cc4525fb7225f8d9d17b7c588d5d224
size 318784432

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:82c0c9e2fc8357d8771d5be7553c1ab2a18ee5ff89ed55b604b27d0ebc3ad71c
size 394273496

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:5c0132b58efb9838e33cc2561d00e452adcaff0c87303809fe8ffe4c840cfa57
size 394281800

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:6e769edb91ef331061b91b532d8eaa4f1a909382a52c024bda72a407d047ead3
size 318776136

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:e281b64ee3850ec8ce93480b3eb3bb8e15bcab85412c8730665fc5e62493af1d
size 318784440

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:c2258dfb11ee8c12de2c6d452ce4c628d60ca691890cc56a5f14b2a779e5accf
size 318784432

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:d94b83910a0c9dd0928431cc79be0816f53c6345b86827eece9c404eeccd6cd3
size 394273496

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:e77ade02dca396050f60859587b4d2f057d6ad5c51229b7cf42845f83ee3c32a
size 394281800

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:5806fc6e7dc48a5a60770052de5b25a0658867053e72b5584bce81acd0e38e8e
size 318776136

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:3d0175cee2285cae8a34021fb7c0c0f3a943c4ccb55a4565c0c87a8eb77c5383
size 318784440

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:1f7cbe7edf5ff143f8e3ceeb7964174755edda087026674d349f56ee6b6f7f65
size 318784432

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:b64e57ef7930da117a1b2a0cc8ad35fab14f7d2b03aa6bba6ab4c5ee83e1e658
size 352338512

File diff suppressed because one or more lines are too long

3
q4_k_s.gguf Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:cd067a538f477cdbfc60876e310be5323208d9ecf52465be3b0a8b136819c1c1
size 4140373792

35
special_tokens_map.json Normal file
View File

@@ -0,0 +1,35 @@
{
"additional_special_tokens": [
"<unk>",
"<s>",
"</s>"
],
"bos_token": {
"content": "<s>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false
},
"eos_token": {
"content": "</s>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false
},
"pad_token": {
"content": "<unk>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false
},
"unk_token": {
"content": "<unk>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false
}
}

91122
tokenizer.json Normal file

File diff suppressed because it is too large Load Diff

BIN
tokenizer.model (Stored with Git LFS) Normal file

Binary file not shown.

47
tokenizer_config.json Normal file
View File

@@ -0,0 +1,47 @@
{
"add_bos_token": true,
"add_eos_token": false,
"added_tokens_decoder": {
"0": {
"content": "<unk>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": true
},
"1": {
"content": "<s>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": true
},
"2": {
"content": "</s>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": true
}
},
"additional_special_tokens": [
"<unk>",
"<s>",
"</s>"
],
"bos_token": "<s>",
"clean_up_tokenization_spaces": false,
"eos_token": "</s>",
"legacy": true,
"model_max_length": 255,
"pad_token": "<unk>",
"padding_side": "right",
"sp_model_kwargs": {},
"spaces_between_special_tokens": false,
"tokenizer_class": "LlamaTokenizer",
"unk_token": "<unk>",
"use_default_system_prompt": true
}