初始化项目,由ModelHub XC社区提供模型

Model: Aryanne/WestSenzu-Swap-7B
Source: Original Platform
This commit is contained in:
ModelHub XC
2026-06-08 10:59:20 +08:00
commit 375108b0a2
52 changed files with 91558 additions and 0 deletions

37
.gitattributes vendored Normal file
View File

@@ -0,0 +1,37 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
q3_k_m.gguf filter=lfs diff=lfs merge=lfs -text
f16.gguf filter=lfs diff=lfs merge=lfs -text

158
README.md Normal file
View File

@@ -0,0 +1,158 @@
---
license: apache-2.0
library_name: transformers
tags:
- mergekit
- merge
base_model:
- NeuralNovel/Senzu-7B-v0.1-DPO
- senseable/WestLake-7B-v2
model-index:
- name: WestSenzu-Swap-7B
results:
- task:
type: text-generation
name: Text Generation
dataset:
name: AI2 Reasoning Challenge (25-Shot)
type: ai2_arc
config: ARC-Challenge
split: test
args:
num_few_shot: 25
metrics:
- type: acc_norm
value: 68.34
name: normalized accuracy
source:
url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Aryanne/WestSenzu-Swap-7B
name: Open LLM Leaderboard
- task:
type: text-generation
name: Text Generation
dataset:
name: HellaSwag (10-Shot)
type: hellaswag
split: validation
args:
num_few_shot: 10
metrics:
- type: acc_norm
value: 85.7
name: normalized accuracy
source:
url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Aryanne/WestSenzu-Swap-7B
name: Open LLM Leaderboard
- task:
type: text-generation
name: Text Generation
dataset:
name: MMLU (5-Shot)
type: cais/mmlu
config: all
split: test
args:
num_few_shot: 5
metrics:
- type: acc
value: 64.14
name: accuracy
source:
url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Aryanne/WestSenzu-Swap-7B
name: Open LLM Leaderboard
- task:
type: text-generation
name: Text Generation
dataset:
name: TruthfulQA (0-shot)
type: truthful_qa
config: multiple_choice
split: validation
args:
num_few_shot: 0
metrics:
- type: mc2
value: 50.43
source:
url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Aryanne/WestSenzu-Swap-7B
name: Open LLM Leaderboard
- task:
type: text-generation
name: Text Generation
dataset:
name: Winogrande (5-shot)
type: winogrande
config: winogrande_xl
split: validation
args:
num_few_shot: 5
metrics:
- type: acc
value: 82.48
name: accuracy
source:
url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Aryanne/WestSenzu-Swap-7B
name: Open LLM Leaderboard
- task:
type: text-generation
name: Text Generation
dataset:
name: GSM8k (5-shot)
type: gsm8k
config: main
split: test
args:
num_few_shot: 5
metrics:
- type: acc
value: 52.62
name: accuracy
source:
url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Aryanne/WestSenzu-Swap-7B
name: Open LLM Leaderboard
---
It's experimental, but seems fine for me, I didn't run it deeply yet but should be good for Role-play 😈 considering the two merged models, feel free to leave a suggestion or feedback.
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit)(my experimental branch swapping [here](https://github.com/Ar57m/mergekit/tree/swapping) )
## Merge Details
### Merge Method
This model was merged using the task_swapping merge method using [NeuralNovel/Senzu-7B-v0.1-DPO](https://huggingface.co/NeuralNovel/Senzu-7B-v0.1-DPO) as a base.
### Models Merged
The following models were included in the merge:
* [senseable/WestLake-7B-v2](https://huggingface.co/senseable/WestLake-7B-v2)
### Configuration
The following YAML configuration was used to produce this model:
```yaml
merge_method: task_swapping
base_model: NeuralNovel/Senzu-7B-v0.1-DPO
models:
- model: senseable/WestLake-7B-v2
parameters:
weight: 0.75
diagonal_offset: 2 #it doesn't do anything when you use random_mask
random_mask: 0.3333
random_mask_seed: 98557
dtype: bfloat16
```
# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_Aryanne__WestSenzu-Swap-7B)
| Metric |Value|
|---------------------------------|----:|
|Avg. |67.28|
|AI2 Reasoning Challenge (25-Shot)|68.34|
|HellaSwag (10-Shot) |85.70|
|MMLU (5-Shot) |64.14|
|TruthfulQA (0-shot) |50.43|
|Winogrande (5-shot) |82.48|
|GSM8k (5-shot) |52.62|

26
config.json Normal file
View File

@@ -0,0 +1,26 @@
{
"_name_or_path": "NeuralNovel/Senzu-7B-v0.1-DPO",
"architectures": [
"MistralForCausalLM"
],
"attention_dropout": 0.0,
"bos_token_id": 1,
"eos_token_id": 2,
"hidden_act": "silu",
"hidden_size": 4096,
"initializer_range": 0.02,
"intermediate_size": 14336,
"max_position_embeddings": 32768,
"model_type": "mistral",
"num_attention_heads": 32,
"num_hidden_layers": 32,
"num_key_value_heads": 8,
"rms_norm_eps": 1e-05,
"rope_theta": 10000.0,
"sliding_window": 4096,
"tie_word_embeddings": false,
"torch_dtype": "bfloat16",
"transformers_version": "4.38.1",
"use_cache": false,
"vocab_size": 32000
}

3
f16.gguf Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:5f6e443fd50c848cc5256ca2ab271ad863ff6c42274ba04bbf4fe491484d144f
size 14484731584

10
mergekit_config.yml Normal file
View File

@@ -0,0 +1,10 @@
merge_method: task_swapping
base_model: NeuralNovel/Senzu-7B-v0.1-DPO
models:
- model: senseable/WestLake-7B-v2
parameters:
weight: 0.75
diagonal_offset: 2 #it doesn't do anything when you use random_mask
random_mask: 0.3333
random_mask_seed: 98557
dtype: bfloat16

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:70b679735f88fbe9066646a83cbcf8f5e0520ca1a6584aabd9398628540a17c6
size 318767832

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:dc253df22e4ab80175f30c5c765202bc8c62c05ac06b9848c28c185ffd1d868c
size 394273488

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:4a3120d4573c6a8287953fc30bfa6fb3981c866e38ebade2cceec16accd975ea
size 394281792

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:b13a88215c089281efc856b4b2e2a63bbe3a1cb7e569c3303ef464645e92ac05
size 318776128

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:74015c03f383231a0c1c8b4cb22e6f56ea4bd8ed46e80a46fbe92af3f91235be
size 318784432

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a0ef0739412e1e9fc79491f7934fa3dc578dec05477267229b675f5acce4fc7c
size 318784424

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:888ccbd64dae28f7542ca1379be280a26441789e889f32663eef121f5b1290f5
size 394273488

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:ac393371c8c79c837384943d6542ea04c0702c8f697d16d5a6874d6027a4b5b4
size 394281792

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:22a79c6e69c1c0f50c0d0560dacbeef6e1ad10770956cd35658fc65038559412
size 318776128

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:dd8e57d3dbc71432e41086b7d57a560cf7bff84cbcec263eef6d3a7dd3f7c9c9
size 318784432

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:fc5e7895e8ad751bde78726c0c655194560ebdd7cd8c347245b31f9430381516
size 318784424

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:40e523ee309b3911fa78934172f67baca8993ba6c6805346490b6b46d7890e14
size 394273488

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:7977bd364f661aa9939cee05bb2fd98f0caf78b01e1f8a2ba15a00d47b3368a9
size 394281792

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:adf3b5f0e657e0b31a827c6e61ae72997c6397858a88fcd02cf628cf849071bf
size 396370976

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:e8dca6fe2eeaece5493e0074e1b5b72ac9476289799eaedfb4b2847b816b7e6c
size 385884768

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:fe2f06686d7dfa66f2e214d303c8cde7891777de49711a5437cdca63e41e5ddd
size 318776136

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:519a6a0ac811a249061ffec09515164f6b0ada1afa71e97686c503cbc046e5f5
size 318784440

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a666914321dc6c65810aad9da1f0bdb787ce11b59bf7e5aafc3ae32972fadc13
size 318784432

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:5cf6502b511289f3a3c04db53f94cfcc4682bed6bdf0681b48f8e30fa261d1c0
size 394273496

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:214a1a7d0c6f0af2d254c0006562eec14a255c44c67bef9c8178a98405227e2f
size 394281800

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:ae5d73eacce6070272cd57f9be5c40d57cb7c73b6a63388a10cb07553a0dbf07
size 318776136

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:8e33b4b2815fd71f9a2753ce7d4cced1de391b3503709d5dccfd7348a897ee67
size 318784440

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:b44777bb2bd908ce0d3e0e6dc038dd659e59c11f03a8fc15835f41bca5f31fb6
size 318784432

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:6d6e882eb84b86cba59a210f3088daff982b653b5d9eb97ee6df761d08789930
size 394273496

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:2b3581470d1ff3e7438152ce267f5e0cb09da991564cbeb8382bd7eee22c2a7e
size 394281800

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a4f05b6058ff3bed79da69ac3d2fb9f0a7d446467a9efc705c91468f84ce90d1
size 318776136

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:e9ae5b7bffdcd14e0b3befc7965da044dd672f02c6bfeb915a9fa3705a881191
size 318784440

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:e2db79efce1bba7578b507fdbf0eb5179d4e810fdab1432cf2811a5c8c8eaa67
size 318784432

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:f9a5ee49083ba90fd55c1b87b8554d0e404489d6e28ca34deca87749011f1939
size 379609648

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:1070172174b505d189ab8274c0038097a12d5b467d244d801f16e253185b464a
size 394273496

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:3c2f8e923e17d2c4afc563c4ea06d372b057ab587b215c9c923351907fe9ebc0
size 394281800

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:8672d6a476c4a8d5149ea96323730f7856f40a84f30af20ceb239a849b89323e
size 318776136

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:7779eaeff1cb21a4e4d80e1e0c71e43ecc5454a5a7e363938110fd03d26fef11
size 318784440

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a49b0e7edbb5c87ef8135a2ba3a2bbdd387b3cf4bc1154cad334e47c8e4dde2d
size 318784432

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:0e0ee7a7f5d5a3d8b28d4dbd9bb2eff81d71ec5a8e489a99b41d97d44179d81f
size 394273496

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:6b25ae6c535666025e5a87e5bfe9e1cde866412a9640415c873a70ea953f1a55
size 394281800

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:db43db5ff66a9204147ddf3a44c5ca69810d4a813ece45412ace3a9dae9aa8ec
size 318776136

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:9c05d63db65478564c1446e5a80f85f6b9a167786c499166223f19ad96bed781
size 318784440

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:57e19be5dce7351548a80b167dc2228df0a3d09919a6f20713cb022a4cf1424a
size 318784432

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:e1572a191140be55cb442f6467f732820b5ca137952ffb271227756e83e23a0f
size 394273496

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:49e8e06f65bfd19052d1fcc7199063e629d14853aa3ff9f21e6a74da55af4878
size 394290088

File diff suppressed because one or more lines are too long

3
q3_k_m.gguf Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:76241cdc1494dec12054c757d1f584f79a9904feeb92275922833b686d1b18e1
size 3518985984

30
special_tokens_map.json Normal file
View File

@@ -0,0 +1,30 @@
{
"bos_token": {
"content": "<s>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false
},
"eos_token": {
"content": "</s>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false
},
"pad_token": {
"content": "</s>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false
},
"unk_token": {
"content": "<unk>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false
}
}

91122
tokenizer.json Normal file

File diff suppressed because it is too large Load Diff

BIN
tokenizer.model (Stored with Git LFS) Normal file

Binary file not shown.

42
tokenizer_config.json Normal file
View File

@@ -0,0 +1,42 @@
{
"add_bos_token": true,
"add_eos_token": false,
"added_tokens_decoder": {
"0": {
"content": "<unk>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": true
},
"1": {
"content": "<s>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": true
},
"2": {
"content": "</s>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false,
"special": true
}
},
"additional_special_tokens": [],
"bos_token": "<s>",
"clean_up_tokenization_spaces": false,
"eos_token": "</s>",
"legacy": true,
"model_max_length": 1000000000000000019884624838656,
"pad_token": "</s>",
"sp_model_kwargs": {},
"spaces_between_special_tokens": false,
"tokenizer_class": "LlamaTokenizer",
"unk_token": "<unk>",
"use_default_system_prompt": false
}