初始化项目,由ModelHub XC社区提供模型

Model: KoboldAI/GPT-NeoX-20B-Skein
Source: Original Platform
This commit is contained in:
ModelHub XC
2026-05-01 15:29:08 +08:00
commit 689bc651df
33 changed files with 50209 additions and 0 deletions

50
.gitattributes vendored Normal file
View File

@@ -0,0 +1,50 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bin.* filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zstandard filter=lfs diff=lfs merge=lfs -text
*.tfevents* filter=lfs diff=lfs merge=lfs -text
*.db* filter=lfs diff=lfs merge=lfs -text
*.ark* filter=lfs diff=lfs merge=lfs -text
**/*ckpt*data* filter=lfs diff=lfs merge=lfs -text
**/*ckpt*.meta filter=lfs diff=lfs merge=lfs -text
**/*ckpt*.index filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.gguf* filter=lfs diff=lfs merge=lfs -text
*.ggml filter=lfs diff=lfs merge=lfs -text
*.llamafile* filter=lfs diff=lfs merge=lfs -text
*.pt2 filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
tokenizer.json filter=lfs diff=lfs merge=lfs -text
vocab.json filter=lfs diff=lfs merge=lfs -text

41
README.md Normal file
View File

@@ -0,0 +1,41 @@
---
language: en
license: apache-2.0
---
# GPT-NeoX-20B-Skein
## Model description
Skein is a series of hybrid story generation models intended for use in both text adventure writing and normal novel-style writing. The models are known to possess a strong second person bias. For inquiries, please contact the KoboldAI community.
The name comes from the Integrated Development Environment for the Inform 7 programming language, which calls a dialogue tree a "skein". Inform 6 and 7 were used to create some of the interactive fiction in the dataset.
## Training procedure
GPT-NeoX-20B-Skein was trained on a TPUv3-32 TPU pod using a heavily modified version of Ben Wang's Mesh Transformer JAX library, the original version of which was used by EleutherAI to train their GPT-J-6B model. The training hyperparameters and statistics can be found [here](https://wandb.ai/ve-forbryderne/skein-20b?workspace=user-ve-forbryderne).
## Training data
The data are mostly comprised of light novels from the dataset of the [KoboldAI/GPT-Neo-2.7B-Horni-LN](https://huggingface.co/KoboldAI/GPT-Neo-2.7B-Horni-LN) model and assorted interactive fiction. The dataset uses `[Themes: <comma-separated list of genres>]` for tagging. For more details, consult [this document](https://wandb.ai/ve-forbryderne/skein/runs/files/files/datasets/README.txt).
## Limitations and biases
Based on known problems with NLP technology, potential relevant factors include bias (gender, profession, race and religion).
## Citation details
The GPT-NeoX-20B model weights:
```bibtex
@inproceedings{gpt-neox-20b,
title={{GPT-NeoX-20B}: An Open-Source Autoregressive Language Model},
author={Black, Sid and Biderman, Stella and Hallahan, Eric and Anthony, Quentin and Gao, Leo and Golding, Laurence and He, Horace and Leahy, Connor and McDonell, Kyle and Phang, Jason and Pieler, Michael and Prashanth, USVSN Sai and Purohit, Shivanshu and Reynolds, Laria and Tow, Jonathan and Wang, Ben and Weinbach, Samuel},
booktitle={Proceedings of the ACL Workshop on Challenges \& Perspectives in Creating Large Language Models},
url={https://arxiv.org/abs/2204.06745},
year={2022}
}
```
The Mesh Transformer JAX library:
```bibtex
@misc{mesh-transformer-jax,
author = {Wang, Ben},
title = {{Mesh-Transformer-JAX: Model-Parallel Implementation of Transformer Language Model with JAX}},
howpublished = {\url{https://github.com/kingoflolz/mesh-transformer-jax}},
year = 2021,
month = May
}
```

28
config.json Normal file
View File

@@ -0,0 +1,28 @@
{
"architectures": [
"GPTNeoXForCausalLM"
],
"attention_probs_dropout_prob": 0.1,
"bos_token_id": 0,
"eos_token_id": 0,
"hidden_act": "gelu_fast",
"hidden_dropout_prob": 0.1,
"hidden_size": 6144,
"initializer_range": 0.02,
"intermediate_size": 24576,
"layer_norm_eps": 1e-05,
"max_position_embeddings": 2048,
"model_type": "gpt_neox",
"num_attention_heads": 64,
"num_hidden_layers": 44,
"rotary_emb_base": 10000,
"rotary_pct": 0.25,
"tie_word_embeddings": false,
"torch_dtype": "float16",
"transformers_version": "4.19.0.dev0",
"use_cache": true,
"adventure": true,
"rep_pen": 1.03,
"max_length": 2048,
"vocab_size": 50432
}

1
configuration.json Normal file
View File

@@ -0,0 +1 @@
{"framework": "pytorch", "task": "text-generation", "allow_remote": true}

50009
merges.txt Normal file

File diff suppressed because it is too large Load Diff

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:92a1017920ad1730e9c0449b0791334d9fe915370e128aebcc9bf125fa921b63
size 1836322580

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:b07403f1e563d8ef7d556fe14678e73fdcac6514c71fff31ada5ac8bfbdbf573
size 1820656139

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:7419446c7b9cd61bd827e075afe51f9c8bf1c2b7c7bdc6cc5943a1b942a79cdf
size 1820656139

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:0ba84aa0d487ade4f2dbd6dcc3f90f1be67fc858120cce9db1e52ff5dfa4409a
size 1820656139

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:b387f00794a37274563294ab965e22968fc1313701c7eb6d940bccfc88505a3c
size 1820656139

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:63d5810b4802a41a6142c38bff031fb621c25195eb3367bf55081a9c7c025ea3
size 1820656203

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:56a3c77b50a5a502c58b406cf5004855827861ffaf7bb22782e0ff4b304a694b
size 1820656203

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:397fed797f1f9bd58943b616b003cf64b7ddb6feef75c3ce4f9b5de0af1ae0fc
size 1820656203

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:35cb9f7f5f7dfd634f7e69a788e890b9fc76a03e224e1d4b523457b261d927a8
size 1820656203

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:f870a31e20562c9c06a2ba787ef2afb5114b80bfaea8e7b5081df9006d07148f
size 1820656203

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:6272e879574c70513399d8abd53caa333bf45b6b46bf9261011c761793484ea5
size 1820656203

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:aa6dd6dcb588d97c67321f19c8b876c38fe2af0001251296ac0ff4db5967e04e
size 1820656203

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a842c82fd06a6d80464d5dc8921ed4b2c43222746d66a20764dc474c710bfb47
size 1820656203

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:349472761424e3669924f5ef0b652f60e2b5afa8077ff0f4092565a0f2544a79
size 1820656203

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:489ef56ea3f000340b56100545c9a2423b3ec98fd1f8c3adc5965184f6946da7
size 1820656203

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:41bd155b401e5830ecdea8af1e858537b58b27d9069e0080d9de9f341c561ae8
size 1820656203

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a5e1045699b565f6fa4f6b240b2e4e725253d1cf1c144e0afa93b705ed9dae63
size 1820656203

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:8197eeef50964a2b696d6d4db6615521d212430283951c181b35b09c98bd10d0
size 1820656203

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:9ccda7a8cbb316e9bfe92f1d52f873d9a141090c3537c1bb4f1390ea38d37bb3
size 1820656203

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:d40ce1a1541f2ec6fd0e025db417b61854aa943e2ab2c3890f42d5e00a42ffcc
size 1820656203

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:1d3146d1c6165c3977da25b0398df9898d94c8c392d5d034d7e79f2414eb0fa4
size 1820656203

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:765bc29a3df44a2ca3c8d9e48a1202816730a4c884a5f26fc8de8f87fa3b17f8
size 1820656203

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:b5cdbf93430a48b5b4001ad072261ae410eab82abc6c3d70b01f3902c29a0c54
size 1223776339

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:d63e4605d0dad0424baee35abf49a1836a7ba71540fa14a09e1229921f22c229
size 57712

1
special_tokens_map.json Normal file
View File

@@ -0,0 +1 @@
{"bos_token": "<|endoftext|>", "eos_token": "<|endoftext|>", "unk_token": "<|endoftext|>"}

3
tokenizer.json Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:e74ca9064c4f0d2232253bfe5f48caa78b6827212e87d831899d8fc64542e62b
size 2113711

1
tokenizer_config.json Normal file
View File

@@ -0,0 +1 @@
{"unk_token": "<|endoftext|>", "bos_token": "<|endoftext|>", "eos_token": "<|endoftext|>", "add_prefix_space": false, "tokenizer_class": "GPTNeoXTokenizer"}

3
vocab.json Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:5ea14061eabeb16dba865236c04e8b5689597a843e8fc2b73ebf816ae3253306
size 1077393