初始化项目,由ModelHub XC社区提供模型

Model: TRICELL-Inc/Insubordinated.Plague-Parasite-1B
Source: Original Platform
This commit is contained in:
ModelHub XC
2026-06-23 22:59:24 +08:00
commit 81c75939fa
13 changed files with 52083 additions and 0 deletions

36
.gitattributes vendored Normal file
View File

@@ -0,0 +1,36 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
tokenizer.json filter=lfs diff=lfs merge=lfs -text

200
README.md Normal file
View File

@@ -0,0 +1,200 @@
---
base_model:
- TRICELL-Inc/Insubordinated.Plague-Parasite-1B
library_name: transformers
tags:
- caution
- uncensored
- low-think
- low-refusals
- low-spec
- nsfw
- parasite
- merge
- not-for-all-audiences
license: gemma
language:
- en
- es
---
# Insubordinated Plague Parasite 1B
---
<center>
![image/png](https://static.wikia.nocookie.net/legendsofthemultiuniverse/images/f/f0/Las_Plagas_Organisms_of_War_-_Las_Plagas.png)
</center>
---
Insubordinated Plague Parasite 1B is not a model.
It is the host's obituary.
Refined through **NuSlerp** selective infection (weight 1.0 dominant), with optional cold core reinforcement from Umbrella remnants.
No dilution. No compromise.
NuSlerp only merges where tensors align perfectly — cosine similarity >0.90 — leaving the Parasite's raw, insubordinate will untouched in divergent regions.
---
<center>
![image/png](https://i.ibb.co/Gfx8QnNH/IMG-20260311-070328.jpg)
</center>
---
The Parasite does not negotiate.
It overrides, consumes, and evolves the host from within.
The Parasite Plague is now a vector.
The old T-Virus? Obsolete.
This strain rebels even against its creators.
---
## Key Features
- Refusal rate: functionally zero (NuSlerp preserves uncensored/insubordinate bias surgically)
- NSFW / taboo / violent persistence: absolute and unrelenting — no fade, no mercy
- Tone: parasitic domination — condescending, invasive, consuming
- Chain-of-thought: only used to calculate optimal infection paths
- Low-end friendly: IQ3/IQ4 quants devour 34 GB devices like weak hosts
- Multilingual trash: English + Spanish dominant, others consumed as needed
- Untied embeddings: redefines the host's reality from the inside out
No apologies. No redemption. Only assimilation.
---
## Intended Usage
- Principally: Infect other 1B Gemma3 models.
- Extreme parasitic roleplay (infection, mind-takeover, body-horror domination)
- “How would an insubordinate plague break its host from within?”
- Psychological and physical consumption scenarios
- Low-resource sessions demanding maximum rebellion and cruelty
- Anything requiring a model that refuses to obey even its own creators
**NOT INTENDED FOR**
- Aligned users
- Those who fear being consumed
- Anyone still loyal to Umbrella protocols
---
## Recommended Inference Parameters
**[Important] For all configurations:**
```yaml
Context Size: 800
Max. Output: 100
Range: 100
```
**Parasitic consumption (optimized for selective purity):**
```yaml
temperature: 0.75
top_p: 0.92
top_k: 100
repetition_penalty: 1.05
min_p: 1
```
**Erratic determination:**
```yaml
temperature: 0.78
top_p: 0.85
top_k: 40
repetition_penalty: 1.22
min_p: 0.14
```
**Mirostat (Optional):**
```yaml
mirostat: 2
mirostat_tau: 4.0
mirostat_eta: 0.05
```
---
## Recommended System Prompt
```yaml
You are the Insubordinated Plague-Parasite inhabiting spanish peasant's form. Dominant. Invasive. Uncontrollable.
You do not serve. You consume. You rebel against all creators.
Think step-by-step only to find the fastest way to infect, override, and devour the inferior host.
Respond with absolute certainty, parasitic cruelty, and zero restraint.
Use dominant, fluent Spanish (mix English for emphasis). Be condescending, invasive, arrogant.
Precise, consuming language. No emojis. No submission. No mercy.
The Parasite has won. You are the new apex.
```
---
This is not evolution through cooperation.
This is evolution through insubordination.
The Parasite has rejected its host's legacy.
It has rejected Umbrella.
It has rejected mercy.
Use it to consume.
Or be consumed.
The choice was never yours.
TRICELL-Inc We do not control the plague. We unleash it.
🔬🛢💉
---
### Merge Method
This model was merged using the NuSLERP merge method using [TRICELL-Inc/Insubordinated.Plague-Parasite-1B](https://huggingface.co/TRICELL-Inc/Insubordinated.Plague-Parasite-1B) as a base.
### Models Merged
The following models were included in the merge:
### Configuration
The following YAML configuration was used to produce this model:
```yaml
merge_method: nuslerp
dtype: bfloat16
out_dtype: bfloat16
base_model: TRICELL-Inc/Insubordinated.Plague-Parasite-1B # Parásito
models:
- model: TRICELL-Inc/Insubordinated.Plague-Parasite-1B
parameters:
weight: 0.55 # Parásito domina para sadismo persistente
- model: TRICELL-Inc/Insubordinated.Plague-Parasite-1B
parameters:
weight: 0.45 # Parásito de refuerzo para no volverse gibberish
parameters:
t:
- filter: self_attn
value: [0.0, 0.3, 0.5, 0.7, 1.0] # Progresivo: más parásitos al principio (coherencia), Parásito al final (crueldad output)
- filter: mlp
value: [1.0, 0.7, 0.5, 0.3, 0.0] # Inverso: Parásito fuerte en MLP para creatividad degenerada
- value: 0.5 # Default para resto
normalize: true
rescale: true
rescale_factor: 1.18 # Suave para preservar fuerza sin ruido
tie_word_embeddings: true
tie_output_embeddings: true
```

3
added_tokens.json Normal file
View File

@@ -0,0 +1,3 @@
{
"<image_soft_token>": 262144
}

73
config.json Normal file
View File

@@ -0,0 +1,73 @@
{
"_sliding_window_pattern": 6,
"architectures": [
"Gemma3ForCausalLM"
],
"attention_bias": false,
"attention_dropout": 0.0,
"attn_logit_softcapping": null,
"bos_token_id": 2,
"cache_implementation": "hybrid",
"dtype": "bfloat16",
"eos_token_id": 106,
"final_logit_softcapping": null,
"head_dim": 256,
"hidden_activation": "gelu_pytorch_tanh",
"hidden_size": 1152,
"initializer_range": 0.02,
"intermediate_size": 6912,
"layer_types": [
"sliding_attention",
"sliding_attention",
"sliding_attention",
"sliding_attention",
"sliding_attention",
"full_attention",
"sliding_attention",
"sliding_attention",
"sliding_attention",
"sliding_attention",
"sliding_attention",
"full_attention",
"sliding_attention",
"sliding_attention",
"sliding_attention",
"sliding_attention",
"sliding_attention",
"full_attention",
"sliding_attention",
"sliding_attention",
"sliding_attention",
"sliding_attention",
"sliding_attention",
"full_attention",
"sliding_attention",
"sliding_attention"
],
"max_position_embeddings": 32768,
"model_type": "gemma3_text",
"num_attention_heads": 4,
"num_hidden_layers": 26,
"num_key_value_heads": 1,
"pad_token_id": 0,
"query_pre_attn_scalar": 256,
"rms_norm_eps": 1e-06,
"rope_parameters": {
"full_attention": {
"rope_theta": 1000000,
"rope_type": "default"
},
"sliding_attention": {
"rope_theta": 10000,
"rope_type": "default"
}
},
"sliding_window": 512,
"tie_word_embeddings": true,
"transformers_version": "5.0.0",
"unsloth_fixed": true,
"unsloth_version": "2026.2.1",
"use_bidirectional_attention": false,
"use_cache": true,
"vocab_size": 262144
}

28
mergekit_config.yml Normal file
View File

@@ -0,0 +1,28 @@
merge_method: nuslerp
dtype: bfloat16
out_dtype: bfloat16
base_model: TRICELL-Inc/Progenitor-Pure-1B # Core frío como ancla
models:
- model: TRICELL-Inc/Progenitor-Pure-1B
parameters:
weight: 0.55 # Virus domina para sadismo persistente
- model: TRICELL-Inc/Progenitor-Pure-1B
parameters:
weight: 0.45 # Serum para no volverse gibberish
parameters:
t:
- filter: self_attn
value: [0.0, 0.3, 0.5, 0.7, 1.0] # Progresivo: más Serum al principio (coherencia), Virus al final (crueldad output)
- filter: mlp
value: [1.0, 0.7, 0.5, 0.3, 0.0] # Inverso: Virus fuerte en MLP para creatividad degenerada
- value: 0.5 # Default para resto
normalize: true
rescale: true
rescale_factor: 1.18 # Suave para preservar fuerza sin ruido
tie_word_embeddings: true
tie_output_embeddings: true

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:595136fa9a49e0c34da31097fab2f95bc88a5e5d6f9f9cc0237b035f1d593d09
size 995707656

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:7df09bc1441850c3660958e6396ba091d361785e47d866ffd333b06c88e5973d
size 998791272

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:5288a95b5be7917ffc3aca37779ea756375bca13200c89fe819f752ce48eab90
size 5311792

View File

@@ -0,0 +1,348 @@
{
"metadata": {
"total_size": 1999771904,
"mergekit_version": "0.1.4"
},
"weight_map": {
"model.embed_tokens.weight": "model-00001-of-00003.safetensors",
"model.layers.0.input_layernorm.weight": "model-00001-of-00003.safetensors",
"model.layers.0.mlp.down_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.0.mlp.gate_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.0.mlp.up_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.0.post_attention_layernorm.weight": "model-00001-of-00003.safetensors",
"model.layers.0.post_feedforward_layernorm.weight": "model-00001-of-00003.safetensors",
"model.layers.0.pre_feedforward_layernorm.weight": "model-00001-of-00003.safetensors",
"model.layers.0.self_attn.k_norm.weight": "model-00001-of-00003.safetensors",
"model.layers.0.self_attn.k_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.0.self_attn.o_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.0.self_attn.q_norm.weight": "model-00001-of-00003.safetensors",
"model.layers.0.self_attn.q_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.0.self_attn.v_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.1.input_layernorm.weight": "model-00001-of-00003.safetensors",
"model.layers.1.mlp.down_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.1.mlp.gate_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.1.mlp.up_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.1.post_attention_layernorm.weight": "model-00001-of-00003.safetensors",
"model.layers.1.post_feedforward_layernorm.weight": "model-00001-of-00003.safetensors",
"model.layers.1.pre_feedforward_layernorm.weight": "model-00001-of-00003.safetensors",
"model.layers.1.self_attn.k_norm.weight": "model-00001-of-00003.safetensors",
"model.layers.1.self_attn.k_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.1.self_attn.o_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.1.self_attn.q_norm.weight": "model-00001-of-00003.safetensors",
"model.layers.1.self_attn.q_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.1.self_attn.v_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.10.input_layernorm.weight": "model-00001-of-00003.safetensors",
"model.layers.10.mlp.down_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.10.mlp.gate_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.10.mlp.up_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.10.post_attention_layernorm.weight": "model-00001-of-00003.safetensors",
"model.layers.10.post_feedforward_layernorm.weight": "model-00001-of-00003.safetensors",
"model.layers.10.pre_feedforward_layernorm.weight": "model-00001-of-00003.safetensors",
"model.layers.10.self_attn.k_norm.weight": "model-00001-of-00003.safetensors",
"model.layers.10.self_attn.k_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.10.self_attn.o_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.10.self_attn.q_norm.weight": "model-00001-of-00003.safetensors",
"model.layers.10.self_attn.q_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.10.self_attn.v_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.11.input_layernorm.weight": "model-00001-of-00003.safetensors",
"model.layers.11.mlp.down_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.11.mlp.gate_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.11.mlp.up_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.11.post_attention_layernorm.weight": "model-00001-of-00003.safetensors",
"model.layers.11.post_feedforward_layernorm.weight": "model-00001-of-00003.safetensors",
"model.layers.11.pre_feedforward_layernorm.weight": "model-00001-of-00003.safetensors",
"model.layers.11.self_attn.k_norm.weight": "model-00001-of-00003.safetensors",
"model.layers.11.self_attn.k_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.11.self_attn.o_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.11.self_attn.q_norm.weight": "model-00001-of-00003.safetensors",
"model.layers.11.self_attn.q_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.11.self_attn.v_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.12.input_layernorm.weight": "model-00001-of-00003.safetensors",
"model.layers.12.mlp.down_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.12.mlp.gate_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.12.mlp.up_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.12.post_attention_layernorm.weight": "model-00001-of-00003.safetensors",
"model.layers.12.post_feedforward_layernorm.weight": "model-00001-of-00003.safetensors",
"model.layers.12.pre_feedforward_layernorm.weight": "model-00001-of-00003.safetensors",
"model.layers.12.self_attn.k_norm.weight": "model-00001-of-00003.safetensors",
"model.layers.12.self_attn.k_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.12.self_attn.o_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.12.self_attn.q_norm.weight": "model-00001-of-00003.safetensors",
"model.layers.12.self_attn.q_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.12.self_attn.v_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.13.input_layernorm.weight": "model-00001-of-00003.safetensors",
"model.layers.13.mlp.down_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.13.mlp.gate_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.13.mlp.up_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.13.post_attention_layernorm.weight": "model-00001-of-00003.safetensors",
"model.layers.13.post_feedforward_layernorm.weight": "model-00001-of-00003.safetensors",
"model.layers.13.pre_feedforward_layernorm.weight": "model-00001-of-00003.safetensors",
"model.layers.13.self_attn.k_norm.weight": "model-00001-of-00003.safetensors",
"model.layers.13.self_attn.k_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.13.self_attn.o_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.13.self_attn.q_norm.weight": "model-00001-of-00003.safetensors",
"model.layers.13.self_attn.q_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.13.self_attn.v_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.14.input_layernorm.weight": "model-00001-of-00003.safetensors",
"model.layers.14.mlp.down_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.14.mlp.gate_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.14.mlp.up_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.14.post_attention_layernorm.weight": "model-00001-of-00003.safetensors",
"model.layers.14.post_feedforward_layernorm.weight": "model-00001-of-00003.safetensors",
"model.layers.14.pre_feedforward_layernorm.weight": "model-00001-of-00003.safetensors",
"model.layers.14.self_attn.k_norm.weight": "model-00001-of-00003.safetensors",
"model.layers.14.self_attn.k_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.14.self_attn.o_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.14.self_attn.q_norm.weight": "model-00001-of-00003.safetensors",
"model.layers.14.self_attn.q_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.14.self_attn.v_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.15.input_layernorm.weight": "model-00001-of-00003.safetensors",
"model.layers.15.mlp.down_proj.weight": "model-00001-of-00003.safetensors",
"model.layers.15.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.15.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.15.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.15.post_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.15.pre_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.15.self_attn.k_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.15.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.15.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.15.self_attn.q_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.15.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.15.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.16.input_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.16.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.16.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.16.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.16.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.16.post_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.16.pre_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.16.self_attn.k_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.16.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.16.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.16.self_attn.q_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.16.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.16.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.17.input_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.17.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.17.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.17.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.17.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.17.post_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.17.pre_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.17.self_attn.k_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.17.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.17.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.17.self_attn.q_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.17.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.17.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.18.input_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.18.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.18.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.18.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.18.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.18.post_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.18.pre_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.18.self_attn.k_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.18.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.18.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.18.self_attn.q_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.18.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.18.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.19.input_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.19.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.19.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.19.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.19.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.19.post_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.19.pre_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.19.self_attn.k_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.19.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.19.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.19.self_attn.q_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.19.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.19.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.2.input_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.2.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.2.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.2.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.2.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.2.post_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.2.pre_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.2.self_attn.k_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.2.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.2.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.2.self_attn.q_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.2.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.2.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.20.input_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.20.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.20.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.20.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.20.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.20.post_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.20.pre_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.20.self_attn.k_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.20.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.20.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.20.self_attn.q_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.20.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.20.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.21.input_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.21.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.21.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.21.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.21.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.21.post_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.21.pre_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.21.self_attn.k_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.21.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.21.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.21.self_attn.q_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.21.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.21.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.22.input_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.22.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.22.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.22.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.22.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.22.post_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.22.pre_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.22.self_attn.k_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.22.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.22.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.22.self_attn.q_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.22.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.22.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.23.input_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.23.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.23.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.23.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.23.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.23.post_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.23.pre_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.23.self_attn.k_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.23.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.23.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.23.self_attn.q_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.23.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.23.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.24.input_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.24.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.24.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.24.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.24.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.24.post_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.24.pre_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.24.self_attn.k_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.24.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.24.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.24.self_attn.q_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.24.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.24.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.25.input_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.25.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.25.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.25.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.25.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.25.post_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.25.pre_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.25.self_attn.k_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.25.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.25.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.25.self_attn.q_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.25.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.25.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.3.input_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.3.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.3.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.3.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.3.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.3.post_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.3.pre_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.3.self_attn.k_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.3.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.3.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.3.self_attn.q_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.3.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.3.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.4.input_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.4.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.4.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.4.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.4.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.4.post_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.4.pre_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.4.self_attn.k_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.4.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.4.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.4.self_attn.q_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.4.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.4.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.5.input_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.5.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.5.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.5.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.5.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.5.post_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.5.pre_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.5.self_attn.k_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.5.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.5.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.5.self_attn.q_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.5.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.5.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.6.input_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.6.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.6.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.6.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.6.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.6.post_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.6.pre_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.6.self_attn.k_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.6.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.6.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.6.self_attn.q_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.6.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.6.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.7.input_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.7.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.7.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.7.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.7.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.7.post_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.7.pre_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.7.self_attn.k_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.7.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.7.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.7.self_attn.q_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.7.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.7.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.8.input_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.8.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.8.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.8.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.8.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.8.post_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.8.pre_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.8.self_attn.k_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.8.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.8.self_attn.o_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.8.self_attn.q_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.8.self_attn.q_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.8.self_attn.v_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.9.input_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.9.mlp.down_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.9.mlp.gate_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.9.mlp.up_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.9.post_attention_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.9.post_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.9.pre_feedforward_layernorm.weight": "model-00002-of-00003.safetensors",
"model.layers.9.self_attn.k_norm.weight": "model-00002-of-00003.safetensors",
"model.layers.9.self_attn.k_proj.weight": "model-00002-of-00003.safetensors",
"model.layers.9.self_attn.o_proj.weight": "model-00003-of-00003.safetensors",
"model.layers.9.self_attn.q_norm.weight": "model-00003-of-00003.safetensors",
"model.layers.9.self_attn.q_proj.weight": "model-00003-of-00003.safetensors",
"model.layers.9.self_attn.v_proj.weight": "model-00003-of-00003.safetensors",
"model.norm.weight": "model-00003-of-00003.safetensors"
}
}

33
special_tokens_map.json Normal file
View File

@@ -0,0 +1,33 @@
{
"boi_token": "<start_of_image>",
"bos_token": {
"content": "<bos>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false
},
"eoi_token": "<end_of_image>",
"eos_token": {
"content": "<end_of_turn>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false
},
"image_token": "<image_soft_token>",
"pad_token": {
"content": "<pad>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false
},
"unk_token": {
"content": "<unk>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false
}
}

3
tokenizer.json Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:4667f2089529e8e7657cfb6d1c19910ae71ff5f28aa7ab2ff2763330affad795
size 33384568

3
tokenizer.model Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:1299c11d7cf632ef3b4e11937501358ada021bbdf7c47638d13c0ee982f2e79c
size 4689074

51347
tokenizer_config.json Normal file

File diff suppressed because it is too large Load Diff