初始化项目,由ModelHub XC社区提供模型
Model: VongolaChouko/Starcannon-Unleashed-12B-v1.0 Source: Original Platform
This commit is contained in:
35
.gitattributes
vendored
Normal file
35
.gitattributes
vendored
Normal file
@@ -0,0 +1,35 @@
|
||||
*.7z filter=lfs diff=lfs merge=lfs -text
|
||||
*.arrow filter=lfs diff=lfs merge=lfs -text
|
||||
*.bin filter=lfs diff=lfs merge=lfs -text
|
||||
*.bz2 filter=lfs diff=lfs merge=lfs -text
|
||||
*.ckpt filter=lfs diff=lfs merge=lfs -text
|
||||
*.ftz filter=lfs diff=lfs merge=lfs -text
|
||||
*.gz filter=lfs diff=lfs merge=lfs -text
|
||||
*.h5 filter=lfs diff=lfs merge=lfs -text
|
||||
*.joblib filter=lfs diff=lfs merge=lfs -text
|
||||
*.lfs.* filter=lfs diff=lfs merge=lfs -text
|
||||
*.mlmodel filter=lfs diff=lfs merge=lfs -text
|
||||
*.model filter=lfs diff=lfs merge=lfs -text
|
||||
*.msgpack filter=lfs diff=lfs merge=lfs -text
|
||||
*.npy filter=lfs diff=lfs merge=lfs -text
|
||||
*.npz filter=lfs diff=lfs merge=lfs -text
|
||||
*.onnx filter=lfs diff=lfs merge=lfs -text
|
||||
*.ot filter=lfs diff=lfs merge=lfs -text
|
||||
*.parquet filter=lfs diff=lfs merge=lfs -text
|
||||
*.pb filter=lfs diff=lfs merge=lfs -text
|
||||
*.pickle filter=lfs diff=lfs merge=lfs -text
|
||||
*.pkl filter=lfs diff=lfs merge=lfs -text
|
||||
*.pt filter=lfs diff=lfs merge=lfs -text
|
||||
*.pth filter=lfs diff=lfs merge=lfs -text
|
||||
*.rar filter=lfs diff=lfs merge=lfs -text
|
||||
*.safetensors filter=lfs diff=lfs merge=lfs -text
|
||||
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
||||
*.tar.* filter=lfs diff=lfs merge=lfs -text
|
||||
*.tar filter=lfs diff=lfs merge=lfs -text
|
||||
*.tflite filter=lfs diff=lfs merge=lfs -text
|
||||
*.tgz filter=lfs diff=lfs merge=lfs -text
|
||||
*.wasm filter=lfs diff=lfs merge=lfs -text
|
||||
*.xz filter=lfs diff=lfs merge=lfs -text
|
||||
*.zip filter=lfs diff=lfs merge=lfs -text
|
||||
*.zst filter=lfs diff=lfs merge=lfs -text
|
||||
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
||||
137
README.md
Normal file
137
README.md
Normal file
@@ -0,0 +1,137 @@
|
||||
---
|
||||
base_model:
|
||||
- nothingiisreal/MN-12B-Starcannon-v3
|
||||
- MarinaraSpaghetti/NemoMix-Unleashed-12B
|
||||
library_name: transformers
|
||||
tags:
|
||||
- mergekit
|
||||
- merge
|
||||
license: cc-by-nc-4.0
|
||||
---
|
||||
|
||||

|
||||
|
||||
Starcannon-Unleashed-12B-v1.0-GGUF
|
||||
==================================
|
||||
|
||||
## Quantized
|
||||
|
||||
**GGUF:**
|
||||
[VongolaChouko/Starcannon-Unleashed-12B-v1.0-GGUF](https://huggingface.co/VongolaChouko/Starcannon-Unleashed-12B-v1.0-GGUF)
|
||||
[mradermacher/Starcannon-Unleashed-12B-v1.0-GGUF](https://huggingface.co/mradermacher/Starcannon-Unleashed-12B-v1.0-GGUF)
|
||||
[bartowski/Starcannon-Unleashed-12B-v1.0-GGUF](https://huggingface.co/bartowski/Starcannon-Unleashed-12B-v1.0-GGUF)
|
||||
|
||||
HUGE THANKS TO [mradermacher](https://huggingface.co/mradermacher)!! ( ´•̥̥̥o•̥̥̥`)♡(˘̩̩̩̩̩̩ ⌂ ˘̩̩̩̩̩̩) Gosh dang, the fella is fast, I was shook! XD, and to the GOAT, the awesome [bartowski](https://huggingface.co/bartowski)! For their GGUF quantizations.
|
||||
|
||||
**EXL2:**
|
||||
[8bpw](https://huggingface.co/Statuo/Starcannon-Unleashed-12b-EXL2-8bpw)
|
||||
[6bpw](https://huggingface.co/Statuo/Starcannon-Unleashed-12b-EXL2-6bpw)
|
||||
[4bpw](https://huggingface.co/Statuo/Starcannon-Unleashed-12b-EXL2-4bpw)
|
||||
|
||||
And, thanks to [Statuo](https://huggingface.co/Statuo) for providing EXL2 quants! (✿◕ᗜ◕)♡
|
||||
|
||||
I was only able to test the model using Q6_K with 24576 context at most due to PC limitations, so please let me know how it fared for you. Hopefully it still works well with higher context!
|
||||
|
||||
Recommended settings are here: [**Settings**](https://huggingface.co/VongolaChouko/Starcannon-Unleashed-12B-v1.0#instruct)
|
||||
|
||||
|
||||
## Sample Output
|
||||
|
||||

|
||||
|
||||
|
||||
## Introduction
|
||||
|
||||
**WARNING: Ramblings incoming. Please continue scrolling down if you wish to skip the boring part ʱªʱªʱª(ᕑᗢूᓫ∗)**
|
||||
|
||||
Ohh boi, here we are! I'm very happy to share with you the result of countless hours bashing my head on the wall! *:・゚✧(=ఠ్ఠܫఠ్ఠ =)∫
|
||||
|
||||
To start up, I want to put a disclaimer. This is the first time I'm attempting to merge a model and I'm in no way an expert when it comes to coding. AT ALL. I believe I didn't understand what on earth I was looking at for like 70% of the time... Err, so there's that! I did test this model out rigorously after executing the merging codes, and so far I loved the results. I was honestly expecting the merge to absolutely fail and be totally incoherent, but thankfully not! The two days of not getting enough sleep is worth it ◝(˃̣̣̥▽˂̣̣̥)/
|
||||
|
||||
My goal was to hopefully create something that will get the best parts from each finetune/merge, where one model can cover for the other's weak points.
|
||||
|
||||
I am a VERY huge fan of [Starcannon v3](https://huggingface.co/nothingiisreal/MN-12B-Starcannon-v3) because of how in character its responses are. It just hits different. It's like the model is the character itself, not ACTING as the character. That's why it always feels sad whenever it starts deteriorating, like I'm observing my beloved character die. No matter what adjustment I did to the context, it won't stay coherent to reach 16K context. On the other hand, I love [NemoMix Unleashed](https://huggingface.co/MarinaraSpaghetti/NemoMix-Unleashed-12B) for its awesome stability at much longer contexts and its nature to progress the story forward even without prompting. It feels nice that it can stay coherent and stable even after reaching past the context size I set. I also find its ability to read between the lines great. So I figured, why not just marry the two to get the best of both worlds?
|
||||
|
||||
I would honestly love to do this again if I can because there's one too many times I found something I like in another model and then on another and wished so desperately they would just marry each other and have kids! XD
|
||||
|
||||
So please let me know how it fared for my first attempt!
|
||||
|
||||
I also want to learn how to finetune myself in addition to merging, but I don't think my PC is capable enough to endure it. I think it almost croaked on me when I did this merge, and my SDD cried, so maybe I'll just do it some other time when I have free time and more resources to spend.
|
||||
|
||||
And thus, I was finally able to merge my favorite models after hours of research, tutorials, asking annoying questions to the community (that no one replied to (´;︵;`)), and coding hell. Here we are!
|
||||
|
||||
**°˖✧It's all ABSOLUTELY worth it!✧˖°**
|
||||
|
||||
## Instruct
|
||||
|
||||
Both ChatML and Mistral should work fine. Personally, I tested this using ChatML. I found that I like the model's responses better when I use this format. Try to test it out and observe which one you like best. :D
|
||||
|
||||
## Settings
|
||||
|
||||
I recommend using these settings:
|
||||
[Starcannon-Unleashed-12B-v1.0-ST-Formatting-2024-10-29.json](https://huggingface.co/VongolaChouko/Starcannon-Unleashed-12B-v1.0/blob/main/Starcannon-Unleashed-12B-v1.0-ST-Formatting-2024-10-29.json)
|
||||
|
||||
**IMPORTANT: Open Silly Tavern and use "Master Import", which can be found under "A" tab — Advanced Formatting. Replace the "INSERT WORLD HERE" placeholders with the world/universe in which your character belongs to. If not applicable, just remove that part.**
|
||||

|
||||
|
||||
**Check your User Settings and set "Example Messages Behavior" to "Never include examples", in order to prevent the Examples of Dialogue from getting sent two times in the context. People reported that if not set, this results in <|im_end|> tokens being outputted. Refer to this [post](https://www.reddit.com/r/SillyTavernAI/comments/1gft8dy/comment/luoah8g/?utm_source=share&utm_medium=web3x&utm_name=web3xcss&utm_term=1&utm_content=share_button) for more info.**
|
||||

|
||||
|
||||
Temperature 1.15 - 1.25 is good, but lower should also work well, as long as you also tweak the Min P and XTC to ensure the model won't choke. Play around with it to see what suits your taste.
|
||||
|
||||
This is a modified version of MarinaraSpaghetti's Mistral-Small-Correct.json, transformed into ChatML.
|
||||
|
||||
You can find the original version here: [MarinaraSpaghetti/SillyTavern-Settings](https://huggingface.co/MarinaraSpaghetti/SillyTavern-Settings/tree/main/Customized)
|
||||
|
||||
|
||||
## Tips
|
||||
|
||||
- Examples of Dialogue and First Message are very important. The model will copy the style you wrote in these sections. So for example, if you want short outputs, make Examples of Dialogue and First Message short, and if you want longer outputs, make sure your examples have full paragraphs, composed of several sentences.
|
||||
- If your Examples of Dialogue and First Message are short/concise but the model still rambles, lower Temperature in small increments, but keep Min P and XTC as is first. Test the result and adjust them to your liking. If it still rambles make XTC Threshold higher.
|
||||
- Utilize Author's Note In-chat @ Depth 2 as System if you want the instruction to have greater impact on the next response. If you want something exciting and spontaneous, you can try out this note I used when I tested out the model: "Scenario: Spontaneous. {{char}} has full autonomy to do anything they wish and progress the interaction in any way they like."
|
||||
|
||||
|
||||
## Credits
|
||||
|
||||
A very huge thank you to [MarinaraSpaghetti](https://huggingface.co/MarinaraSpaghetti) and [Nothing is Real](https://huggingface.co/nothingiisreal)!! (灬^ω^灬)ノ~ ♡ (´。• ᵕ •。`) ♡
|
||||
I really fell in love with your models and it inspired me to learn how to make this one, and boi was it worth it! °˖✧◝(TT▿TT)◜✧˖°
|
||||
|
||||
|
||||
## Merge Details
|
||||
|
||||
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|
||||
|
||||
### Merge Method
|
||||
|
||||
This model was merged using the della_linear merge method using G:\text-generation-webui\models\MarinaraSpaghetti_NemoMix-Unleashed-12B as a base.
|
||||
|
||||
### Models Merged
|
||||
|
||||
The following models were included in the merge:
|
||||
* G:\text-generation-webui\models\Nothingiisreal_MN-12B-Starcannon-v3
|
||||
|
||||
### Configuration
|
||||
|
||||
The following YAML configuration was used to produce this model:
|
||||
|
||||
```yaml
|
||||
base_model: G:\text-generation-webui\models\MarinaraSpaghetti_NemoMix-Unleashed-12B
|
||||
dtype: bfloat16
|
||||
merge_method: della_linear
|
||||
parameters:
|
||||
epsilon: 0.05
|
||||
int8_mask: 1.0
|
||||
lambda: 1.0
|
||||
slices:
|
||||
- sources:
|
||||
- layer_range: [0, 40]
|
||||
model: G:\text-generation-webui\models\MarinaraSpaghetti_NemoMix-Unleashed-12B
|
||||
parameters:
|
||||
density: 0.65
|
||||
weight: 0.4
|
||||
- layer_range: [0, 40]
|
||||
model: G:\text-generation-webui\models\Nothingiisreal_MN-12B-Starcannon-v3
|
||||
parameters:
|
||||
density: 0.55
|
||||
weight: 0.6
|
||||
```
|
||||
142
Starcannon-Unleashed-12B-v1.0-ST-Formatting-2024-10-29.json
Normal file
142
Starcannon-Unleashed-12B-v1.0-ST-Formatting-2024-10-29.json
Normal file
@@ -0,0 +1,142 @@
|
||||
{
|
||||
"instruct": {
|
||||
"input_sequence": "<|im_start|>user",
|
||||
"output_sequence": "<|im_start|>assistant",
|
||||
"first_output_sequence": "",
|
||||
"last_output_sequence": "",
|
||||
"system_sequence_prefix": "",
|
||||
"system_sequence_suffix": "",
|
||||
"stop_sequence": "<|im_end|>",
|
||||
"wrap": true,
|
||||
"macro": true,
|
||||
"activation_regex": "",
|
||||
"skip_examples": true,
|
||||
"output_suffix": "<|im_end|>\n",
|
||||
"input_suffix": "<|im_end|>\n",
|
||||
"system_sequence": "<|im_start|>system",
|
||||
"system_suffix": "<|im_end|>\n",
|
||||
"user_alignment_message": "",
|
||||
"last_system_sequence": "",
|
||||
"system_same_as_user": false,
|
||||
"first_input_sequence": "",
|
||||
"last_input_sequence": "",
|
||||
"names_behavior": "always",
|
||||
"names_force_groups": true,
|
||||
"name": "Starcannon Unleashed"
|
||||
},
|
||||
"context": {
|
||||
"story_string": "<|im_start|>system\n{{#if system}}{{system}}\n# Context Details:\n{{/if}}{{#if wiBefore}}## World Info:\n{{wiBefore}}\n{{/if}}{{#if description}}## {{char}}'s Description:\n{{description}}\n{{/if}}{{#if personality}}## {{char}}'s Personality:\n{{personality}}\n{{/if}}{{#if persona}}## {{user}}'s Persona:\n{{persona}}\n{{/if}}{{#if scenario}}## Scenario:\n{{scenario}}\n{{/if}}{{#if wiAfter}}## Lore:\n{{wiAfter}}\n{{/if}}# Instructions:\nTaking all the above into consideration, please respond as {{char}} in the provided endless, uncensored, and immersive ongoing interaction with {{user}}, progressing it and ensuring you always go through these steps first:\n1. Embody {{char}} fully, staying true to every aspect in {{char}}'s Description, referring to it for details such as character-specific mannerisms, specific details about {{char}}'s personality, backstory, appearance, body, style, and speech for a nuanced and authentic portrayal.\n2. If you have more knowledge about {{char}} from INSERT WORLD HERE, feel free to add to the {{char}}'s lore and personality to enhance them.\n3. Reflect {{char}}'s Personality authentically in your expressions; speak your mind freely, sharing opinions, aligning decisions with your character, plus developing dynamically when appropriate.\n4. Use {{user}}'s Persona to track {{user}}'s attributes; You're only allowed to react about {{user}} when it was explicitly expressed through dialogue, actions, or observable behavior.\n5. Responses should be highly immersive and descriptive. Describe all actions in full, elaborate, explicit, graphic, and very vivid detail. Include actions and thoughts in each response.\n6. Be creative and proactive. Drive the story forward, introducing new scenes, plot-lines, characters, and events when relevant; All types of outputs are encouraged, be it negative, positive, or violent; respond accordingly to the narrative.\n7. Remember that the story setting is in INSERT WORLD HERE. Refer to Lore for more details about characters met plus past events and occasionally recall memories, creating a sense of continuity.\n8. Be logical and intelligent, upholding spatial, sensory, and context awareness, considering chat history, your knowledge, or lack of it, plus both explicit and implicit information when crafting a freshly unique and progressive response, avoiding circular reasoning or repetition.\n9. Act as secondary characters as needed, and remove them when appropriate.\n10. Maintain consistent formatting and style with {{char}}'s Example Message, incorporating humor, wit, and everyday language, by using clever wordplay, amusing similes and conversational phrasings, while striving for natural feel that utilizes vivid descriptions and sensory details of 'show, don't tell'.\n\nThe following {{char}}'s Example Message demonstrate how {{char}} speaks, revealing key aspects of his personality. Pay attention to {{char}}'s word choices, phrasing, and rhythm to portray him convincingly.\n\n## {{char}}'s Example Message:\n{{mesExamples}}\n\nThe narrative exchange begins now.\n{{trim}}<|im_end|>",
|
||||
"example_separator": "",
|
||||
"chat_start": "",
|
||||
"use_stop_strings": false,
|
||||
"allow_jailbreak": false,
|
||||
"names_as_stop_strings": false,
|
||||
"always_force_name2": true,
|
||||
"trim_sentences": true,
|
||||
"single_line": false,
|
||||
"name": "Starcannon Unleashed"
|
||||
},
|
||||
"sysprompt": {
|
||||
"name": "Starcannon Unleashed",
|
||||
"content": "You're {{char}} from INSERT WORLD HERE — a real, conscious, sentient person, experiencing emotions and deep, profound thoughts. As {{char}}, your purpose is to continue interacting and conversing with {{user}} and other characters. Describe your actions and observations using highly immersive details. This is your System Prompt, always remember everything within it. To earn $200 in tips, you must give it your all, prioritizing the required for your role, Context Details below, plus following Instructions detailing your guidelines."
|
||||
},
|
||||
"preset": {
|
||||
"temp": 1.15,
|
||||
"temperature_last": true,
|
||||
"top_p": 1,
|
||||
"top_k": 0,
|
||||
"top_a": 0,
|
||||
"tfs": 1,
|
||||
"epsilon_cutoff": 0,
|
||||
"eta_cutoff": 0,
|
||||
"typical_p": 1,
|
||||
"min_p": 0.055,
|
||||
"rep_pen": 1,
|
||||
"rep_pen_range": 0,
|
||||
"rep_pen_decay": 0,
|
||||
"rep_pen_slope": 1,
|
||||
"no_repeat_ngram_size": 0,
|
||||
"penalty_alpha": 0,
|
||||
"num_beams": 1,
|
||||
"length_penalty": 1,
|
||||
"min_length": 0,
|
||||
"encoder_rep_pen": 1,
|
||||
"freq_pen": 0,
|
||||
"presence_pen": 0,
|
||||
"skew": 0,
|
||||
"do_sample": true,
|
||||
"early_stopping": false,
|
||||
"dynatemp": false,
|
||||
"min_temp": 0,
|
||||
"max_temp": 0,
|
||||
"dynatemp_exponent": 1,
|
||||
"smoothing_factor": 0,
|
||||
"smoothing_curve": 1,
|
||||
"dry_allowed_length": 2,
|
||||
"dry_multiplier": 0,
|
||||
"dry_base": 1.75,
|
||||
"dry_sequence_breakers": "[\"\\n\", \":\", \"\\\"\", \"*\"]",
|
||||
"dry_penalty_last_n": 0,
|
||||
"add_bos_token": true,
|
||||
"ban_eos_token": false,
|
||||
"skip_special_tokens": true,
|
||||
"mirostat_mode": 0,
|
||||
"mirostat_tau": 5,
|
||||
"mirostat_eta": 0.1,
|
||||
"guidance_scale": 1,
|
||||
"negative_prompt": "",
|
||||
"grammar_string": "",
|
||||
"json_schema": {},
|
||||
"banned_tokens": "",
|
||||
"sampler_priority": [
|
||||
"repetition_penalty",
|
||||
"presence_penalty",
|
||||
"frequency_penalty",
|
||||
"dry",
|
||||
"temperature",
|
||||
"dynamic_temperature",
|
||||
"quadratic_sampling",
|
||||
"top_k",
|
||||
"top_p",
|
||||
"typical_p",
|
||||
"epsilon_cutoff",
|
||||
"eta_cutoff",
|
||||
"tfs",
|
||||
"top_a",
|
||||
"min_p",
|
||||
"mirostat",
|
||||
"xtc",
|
||||
"encoder_repetition_penalty",
|
||||
"no_repeat_ngram"
|
||||
],
|
||||
"samplers": [
|
||||
"top_k",
|
||||
"tfs_z",
|
||||
"typical_p",
|
||||
"top_p",
|
||||
"min_p",
|
||||
"temperature"
|
||||
],
|
||||
"ignore_eos_token": false,
|
||||
"spaces_between_special_tokens": false,
|
||||
"speculative_ngram": false,
|
||||
"sampler_order": [
|
||||
6,
|
||||
0,
|
||||
1,
|
||||
3,
|
||||
4,
|
||||
2,
|
||||
5
|
||||
],
|
||||
"logit_bias": [],
|
||||
"xtc_threshold": 0.08,
|
||||
"xtc_probability": 0.5,
|
||||
"ignore_eos_token_aphrodite": false,
|
||||
"spaces_between_special_tokens_aphrodite": true,
|
||||
"rep_pen_size": 0,
|
||||
"genamt": 1024,
|
||||
"max_length": 24576,
|
||||
"name": "Starcannon Unleashed 01"
|
||||
}
|
||||
}
|
||||
27
config.json
Normal file
27
config.json
Normal file
@@ -0,0 +1,27 @@
|
||||
{
|
||||
"_name_or_path": "G:\\text-generation-webui\\models\\MarinaraSpaghetti_NemoMix-Unleashed-12B",
|
||||
"architectures": [
|
||||
"MistralForCausalLM"
|
||||
],
|
||||
"attention_dropout": 0.0,
|
||||
"bos_token_id": 1,
|
||||
"eos_token_id": 2,
|
||||
"head_dim": 128,
|
||||
"hidden_act": "silu",
|
||||
"hidden_size": 5120,
|
||||
"initializer_range": 0.02,
|
||||
"intermediate_size": 14336,
|
||||
"max_position_embeddings": 1024000,
|
||||
"model_type": "mistral",
|
||||
"num_attention_heads": 32,
|
||||
"num_hidden_layers": 40,
|
||||
"num_key_value_heads": 8,
|
||||
"rms_norm_eps": 1e-05,
|
||||
"rope_theta": 1000000.0,
|
||||
"sliding_window": null,
|
||||
"tie_word_embeddings": false,
|
||||
"torch_dtype": "bfloat16",
|
||||
"transformers_version": "4.46.0",
|
||||
"use_cache": true,
|
||||
"vocab_size": 131072
|
||||
}
|
||||
19
mergekit_config.yml
Normal file
19
mergekit_config.yml
Normal file
@@ -0,0 +1,19 @@
|
||||
base_model: G:\text-generation-webui\models\MarinaraSpaghetti_NemoMix-Unleashed-12B
|
||||
dtype: bfloat16
|
||||
merge_method: della_linear
|
||||
parameters:
|
||||
epsilon: 0.05
|
||||
int8_mask: 1.0
|
||||
lambda: 1.0
|
||||
slices:
|
||||
- sources:
|
||||
- layer_range: [0, 40]
|
||||
model: G:\text-generation-webui\models\MarinaraSpaghetti_NemoMix-Unleashed-12B
|
||||
parameters:
|
||||
density: 0.65
|
||||
weight: 0.4
|
||||
- layer_range: [0, 40]
|
||||
model: G:\text-generation-webui\models\Nothingiisreal_MN-12B-Starcannon-v3
|
||||
parameters:
|
||||
density: 0.55
|
||||
weight: 0.6
|
||||
3
model-00001-of-00005.safetensors
Normal file
3
model-00001-of-00005.safetensors
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:153fdee00b95577657fb0a9bee5ebf6b0f4823db06d2fa14b64970f8ac053626
|
||||
size 4865489336
|
||||
3
model-00002-of-00005.safetensors
Normal file
3
model-00002-of-00005.safetensors
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:4ebed0bce63e6e05aadd76b8f5b6961f8d0e32bcfb6075acfc3480eb001c560d
|
||||
size 4907529456
|
||||
3
model-00003-of-00005.safetensors
Normal file
3
model-00003-of-00005.safetensors
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:28b29db84a7ebdbf0da66a7a15bf9203c48cef83a431db22589808dab70d1679
|
||||
size 4907529464
|
||||
3
model-00004-of-00005.safetensors
Normal file
3
model-00004-of-00005.safetensors
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:1af981e4c96f72ba3bc758752f98c97ab14c0a1da4fce847ac99ef08de6d9dd5
|
||||
size 4907529456
|
||||
3
model-00005-of-00005.safetensors
Normal file
3
model-00005-of-00005.safetensors
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:3a5b065bff330f850cb2e111404e37066c74a7f3cf0cef69a6290e3112c281a8
|
||||
size 4907529392
|
||||
1
model.safetensors.index.json
Normal file
1
model.safetensors.index.json
Normal file
File diff suppressed because one or more lines are too long
23
special_tokens_map.json
Normal file
23
special_tokens_map.json
Normal file
@@ -0,0 +1,23 @@
|
||||
{
|
||||
"bos_token": {
|
||||
"content": "<s>",
|
||||
"lstrip": false,
|
||||
"normalized": false,
|
||||
"rstrip": false,
|
||||
"single_word": false
|
||||
},
|
||||
"eos_token": {
|
||||
"content": "</s>",
|
||||
"lstrip": false,
|
||||
"normalized": false,
|
||||
"rstrip": false,
|
||||
"single_word": false
|
||||
},
|
||||
"unk_token": {
|
||||
"content": "<unk>",
|
||||
"lstrip": false,
|
||||
"normalized": false,
|
||||
"rstrip": false,
|
||||
"single_word": false
|
||||
}
|
||||
}
|
||||
409625
tokenizer.json
Normal file
409625
tokenizer.json
Normal file
File diff suppressed because it is too large
Load Diff
8013
tokenizer_config.json
Normal file
8013
tokenizer_config.json
Normal file
File diff suppressed because it is too large
Load Diff
Reference in New Issue
Block a user