Initialize project; model provided by the ModelHub XC community

Model: DavidAU/L3-Dark-Planet-8B Source: Original Platform
2026-04-25 07:05:19 +08:00
commit de66af9714
25 changed files with 412900 additions and 0 deletions
--- a/.gitattributes
+++ b/.gitattributes
@@ -0,0 +1,36 @@
 *.7z filter=lfs diff=lfs merge=lfs -text
 *.arrow filter=lfs diff=lfs merge=lfs -text
 *.bin filter=lfs diff=lfs merge=lfs -text
 *.bz2 filter=lfs diff=lfs merge=lfs -text
 *.ckpt filter=lfs diff=lfs merge=lfs -text
 *.ftz filter=lfs diff=lfs merge=lfs -text
 *.gz filter=lfs diff=lfs merge=lfs -text
 *.h5 filter=lfs diff=lfs merge=lfs -text
 *.joblib filter=lfs diff=lfs merge=lfs -text
 *.lfs.* filter=lfs diff=lfs merge=lfs -text
 *.mlmodel filter=lfs diff=lfs merge=lfs -text
 *.model filter=lfs diff=lfs merge=lfs -text
 *.msgpack filter=lfs diff=lfs merge=lfs -text
 *.npy filter=lfs diff=lfs merge=lfs -text
 *.npz filter=lfs diff=lfs merge=lfs -text
 *.onnx filter=lfs diff=lfs merge=lfs -text
 *.ot filter=lfs diff=lfs merge=lfs -text
 *.parquet filter=lfs diff=lfs merge=lfs -text
 *.pb filter=lfs diff=lfs merge=lfs -text
 *.pickle filter=lfs diff=lfs merge=lfs -text
 *.pkl filter=lfs diff=lfs merge=lfs -text
 *.pt filter=lfs diff=lfs merge=lfs -text
 *.pth filter=lfs diff=lfs merge=lfs -text
 *.rar filter=lfs diff=lfs merge=lfs -text
 *.safetensors filter=lfs diff=lfs merge=lfs -text
 saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.tar.* filter=lfs diff=lfs merge=lfs -text
 *.tar filter=lfs diff=lfs merge=lfs -text
 *.tflite filter=lfs diff=lfs merge=lfs -text
 *.tgz filter=lfs diff=lfs merge=lfs -text
 *.wasm filter=lfs diff=lfs merge=lfs -text
 *.xz filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
 L3-Dark-Planet-brk5.dat filter=lfs diff=lfs merge=lfs -text
--- a/L3-Dark-Planet-brk5.dat
+++ b/L3-Dark-Planet-brk5.dat
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:ffbff1f4582a66d087794954134a0e960227b049a5578428e328cf4ac0a53431
 size 4988192
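The three lines above are a Git LFS pointer: the repository stores only this small text stub, while the real `.dat` file is fetched by its hash. As a minimal sketch (the helper name `parse_lfs_pointer` is hypothetical and not part of any Git LFS tooling), the pointer can be read like this:

```python
def parse_lfs_pointer(text):
    """Split a Git LFS pointer file into its fields (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>", separated by the first space.
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid field has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }


# Pointer content copied from the diff above.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:ffbff1f4582a66d087794954134a0e960227b049a5578428e328cf4ac0a53431
size 4988192
"""

pointer = parse_lfs_pointer(POINTER)
print(pointer["hash_algo"], pointer["size"])
```

The same helper applies unchanged to the `.safetensors` pointers further down in this commit.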
--- a/README.md
+++ b/README.md
@@ -0,0 +1,151 @@
 ---
 library_name: transformers
 tags:
 - mergekit
 - merge
 - llama-3
 - creative
 - creative writing
 - fiction writing
 - plot generation
 - sub-plot generation
 - fiction writing
 - story generation
 - scene continue
 - storytelling
 - fiction story
 - science fiction
 - romance
 - all genres
 - story
 - writing
 - vivid prose
 - vivid writing
 - fiction
 - roleplaying
 - bfloat16
 - swearing
 - rp
 - llama3
 - llama-3
 - llama-3.1
 - llama 3.1
 - llama3.1
 - horror
 - finetune
 - merge
 base_model: 
 - Sao10K/L3-8B-Stheno-v3.2
 - NeverSleep/Llama-3-Lumimaid-8B-v0.1-OAS
 - meta-llama/Meta-Llama-3-8B-Instruct
 - Hastagaras/Jamet-8B-L3-MK.V-Blackroot
 ---
 <h2>L3-Dark-Planet-8B</h2>
 This repo contains the full-precision source weights, in safetensors format, for generating GGUF, GPTQ, EXL2, AWQ, HQQ and other formats.
 The source can also be used directly.
 MERGEKIT FILE To Create this Model:
 ```
 models:
  - model: Sao10K/L3-8B-Stheno-v3.2
    parameters:
      weight: [1,1,.75,.5,.25,.25,.05,.01]
  - model: NeverSleep/Llama-3-Lumimaid-8B-v0.1-OAS
    parameters:
      weight: [0,0,.25,.35,.4,.25,.30,.04]
  - model: Hastagaras/Jamet-8B-L3-MK.V-Blackroot
    parameters:
      weight: [0,0,0,.15,.35,.5,.65,.95]
 merge_method: dare_ties
 base_model: meta-llama/Meta-Llama-3-8B-Instruct
 dtype: bfloat16
 ```
 MERGE FILE NOTES:
 You can replace the BASE model with a Llama 3.1 model to create a 128k version of this model.
 You can also replace the BASE model with the Nvidia 1, 2 or 4 million "Llama 3.1" base models.
 Likewise, you can replace the BASE model with different Llama 3s to create different flavors of this model.
 In this merge, the "base" model is consumed (weights replaced) COMPLETELY, yet characteristics of the base model will remain.
 If you alter the "weights" (at each position, the values for the 3 models together total 1, i.e. 100%) so that they total less than 1, the remainder will be "BASE" weights,
 and not all of the base model's weights will be consumed / replaced.
 You can of course, replace all the models too...
 !! => Each of the "weights" applies to 4 layers of the model (32/8); you can expand each list to 32 values to fine-tune each layer of each model.
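The notes above make two checkable claims: the three models' weights total 1 at each of the 8 positions, and each position covers 4 of the 32 layers. The sketch below checks the first and illustrates the second. The helper names are hypothetical, and the step-wise expansion follows the README's description only; mergekit's own handling of list values may differ (for example, by interpolating between them), so treat this as an illustration of the note rather than of mergekit internals.

```python
# Weights copied from the mergekit file above (8 values per model).
weights = {
    "Sao10K/L3-8B-Stheno-v3.2": [1, 1, .75, .5, .25, .25, .05, .01],
    "NeverSleep/Llama-3-Lumimaid-8B-v0.1-OAS": [0, 0, .25, .35, .4, .25, .30, .04],
    "Hastagaras/Jamet-8B-L3-MK.V-Blackroot": [0, 0, 0, .15, .35, .5, .65, .95],
}
NUM_LAYERS = 32  # num_hidden_layers in config.json


def column_totals(model_weights):
    """Sum the weights of all models at each position (hypothetical helper)."""
    return [round(sum(col), 6) for col in zip(*model_weights.values())]


def expand_per_layer(values, num_layers=NUM_LAYERS):
    """Repeat each value so every layer has an explicit weight (step-wise reading)."""
    span, remainder = divmod(num_layers, len(values))
    if remainder:
        raise ValueError("number of layers must be a multiple of the list length")
    return [v for v in values for _ in range(span)]


totals = column_totals(weights)
# Per the README, anything below 1 would be left to the BASE model.
base_share = [round(1 - t, 6) for t in totals]
per_layer = {name: expand_per_layer(vals) for name, vals in weights.items()}

print("totals per position:", totals)
print("share left to base:", base_share)
print("layers 8-11, Stheno:", per_layer["Sao10K/L3-8B-Stheno-v3.2"][8:12])
```

Editing the 32-value lists in `per_layer` is one way to prototype a per-layer variant before writing it back into a merge file.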
 MERGEKIT:
 https://github.com/arcee-ai/mergekit
 SPECIAL NOTE:
 If you want to see what happens when a "base" is replaced ("SuperNova"), and/or the formula is altered (including use of "density" - which causes random pruning) see the models
 in the "Dark Planet Wordstorm project" - a collection of 12 modified versions of Dark Planet - each model is unique and different from Dark Planet.
 <B>IMPORTANT: Highest Quality Settings / Optimal Operation Guide / Parameters and Samplers</B>
 If you are going to use this model (source, GGUF or a different quant), please review this document for critical parameter, sampler and advanced sampler settings (for multiple AI/LLM apps).
 This is a "Class 1" (settings will enhance operation) model:
 For all settings used for this model (including specifics for its "class"), example generation(s), and an advanced settings guide (which often addresses model issues), including methods to improve model performance for all use cases, such as chat and roleplay (and especially use cases beyond the model's design), please see:
 [ https://huggingface.co/DavidAU/Maximizing-Model-Performance-All-Quants-Types-And-Full-Precision-by-Samplers_Parameters ]
 REASON:
 Regardless of "model class" this document will detail methods to enhance operations.
 If the model is a Class 3/4 model, the default settings (parameters, samplers, advanced samplers) must be set correctly for the intended use case(s). Some AI/LLM apps DO NOT have consistent default settings, which results in sub-par model operation. Likewise, for Class 3/4 models (which operate somewhat to very differently than standard models), additional sampler and advanced sampler settings are required to "smooth out" operation AND/OR allow full operation for use cases the model was not designed for.
 BONUS - Use these settings for ANY model, ANY repo, ANY quant (including source/full precision):
 This document also details parameters, samplers and advanced samplers that can be used FOR ANY MODEL, FROM ANY REPO (all quants, and of course source operation too) to enhance the operation of any model.
 [ https://huggingface.co/DavidAU/Maximizing-Model-Performance-All-Quants-Types-And-Full-Precision-by-Samplers_Parameters ]
 NOTE:
 I strongly suggest you also visit the DavidAU GGUF repo (below) for more details on using this model, especially if it is "Class 3" or "Class 4", to get maximum performance from the model.
 For full information about this model, including:
 - Details about this model and its use case(s).
 - Context limits
 - Special usage notes / settings.
 - Any model(s) used to create this model.
 - Template(s) used to access/use this model.
 - Example generation(s)
 - GGUF quants of this model
 Please go to:
 [ https://huggingface.co/DavidAU/L3-Dark-Planet-8B-GGUF ]
 For "Dark Planet Horror City 8b" :
 [ https://huggingface.co/DavidAU/L3-Dark-Planet-Horror-City-8B-NEO-Imatrix-GGUF ] 
 (To create GGUFs for Dark Planet Horror City 8B use the ".dat" file for Imatrix in this repo )
 Additional Quants:
 [ https://huggingface.co/RichardErkhov/DavidAU_-_L3-Dark-Planet-8B-gguf ]
 [ https://huggingface.co/QuantFactory/L3-Dark-Planet-8B-GGUF ]
 Imatrix GGUFS:
 [ https://huggingface.co/mradermacher/L3-Dark-Planet-8B-i1-GGUF ]
 [ https://huggingface.co/mradermacher/L3-Dark-Planet-8B-GGUF ]
--- a/config.json
+++ b/config.json
@@ -0,0 +1,29 @@
 {
  "_name_or_path": "L3 - Dark Planet 8B",
  "architectures": [
    "LlamaForCausalLM"
  ],
  "attention_bias": false,
  "attention_dropout": 0.0,
  "bos_token_id": 128000,
  "eos_token_id": 128009,
  "hidden_act": "silu",
  "hidden_size": 4096,
  "initializer_range": 0.02,
  "intermediate_size": 14336,
  "max_position_embeddings": 8192,
  "mlp_bias": false,
  "model_type": "llama",
  "num_attention_heads": 32,
  "num_hidden_layers": 32,
  "num_key_value_heads": 8,
  "pretraining_tp": 1,
  "rms_norm_eps": 1e-05,
  "rope_scaling": null,
  "rope_theta": 500000.0,
  "tie_word_embeddings": false,
  "torch_dtype": "bfloat16",
  "transformers_version": "4.43.3",
  "use_cache": true,
  "vocab_size": 128256
 }
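Several useful quantities follow directly from the config.json values above. The sketch below derives the attention head size, the grouped-query ratio and an approximate parameter count. It assumes the usual `LlamaForCausalLM` layout (q/k/v/o projections, a gate/up/down MLP, two RMSNorm vectors per layer plus a final norm, and no bias terms, as `attention_bias` and `mlp_bias` are false); that layout is an assumption here, not something this commit states.

```python
# Values copied from config.json above.
config = {
    "hidden_size": 4096,
    "intermediate_size": 14336,
    "num_attention_heads": 32,
    "num_key_value_heads": 8,
    "num_hidden_layers": 32,
    "vocab_size": 128256,
    "tie_word_embeddings": False,
}

h = config["hidden_size"]
head_dim = h // config["num_attention_heads"]
kv_dim = head_dim * config["num_key_value_heads"]
# Query heads that share each key/value head (grouped-query attention).
queries_per_kv = config["num_attention_heads"] // config["num_key_value_heads"]

# Assumed per-layer layout: q and o are h x h, k and v are h x kv_dim.
attention = 2 * h * h + 2 * h * kv_dim
mlp = 3 * h * config["intermediate_size"]  # gate, up and down projections
norms = 2 * h                              # two RMSNorm weight vectors
per_layer = attention + mlp + norms

embedding = config["vocab_size"] * h
# With untied embeddings the output head is a second vocab x hidden matrix.
lm_head = 0 if config["tie_word_embeddings"] else embedding
total_params = config["num_hidden_layers"] * per_layer + embedding + lm_head + h

print("head_dim:", head_dim)
print("query heads per KV head:", queries_per_kv)
print("parameters:", total_params)
print("bf16 size in bytes:", 2 * total_params)
```

Under these assumptions the count comes to about 8.03 billion parameters, or roughly 16 GB at two bytes per bfloat16 value.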
--- a/model-00001-of-00017.safetensors
+++ b/model-00001-of-00017.safetensors
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:fb43f38e164eaf9621c493414e04fe17fc1e5b63d5b7977ae6ec0ace986ff334
 size 1050673280
--- a/model-00002-of-00017.safetensors
+++ b/model-00002-of-00017.safetensors
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:b9fd0a76a417779d3bf75c6f225c985231dc9dce994b2950d94b7c24ea22b8fb
 size 1050673296
--- a/model-00003-of-00017.safetensors
+++ b/model-00003-of-00017.safetensors
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:325353d88a74de9f455b7495d0e66bb33b07e3db35bb6d405ead7a3f7dca753d
 size 989899000
--- a/model-00004-of-00017.safetensors
+++ b/model-00004-of-00017.safetensors
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:a5eab5a017e6adc1fbb63c3f9e313e94c1bf7587ebf3a803bf331624e84cd658
 size 989890728
--- a/model-00005-of-00017.safetensors
+++ b/model-00005-of-00017.safetensors
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:c8de4ed6552fc417f873249661c297df22103a413959266c52c0d2701356e034
 size 998287760
--- a/model-00006-of-00017.safetensors
+++ b/model-00006-of-00017.safetensors
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:7c3f231c1a788aa65ced98d63e4912f8358d5e612d8ee916d91f5f85761d4f69
 size 947956216
--- a/model-00007-of-00017.safetensors
+++ b/model-00007-of-00017.safetensors
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:0f27f3b2fc40f259670f2ac506e25050042d15759bcef6e6923f1ab9e60235a6
 size 989890728
--- a/model-00008-of-00017.safetensors
+++ b/model-00008-of-00017.safetensors
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:70d4684fd42f984c74fffeb166e12418d49125783909d3c2973333d1cc318377
 size 989890720
--- a/model-00009-of-00017.safetensors
+++ b/model-00009-of-00017.safetensors
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:2df0b533fd2392c322376a37df661d84e9a424259ebd234409b6dc15a9b83f8a
 size 998287760
--- a/model-00010-of-00017.safetensors
+++ b/model-00010-of-00017.safetensors
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:ab044a61f44ef02e62a60dd36112917f8e79ff391adbdbad51dfca7b17833221
 size 947956216
--- a/model-00011-of-00017.safetensors
+++ b/model-00011-of-00017.safetensors
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:4ec5168b87320e66b3a121a0e8794768038d06f9affa9e242f597d304e45d9f0
 size 989890728
--- a/model-00012-of-00017.safetensors
+++ b/model-00012-of-00017.safetensors
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:4e2574b2950af670611b7564b18c689cd4959b514a839d7cf2a4430d08073dbe
 size 989890728
--- a/model-00013-of-00017.safetensors
+++ b/model-00013-of-00017.safetensors
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:98ee1574fd08d49d388e206f4c22fa0cf0dcbc077bc221d3fbb5ab0113a0a3ef
 size 998287752
--- a/model-00014-of-00017.safetensors
+++ b/model-00014-of-00017.safetensors
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:4420c6a891260e15e35d00c3ef9d5e8d0fad262cc504a1f5e16ff28604006df2
 size 947956208
--- a/model-00015-of-00017.safetensors
+++ b/model-00015-of-00017.safetensors
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:1ae2c0589db603a467398c52f7fd46e21410659c6d113f70ee5a77e31af1cc4c
 size 989890712
--- a/model-00016-of-00017.safetensors
+++ b/model-00016-of-00017.safetensors
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:c96c9e8cf64b87f073e08d1770ba2264557e295f7ae590c3e1331976c32d165f
 size 989890712
--- a/model-00017-of-00017.safetensors
+++ b/model-00017-of-00017.safetensors
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:3ae40ea943e8e1d265822adee1b2ea8bce46ac134990ff5a7f5ebc4a0c2b2907
 size 201343792
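As a quick sanity check on a download, the `size` fields of the 17 shard pointers above can be added up. This is only a sketch: the sizes are copied from this commit, and the parameter estimate assumes every tensor is stored as 2-byte bfloat16 (per `torch_dtype` in config.json), ignoring the small per-file safetensors headers.

```python
# Sizes in bytes, copied from the 17 Git LFS pointers above (shards 1 to 17).
SHARD_SIZES = [
    1050673280, 1050673296, 989899000, 989890728, 998287760, 947956216,
    989890728, 989890720, 998287760, 947956216, 989890728, 989890728,
    998287752, 947956208, 989890712, 989890712, 201343792,
]

total_bytes = sum(SHARD_SIZES)
total_gib = total_bytes / 2**30

# Assumption: 2 bytes per value (bfloat16). File headers make this a slight overestimate.
approx_params = total_bytes // 2

print("shards:", len(SHARD_SIZES))
print("total bytes:", total_bytes)
print("total GiB:", round(total_gib, 2))
print("approx. parameters (billions):", round(approx_params / 1e9, 2))
```

A local copy whose `.safetensors` files do not add up to this total is likely incomplete, or still contains LFS pointer stubs instead of the real files.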
--- a/model.safetensors.index.json
+++ b/model.safetensors.index.json
--- a/special_tokens_map.json
+++ b/special_tokens_map.json
@@ -0,0 +1,4 @@
 {
  "bos_token": "<|begin_of_text|>",
  "eos_token": "<|end_of_text|>"
 }
--- a/tokenizer.json
+++ b/tokenizer.json
--- a/tokenizer_config.json
+++ b/tokenizer_config.json