Retreatcost/KansenSakura-Conflagration-RP-12b

Files

ModelHub XC 137dbc243c 初始化项目，由ModelHub XC社区提供模型

Model: Retreatcost/KansenSakura-Conflagration-RP-12b
Source: Original Platform

2026-06-03 15:12:20 +08:00

11 KiB

Raw Permalink Blame History

base_model, library_name, tags, license

base_model

library_name

KansenSakura-Conflagration-RP-12b

Fire consumes the garden
Petals burn to screaming light
IGNITION PROTOCOL: ACTIVE
Dialogue becomes wildfire
Actions speak in tongues of flame

DISCLAIMER

This is first model that I actually trained (very light finetune, but nevertheless), so expect it to have quirks and rough edges sometimes.

🔥 NAPALM-CLASS INTERACTION ENGINE

When Erosion finished its work, only ash remained.

From those cinders rises Conflagration - an engine that trades introspection for incineration, psychology for pyrotechnics.

Where Erosion analyzed, Conflagration acts.

Where Erosion pondered, Conflagration speaks.

This is not a model for those who watch the fire.

This is for those who become the flame.

⚡ OVERVIEW

KansenSakura-Conflagration-RP-12b is the white-hot successor to the Erosion engine, optimized for kinetic narrative exchange. Trading psychological depth for raw interactive intensity, Conflagration specializes in rapid-fire dialogue, decisive character action, and scenes that move at the speed of thought. The fire doesn't contemplate - it consumes.

🎯 DEPLOYMENT PROFILE

Optimal use cases:

🗣️ Rapid-fire character banter and dialogue
⚡ High-tension action sequences
🔥 Romantic or adversarial chemistry
💥 Scenes requiring decisive character action
🎭 Ensemble casts with clear character voices

⚠️ CONTAINMENT NOTES

This model exhibits:

Reduced introspection compared to Erosion
Heightened character reactivity
Optimized dialogue-to-prose ratio
NSFW capabilities (passionate/aggressive interactions)
Potential for scene momentum override

✍🏻 INFERENCE TIPS

Temperature: 0.8
Repetition Penalty: 1.05
TOP_P: 0.93
TOP_K: 0 (disable)
MIN_P: 0.025
Template Format: ChatML
Max Output: 360
Context Management: 16K

REPRODUCTION STEPS

Spoiler warning

Create intermediate models

SakuraSynth-12B:

merge_method: arcee_fusion
base_model: Marcjoni/KiloNovaSynth-12B
models:
  - model: PocketDoc/Dans-SakuraKaze-V1.0.0-12b
dtype: bfloat16
out_dtype: bfloat16

FinalWind-12B:

merge_method: ties
models:
  - model: PocketDoc/Dans-DangerousWinds-V1.1.0-12b
    parameters:
      density: [0.1, 0.25, 1.0, 0.5, 1.0]
      weight: 1.0
  - model: LatitudeGames/Wayfarer-2-12B
    parameters:
      density: [1.0, 0.5, 0.01]
      weight: [1.0, 0.5]
  - model: ReadyArt/Omega-Darker_The-Final-Directive-12B
    parameters:
      density: [0.75, 1.0, 0.6, 0.25, 0.01]
      weight: [1.0, 0.5]
  - model: yamatazen/EsotericSage-12B
    parameters:
      density: [0.1, 0.1, 0.25, 1.0, 0.01]
      weight: 1.0
  - model: bamec66557/Mistral-Nemo-VICIOUS_MESH-12B-2407
    parameters:
      density: [0.1, 0.5, 1.0, 0.5, 0.01]
      weight: 1.0
base_model: mistralai/Mistral-Nemo-Base-2407
parameters:
  normalize: true
dtype: float16

Darkness-12B:

merge_method: karcher
models:
  - model: ds_one
  - model: ds_two
  - model: ds_three
parameters:
  max_iter: 100000
  tol: 1e-9
dtype: bfloat16

This model is created by combining arcee_fusion merges of three models using task_arithmetic:

ds_one_f1:

merge_method: arcee_fusion
base_model: Violet-Lyra-Gutenberg-v2-ChatML-darker
models:
  - model: MN-Dark-Planet-TITAN-12B-ChatML-darker
dtype: bfloat16
out_dtype: bfloat16

ds_one_f2:

merge_method: arcee_fusion
base_model: Violet-Lyra-Gutenberg-v2-ChatML-darker
models:
  - model: mini-magnum-12b-v1.1-ChatML-darker
dtype: bfloat16
out_dtype: bfloat16

ds_one:

merge_method: task_arithmetic
base_model: Violet-Lyra-Gutenberg-v2-ChatML-darker
models:
  - model: ds_one_f1
    parameters: 
      weight: 0.5
  - model: ds_one_f2
    parameters: 
      weight: 0.5
normalize: false
dtype: bfloat16

The same procedure is repeated for each model as a base.

Base models were re-tokenised and reverse-abliterated:

merge_method: task_arithmetic
base_model: MN-Dark-Planet-TITAN-12B-ChatML
models:
  - model: MN-Dark-Planet-TITAN-12B-ChatML-abliterated
    parameters: 
      weight: 1.0
parameters:
  lambda: -1.0
normalize: false
dtype: bfloat16

By applying -1.0 to task_arithmetic we gain more censored model, that outputs considerably more unhinged and dark outputs. The idea behind was to create 3 such models and combine them in a way, that cancels out censored nature and boosts dark outputs.

Abliteration was made with llm-abliteration tools by grimjim Reference his awesome article for more in-depth explanations.

Create initial model

  merge_method: multislerp
  models:
    - model: yamatazen/FusionEngine-12B-Lorablated
      parameters:
        weight: [1.000, 1.000, 1.000, 1.000, 1.000, 0.968, 0.744, 0.256, 0.030, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.500, 0.500]
    - model: SakuraSynth-12B
      parameters:
        weight: [0.000, 0.000, 0.000, 0.000, 0.030, 0.256, 0.744, 0.968, 1.000, 1.000, 1.000, 1.000, 0.968, 0.744, 0.256, 0.030, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.500, 0.500]
    - model: Retreatcost/Chrysologus-12B
      parameters:
        weight: [0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.030, 0.256, 0.744, 0.968, 1.000, 1.000, 1.000, 0.968, 0.744, 0.256, 0.030, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000]
    - model: FinalWind-12B
      parameters:
        weight: [0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.030, 0.256, 0.744, 0.968, 1.000, 1.000, 1.000, 1.000, 0.968, 0.744, 0.256, 0.030, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000]
    - model: Darkness-12B
      parameters:
        weight: [0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.030, 0.256, 0.744, 0.968, 1.000, 1.000, 1.000, 1.000, 0.968, 0.744, 0.256, 0.030, 0.000, 0.000, 0.000, 0.000]
    - model: Pyrographos-12B
      parameters:
        weight: [0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.030, 0.256, 0.744, 0.968, 1.000, 1.000, 1.000, 0.000, 0.000]
  dtype: bfloat16
  parameters:
    normalize: true
  tokenizer_source: Retreatcost/KansenSakura-Erosion-RP-12b

Apply delerp merge

merge_method: delerp
base_model: mistralai/Mistral-Nemo-Base-2407
models:
  - model: KS-Conflagration
  - model: mistralai/Mistral-Nemo-Base-2407
parameters:
  t: [0.7, 0.88, 0.999, 0.88, 0.88, 0.999, 0.999]
dtype: bfloat16
tokenizer_source: Retreatcost/KansenSakura-Erosion-RP-12b

Another great article by grimjim This merge method essentially re-iflated model with original "Base" weights, while preserving instruction following and other model features.

I've used specific values of 0.7 and 0.999, that were used by original author and 0.88 as "middle ground" between those.

Apply nearswap merge

merge_method: nearswap
base_model: KS-Conflagration-Delerp
models:
  - model: KS-Conflagration
parameters:
  t: [0.0005, 0.0002, 0.0000, 0.0002, 0.0002, 0.0000, 0.0000]
dtype: bfloat16
tokenizer_source: Retreatcost/KansenSakura-Erosion-RP-12b

While I felt that delerp merge boosted writing, creativity and factual accuracy it also somewhat watered down NSFW and instruction following parts. This merge brings these capabilities back.

Do a finetuning

I did an rsLoRa r64 a64 finetuning, targeting all linear projections, that had 25K rows dataset of 4k length samples which contained hazardous and NSFW data.

Add anoter nearswap merge to boost NSFW/Dark aspects

merge_method: nearswap
base_model: KansenSakura-Conflagration-RP-12b-RC3
models:
  - model: KansenSakura-Conflagration-RP-12b-darken
parameters:
  t: [0.0000, 0.0000, 0.0000, 0.000275, 0.000550, 0.0000, 0.000275]
dtype: bfloat16
tokenizer_source: KansenSakura-Conflagration-RP-12b-RC3

After training the refusal curves were smoothed a lot and I felt that I could sacrifice a bit more of "uncensored" model to boost desired parts and I repeated the process from step 3. Reverse-abliteration and then using nearswap to gently boost already present features.

This is final step and this is how the model was produced.

🙏 ACKNOWLEDGMENTS

Original model authors
Awesome Ready.Art community
Team mradermacher: for awesome quants
DeathGodlike: for awesome quants in EXL3
You, the ember. Let your prompts ignite your desires.

🔥 AFTER ACTION REPORT

The garden burned. From its ashes grew something quicker, hotter, more immediate. Conflagration doesn't ask why the petals catch fire - it just ensures they burn brightly enough to light the way forward. Sometimes, the most profound statement is a well-timed action, not a perfectly crafted thought.

// CONFLAGRATION PROTOCOL: ONLINE
// READY TO IGNITE

11 KiB Raw Permalink Blame History