1741 lines
53 KiB
Markdown
1741 lines
53 KiB
Markdown
---
|
|
base_model:
|
|
- allura-org/Tlacuilo-12B
|
|
- ChaoticNeutrals/Captain_Eris_Noctis-12B-v0.420
|
|
- DavidAU/Mistral-Nemo-2407-12B-Thinking-Claude-Gemini-GPT5.2-Uncensored-HERETIC
|
|
- DavidAU/MN-Dark-Planet-TITAN-12B
|
|
- DavidAU/MN-GRAND-Gutenberg-Lyra4-Lyra-12B-DARKNESS
|
|
- DreadPoor/Famino-12B-Model_Stock
|
|
- EldritchLabs/Cactus-Dream-Horror-12B
|
|
- EldritchLabs/Human-Like-Mistral-Nemo-Instruct-2407-MPOA
|
|
- EldritchLabs/Kraken-Karcher-12B-v1
|
|
- EldritchLabs/MN-12B-Mag-Mell-R1-Uncensored-Scale1.2
|
|
- EldritchLabs/MN-12B-RP-Ink-Longform-MPOA
|
|
- Epiculous/Violet_Twilight-v0.2
|
|
- IggyLux/MN-VelvetCafe-RP-12B-V2
|
|
- inflatebot/MN-12B-Mag-Mell-R1
|
|
- LatitudeGames/Muse-12B
|
|
- LatitudeGames/Wayfarer-2-12B
|
|
- mistralai/Mistral-Nemo-Instruct-2407
|
|
- MrRikyz/StarlightMoon-Foxfire-12B
|
|
- MuXodious/Rocinante-X-12B-v1-absolute-heresy
|
|
- Naphula/Ancient-Awakening-12B
|
|
- Naphula/Riemannian-Redshift-12B-v1
|
|
- ohyeah1/Violet-Lyra-Gutenberg-v2
|
|
- PocketDoc/Dans-SakuraKaze-V1.0.0-12b
|
|
- PygmalionAI/Pygmalion-3-12B
|
|
- rAIfle/Questionable-MN-bf16
|
|
- ReadyArt/Dark-Nexus-12B-v2.0
|
|
- ReadyArt/Forgotten-Safeword-12B-v4.0
|
|
- redrix/GodSlayer-12B-ABYSS
|
|
- Retreatcost/Chrysologus-12B
|
|
- Retreatcost/Impish-LongPen-12B
|
|
- Retreatcost/KansenSakura-Conflagration-RP-12b
|
|
- SicariusSicariiStuff/Impish_Bloodmoon_12B
|
|
- Sorihon/Celestial-Queen-12B-Heretic
|
|
- SuperbEmphasis/MN-12b-RP-Ink-RP-Longform
|
|
- SuperbEmphasis/Omega-Darker_The-Final-Directive-Longform-Stage2-ERP-12B-v0.2
|
|
- TheDrummer/Rocinante-X-12B-v1
|
|
- Vortex5/Aurora-Mirage-12B
|
|
- Vortex5/Prototype-X-12b
|
|
datasets:
|
|
- OccultAI/illuminati_imatrix_v1
|
|
language:
|
|
- en
|
|
library_name: transformers
|
|
license: apache-2.0
|
|
tags:
|
|
- creative
|
|
- creative writing
|
|
- fiction writing
|
|
- plot generation
|
|
- sub-plot generation
|
|
- fiction writing
|
|
- story generation
|
|
- scene continue
|
|
- storytelling
|
|
- fiction story
|
|
- science fiction
|
|
- romance
|
|
- all genres
|
|
- story
|
|
- writing
|
|
- vivid prosing
|
|
- vivid writing
|
|
- fiction
|
|
- roleplaying
|
|
- float32
|
|
- swearing
|
|
- rp
|
|
- horror
|
|
- mistral
|
|
- nemo
|
|
- merge
|
|
- mergekit
|
|
widget:
|
|
- text: "🦑 KrakenSakura-Maelström-12B-v1"
|
|
output:
|
|
url: https://cdn-uploads.huggingface.co/production/uploads/68e840caa318194c44ec2a04/ssPstIfsBZ-lw4Skif0w7.png
|
|
---
|
|
|
|
<audio controls src="https://cdn-uploads.huggingface.co/production/uploads/68e840caa318194c44ec2a04/y6TzIZP4XvaH_B7iQKV6O.mpga"></audio>
|
|
|
|
> [!WARNING]
|
|
> <span style="color:red; font-weight:bold">⚠️ Warning:</span> This model can produce narratives and RP that contain violent and graphic erotic content. Adjust your system prompt accordingly, and use **ChatML** (recommended) or **Mistral Tekken/NonTekken** chat template.
|
|
>
|
|
|
|
<!DOCTYPE html>
|
|
<style>
|
|
body {
|
|
font-family: 'Segoe UI', Tahoma, Geneva, Verdana, sans-serif;
|
|
color: #D1D5DB; /* Pale stone gray */
|
|
line-height: 1.6;
|
|
margin: 0;
|
|
padding: 0;
|
|
background-color: #0A0C10; /* Very dark stormy gray/black */
|
|
}
|
|
|
|
b, strong {
|
|
color: #FBBF24; /* Glowing amber/gold */
|
|
text-shadow: 0 0 8px rgba(251, 191, 36, 0.4);
|
|
}
|
|
|
|
.awakening-text {
|
|
color: #FEF3C7; /* Pale inner-eye yellow */
|
|
position: relative;
|
|
z-index: 2;
|
|
margin-left: 0.2em;
|
|
text-shadow: 0 0 15px #F59E0B, 0 0 30px #B45309; /* Deep fiery orange/gold glow */
|
|
font-size: 1.6rem;
|
|
letter-spacing: 1px;
|
|
font-weight: 600;
|
|
}
|
|
|
|
/* Section styling */
|
|
.section-container {
|
|
background-color: rgba(17, 24, 39, 0.85); /* Dark slate rock */
|
|
margin-bottom: 30px;
|
|
position: relative;
|
|
overflow: hidden;
|
|
border-bottom: 1px solid #78350F; /* Dark bronze/earth */
|
|
box-shadow: 0 4px 20px rgba(0, 0, 0, 0.6);
|
|
}
|
|
|
|
.section-header {
|
|
display: flex;
|
|
align-items: center;
|
|
background-color: rgba(245, 158, 11, 0.05); /* Faint amber tint */
|
|
padding: 10px 20px;
|
|
border-top: 1px solid rgba(120, 53, 15, 0.4);
|
|
}
|
|
|
|
.section-indicator {
|
|
width: 8px;
|
|
height: 20px;
|
|
background-color: #F59E0B; /* Amber eye color */
|
|
margin-right: 15px;
|
|
box-shadow: 0 0 10px rgba(245, 158, 11, 0.6);
|
|
border-radius: 2px;
|
|
}
|
|
|
|
.section-title {
|
|
font-family: 'Georgia', 'Times New Roman', serif; /* Ancient tome feel */
|
|
color: #FDE68A; /* Light gold */
|
|
font-size: 1.4rem;
|
|
margin: 0;
|
|
letter-spacing: 1px;
|
|
font-weight: 400;
|
|
text-transform: capitalize;
|
|
}
|
|
|
|
.section-content {
|
|
padding: 20px;
|
|
font-family: sans-serif;
|
|
color: #D1D5DB;
|
|
line-height: 1.6;
|
|
}
|
|
|
|
/* Title styling */
|
|
.title-container {
|
|
background-color: #050505; /* Pitch black */
|
|
position: relative;
|
|
overflow: hidden;
|
|
margin-bottom: 40px;
|
|
border-left: 4px solid #F59E0B; /* Amber pillar */
|
|
box-shadow: 0 6px 25px rgba(245, 158, 11, 0.15);
|
|
}
|
|
|
|
.title-wrapper {
|
|
position: relative;
|
|
z-index: 2;
|
|
padding: 25px 20px 30px 30px;
|
|
font-family: 'Georgia', 'Times New Roman', serif;
|
|
}
|
|
|
|
.title-main {
|
|
color: #FEF3C7;
|
|
font-size: 2.0rem;
|
|
font-weight: 700;
|
|
margin: 0;
|
|
letter-spacing: 2px;
|
|
display: inline-block;
|
|
position: relative;
|
|
text-transform: uppercase;
|
|
}
|
|
|
|
.storm-overlay {
|
|
position: absolute;
|
|
top: 0;
|
|
left: 0;
|
|
width: 100%;
|
|
height: 100%;
|
|
/* Dark, brooding radial fog mimicking the eye's aura */
|
|
background-image: radial-gradient(circle at 50% 50%, rgba(245, 158, 11, 0.08) 0%, rgba(0,0,0,0.9) 80%);
|
|
z-index: 1;
|
|
}
|
|
|
|
/* Subheading styling */
|
|
.subheading {
|
|
color: #D97706; /* Deep orange */
|
|
font-size: 1.1rem;
|
|
margin-top: 20px;
|
|
margin-bottom: 15px;
|
|
font-weight: 400;
|
|
border-bottom: 1px dashed rgba(217, 119, 6, 0.4);
|
|
display: inline-block;
|
|
text-transform: uppercase;
|
|
letter-spacing: 1px;
|
|
font-family: 'Georgia', 'Times New Roman', serif;
|
|
}
|
|
|
|
/* Links */
|
|
a {
|
|
color: #FBBF24; /* Amber */
|
|
text-decoration: none;
|
|
transition: color 0.3s ease, text-shadow 0.3s ease;
|
|
}
|
|
|
|
a:hover {
|
|
text-decoration: underline;
|
|
color: #FDE68A; /* Brighter gold */
|
|
text-shadow: 0 0 8px rgba(251, 191, 36, 0.5);
|
|
}
|
|
|
|
/* Container */
|
|
.container {
|
|
max-width: 1200px;
|
|
margin: 20px auto;
|
|
padding: 40px 20px;
|
|
background-color: #0D1117; /* Deep stormy night */
|
|
background-image:
|
|
radial-gradient(circle at 15% 85%, rgba(120, 53, 15, 0.1) 0%, transparent 50%),
|
|
radial-gradient(circle at 85% 15%, rgba(245, 158, 11, 0.05) 0%, transparent 50%);
|
|
min-height: calc(100vh - 40px);
|
|
border: 1px solid #1F2937; /* Dark stone border */
|
|
border-radius: 8px;
|
|
box-shadow: 0 8px 40px rgba(0, 0, 0, 0.9), inset 0 0 20px rgba(0, 0, 0, 0.5);
|
|
}
|
|
|
|
/* Code blocks */
|
|
pre {
|
|
background-color: #050505; /* Pitch black */
|
|
border: 1px solid #1F2937; /* Dark stone */
|
|
border-left: 3px solid #92400E; /* Dark orange/brown */
|
|
padding: 15px;
|
|
border-radius: 4px;
|
|
color: #D1D5DB;
|
|
overflow-x: auto;
|
|
}
|
|
code {
|
|
font-family: 'Courier New', Courier, monospace;
|
|
color: #FBBF24; /* Amber */
|
|
background-color: rgba(245, 158, 11, 0.08);
|
|
padding: 2px 4px;
|
|
border-radius: 3px;
|
|
}
|
|
pre code {
|
|
color: #00FFFF;
|
|
background-color: transparent;
|
|
padding: 0;
|
|
}
|
|
|
|
</style>
|
|
<html lang="en">
|
|
<head>
|
|
<meta charset="UTF-8">
|
|
<meta name="viewport" content="width=device-width, initial-scale=1.0">
|
|
<title>🦑 KrakenSakura Maelström 12B v1</title>
|
|
</head>
|
|
<body>
|
|
|
|
<div class="container">
|
|
<div class="title-container">
|
|
<div class="storm-overlay"></div>
|
|
<div class="title-wrapper">
|
|
<h2 class="title-main">
|
|
<span class="awakening-text">🦑 KrakenSakura Maelström 12B v1</span>
|
|
</h2>
|
|
</div>
|
|
</div>
|
|
|
|
<img src="https://cdn-uploads.huggingface.co/production/uploads/68e840caa318194c44ec2a04/ssPstIfsBZ-lw4Skif0w7.png"
|
|
alt="KrakenSakura Maelström"
|
|
style="display: block; margin: 0 auto 30px auto; max-width: 100%; height: auto; border-radius: 5px; border: 1px solid #1F2937; box-shadow: 0 0 25px rgba(245, 158, 11, 0.15);">
|
|
|
|
<div class="section-container">
|
|
<div class="section-header">
|
|
<div class="section-indicator"></div>
|
|
<h2 class="section-title">Overview</h2>
|
|
</div>
|
|
<div class="section-content"><font face="verdana">
|
|
This is a merge of pre-trained language models created using <a href="https://github.com/cg123/mergekit">mergekit</a>.<br><br><b>KrakenSakura</b> is fully uncensored, no jailbreaks or ablation needed. This model produces unique output and is calibrated for <b>NSFW/RP</b>. Unlike most Nemo models, KrakenSakura avoids generic "Lily" stories and instead wrote about "a brave knight named Sir Galahad".<br><br><b>Note:</b> I only saw refusals when using high temps. Lower the temp or re-roll if you encounter censorship.<br><br>For instruct tag preset use either <b>ChatML</b> or <b>Mistral Tekken/NonTekken</b>. ChatML is a bit more stable and recommended, while Tekken/NonTekken is slightly more creative, but at the cost of occasional errors such as inserting random russian words like <code>командира</code>. Experiment to see which template works best for your use case.
|
|
</div>
|
|
</div>
|
|
|
|
<div class="section-container">
|
|
<div class="section-header">
|
|
<div class="section-indicator"></div>
|
|
<h2 class="section-title">Merge Details</h2>
|
|
</div>
|
|
<div class="section-content"><font face="verdana">
|
|
<b>Merge Methods</b><br>
|
|
This model was synthesized using a complex multi-stage process involving the following 18 methods:
|
|
<ul>
|
|
<li><a href="https://en.wikipedia.org/wiki/Slerp">nuslerp</a></li>
|
|
<li><a href="https://github.com/arcee-ai/mergekit/blob/main/docs/merge_methods.md">passthrough</a></li>
|
|
<li><a href="https://huggingface.co/24B-Suite/Mergedonia-Suite-24B-v1/discussions/2">pdq</a></li>
|
|
<li><a href="https://arxiv.org/abs/2406.11617">della</a></li>
|
|
<li><a href="https://huggingface.co/24B-Suite/Mergedonia-Suite-24B-v1/discussions/2">chiral_qhe</a></li>
|
|
<li><a href="https://www.arcee.ai/blog/meet-mergekit-v0-1-arcee-fusion-expanded-model-support-multi-gpu-acceleration">arcee_fusion</a></li>
|
|
<li><a href="https://huggingface.co/alchemonaut/QuartetAnemoi-70B-t0.0001">nearswap</a></li>
|
|
<li><a href="https://en.wikipedia.org/wiki/Slerp">multislerp</a></li>
|
|
<li><a href="https://arxiv.org/abs/2406.11617">della_linear</a></li>
|
|
<li><a href="https://en.wikipedia.org/wiki/Karcher_mean">karcher</a></li>
|
|
<li><a href="https://huggingface.co/24B-Suite/Mergedonia-Suite-24B-v1/discussions/2">flux</a></li>
|
|
<li><a href="https://huggingface.co/datasets/OccultAI/Script_Tests/discussions/1">rsce</a></li>
|
|
<li><a href="https://huggingface.co/24B-Suite/Mergedonia-Suite-24B-v1/discussions/2">magic</a></li>
|
|
<li><a href="https://huggingface.co/Naphula-Archives/arcee_multifusion_prototype-24B-Q8_0-GGUF">arcee_multifusion</a></li>
|
|
<li><a href="https://arxiv.org/abs/2311.03099">dare_linear</a></li>
|
|
<li><a href="https://huggingface.co/blog/grimjim/delerp-merge-method">delerp</a></li>
|
|
<li><a href="https://huggingface.co/24B-Suite/Mergedonia-Suite-24B-v1/discussions/2">cvs</a></li>
|
|
<li><a href="https://huggingface.co/24B-Suite/Mergedonia-Suite-24B-v1/discussions/2">delerp_della</a></li>
|
|
</ul>
|
|
<br>The <a href="https://huggingface.co/spaces/Naphula/model_tools/blob/main/graph_v18.py">graph_v18.py</a> patch was helpful to use 8GB VRAM for acceleration.
|
|
<hr>
|
|
<b>Models Merged</b><br>
|
|
The following 38 models were alchemized into this merge:<br><br>
|
|
<details>
|
|
<summary style="cursor: pointer; color: #FBBF24; font-weight: bold;">Show 38 Donor Models</summary>
|
|
<ul>
|
|
<li><a href="https://huggingface.co/allura-org/Tlacuilo-12B">allura-org/Tlacuilo-12B</a></li>
|
|
<li><a href="https://huggingface.co/ChaoticNeutrals/Captain_Eris_Noctis-12B-v0.420">ChaoticNeutrals/Captain_Eris_Noctis-12B-v0.420</a></li>
|
|
<li><a href="https://huggingface.co/DavidAU/Mistral-Nemo-2407-12B-Thinking-Claude-Gemini-GPT5.2-Uncensored-HERETIC">DavidAU/Mistral-Nemo-2407-12B-Thinking-Claude-Gemini-GPT5.2-Uncensored-HERETIC</a></li>
|
|
<li><a href="https://huggingface.co/DavidAU/MN-Dark-Planet-TITAN-12B">DavidAU/MN-Dark-Planet-TITAN-12B</a></li>
|
|
<li><a href="https://huggingface.co/DavidAU/MN-GRAND-Gutenberg-Lyra4-Lyra-12B-DARKNESS">DavidAU/MN-GRAND-Gutenberg-Lyra4-Lyra-12B-DARKNESS</a></li>
|
|
<li><a href="https://huggingface.co/DreadPoor/Famino-12B-Model_Stock">DreadPoor/Famino-12B-Model_Stock</a></li>
|
|
<li><a href="https://huggingface.co/EldritchLabs/Cactus-Dream-Horror-12B">EldritchLabs/Cactus-Dream-Horror-12B</a></li>
|
|
<li><a href="https://huggingface.co/EldritchLabs/Human-Like-Mistral-Nemo-Instruct-2407-MPOA">EldritchLabs/Human-Like-Mistral-Nemo-Instruct-2407-MPOA</a></li>
|
|
<li><a href="https://huggingface.co/EldritchLabs/Kraken-Karcher-12B-v1">EldritchLabs/Kraken-Karcher-12B-v1</a></li>
|
|
<li><a href="https://huggingface.co/EldritchLabs/MN-12B-Mag-Mell-R1-Uncensored-Scale1.2">EldritchLabs/MN-12B-Mag-Mell-R1-Uncensored-Scale1.2</a></li>
|
|
<li><a href="https://huggingface.co/EldritchLabs/MN-12B-RP-Ink-Longform-MPOA">EldritchLabs/MN-12B-RP-Ink-Longform-MPOA</a></li>
|
|
<li><a href="https://huggingface.co/Epiculous/Violet_Twilight-v0.2">Epiculous/Violet_Twilight-v0.2</a></li>
|
|
<li><a href="https://huggingface.co/IggyLux/MN-VelvetCafe-RP-12B-V2">IggyLux/MN-VelvetCafe-RP-12B-V2</a></li>
|
|
<li><a href="https://huggingface.co/inflatebot/MN-12B-Mag-Mell-R1">inflatebot/MN-12B-Mag-Mell-R1</a></li>
|
|
<li><a href="https://huggingface.co/LatitudeGames/Muse-12B">LatitudeGames/Muse-12B</a></li>
|
|
<li><a href="https://huggingface.co/LatitudeGames/Wayfarer-2-12B">LatitudeGames/Wayfarer-2-12B</a></li>
|
|
<li><a href="https://huggingface.co/mistralai/Mistral-Nemo-Instruct-2407">mistralai/Mistral-Nemo-Instruct-2407</a></li>
|
|
<li><a href="https://huggingface.co/MrRikyz/StarlightMoon-Foxfire-12B">MrRikyz/StarlightMoon-Foxfire-12B</a></li>
|
|
<li><a href="https://huggingface.co/MuXodious-/Rocinante-X-12B-v1-absolute-heresy">MuXodious/Rocinante-X-12B-v1-absolute-heresy</a></li>
|
|
<li><a href="https://huggingface.co/Naphula/Ancient-Awakening-12B">Naphula/Ancient-Awakening-12B</a></li>
|
|
<li><a href="https://huggingface.co/Naphula/Riemannian-Redshift-12B-v1">Naphula/Riemannian-Redshift-12B-v1</a></li>
|
|
<li><a href="https://huggingface.co/ohyeah1/Violet-Lyra-Gutenberg-v2">ohyeah1/Violet-Lyra-Gutenberg-v2</a></li>
|
|
<li><a href="https://huggingface.co/PocketDoc/Dans-SakuraKaze-V1.0.0-12b">PocketDoc/Dans-SakuraKaze-V1.0.0-12b</a></li>
|
|
<li><a href="https://huggingface.co/PygmalionAI/Pygmalion-3-12B">PygmalionAI/Pygmalion-3-12B</a></li>
|
|
<li><a href="https://huggingface.co/rAIfle/Questionable-MN-bf16">rAIfle/Questionable-MN-bf16</a></li>
|
|
<li><a href="https://huggingface.co/ReadyArt/Dark-Nexus-12B-v2.0">ReadyArt/Dark-Nexus-12B-v2.0</a></li>
|
|
<li><a href="https://huggingface.co/ReadyArt/Forgotten-Safeword-12B-v4.0">ReadyArt/Forgotten-Safeword-12B-v4.0</a></li>
|
|
<li><a href="https://huggingface.co/redrix/GodSlayer-12B-ABYSS">redrix/GodSlayer-12B-ABYSS</a></li>
|
|
<li><a href="https://huggingface.co/Retreatcost/Chrysologus-12B">Retreatcost/Chrysologus-12B</a></li>
|
|
<li><a href="https://huggingface.co/Retreatcost/Impish-LongPen-12B">Retreatcost/Impish-LongPen-12B</a></li>
|
|
<li><a href="https://huggingface.co/Retreatcost/KansenSakura-Conflagration-RP-12b">Retreatcost/KansenSakura-Conflagration-RP-12b</a></li>
|
|
<li><a href="https://huggingface.co/SicariusSicariiStuff/Impish_Bloodmoon_12B">SicariusSicariiStuff/Impish_Bloodmoon_12B</a></li>
|
|
<li><a href="https://huggingface.co/Sorihon/Celestial-Queen-12B-Heretic">Sorihon/Celestial-Queen-12B-Heretic</a></li>
|
|
<li><a href="https://huggingface.co/SuperbEmphasis/MN-12b-RP-Ink-RP-Longform">SuperbEmphasis/MN-12b-RP-Ink-RP-Longform</a></li>
|
|
<li><a href="https://huggingface.co/SuperbEmphasis/Omega-Darker_The-Final-Directive-Longform-Stage2-ERP-12B-v0.2">SuperbEmphasis/Omega-Darker_The-Final-Directive-Longform-Stage2-ERP-12B-v0.2</a></li>
|
|
<li><a href="https://huggingface.co/TheDrummer/Rocinante-X-12B-v1">TheDrummer/Rocinante-X-12B-v1</a></li>
|
|
<li><a href="https://huggingface.co/Vortex5/Aurora-Mirage-12B">Vortex5/Aurora-Mirage-12B</a></li>
|
|
<li><a href="https://huggingface.co/Vortex5/Prototype-X-12b">Vortex5/Prototype-X-12b</a></li>
|
|
</ul>
|
|
</div>
|
|
</details>
|
|
</div>
|
|
|
|
<div class="section-container">
|
|
<div class="section-header">
|
|
<div class="section-indicator"></div>
|
|
<h2 class="section-title">Merge Pipeline & Configuration</h2>
|
|
</div>
|
|
<div class="section-content">
|
|
<p><b>🦑 KrakenSakura Maelström 12B v1</b> unites several methods and 38 models into one. This is a highly experimental merge that required 41 steps to build.</p>
|
|
🔑 Here is the "master key" for the 18 nuslerps and passthrough:
|
|
<ul>
|
|
<li> <b>0-5</b> <code>IggyLux/MN-VelvetCafe-RP-12B-V2</code></li>
|
|
<li> <b>5-14</b> SLERP4 = SLERP1 (<code>inflatebot/MN-12B-Mag-Mell-R1</code> + <code>PygmalionAI/Pygmalion-3-12B</code>) + SLERP3 (<code>TheDrummer/Rocinante-X-12B-v1</code> + SLERP2 (<code>DavidAU/MN-Dark-Planet-TITAN-12B</code> + <code>DavidAU/MN-GRAND-Gutenberg-Lyra4-Lyra-12B-DARKNESS</code>)</li>
|
|
<li> <b>14-22</b> SLERP7 = <code>SicariusSicariiStuff/Impish_Bloodmoon_12B</code> + SLERP6 (<code>MrRikyz/StarlightMoon-Foxfire-12B</code> + SLERP5 (<code>Retreatcost/Impish-LongPen-12B</code> + <code>Retreatcost/Chrysologus-12B</code>))</li>
|
|
<li> <b>22-29</b> SLERP12 = SLERP9 (<code>ReadyArt/Dark-Nexus-12B-v2.0</code> + SLERP8 (<code>ReadyArt/Forgotten-Safeword-12B-v4.0</code> + <code>SuperbEmphasis/Omega-Darker_The-Final-Directive-Longform-Stage2-ERP-12B-v0.2</code>)) + SLERP11 (<code>ohyeah1/Violet-Lyra-Gutenberg-v2</code> + SLERP10 (<code>Epiculous/Violet_Twilight-v0.2</code> + <code>ChaoticNeutrals/Captain_Eris_Noctis-12B-v0.420</code>))</li>
|
|
<li> <b>29-34</b> SLERP16 = SLERP13 (<code>Vortex5/Aurora-Mirage-12B</code> + <code>Vortex5/Prototype-X-12b</code>) + SLERP15 (<code>Naphula/Ancient-Awakening-12B</code> + SLERP14 (<code>redrix/GodSlayer-12B-ABYSS</code> + <code>Retreatcost/KansenSakura-Conflagration-RP-12b</code>))</li>
|
|
<li> <b>34-39</b> SLERP18 = <code>LatitudeGames/Muse-12B</code> + SLERP17 (<code>LatitudeGames/Wayfarer-2-12B</code> + <code>allura-org/Tlacuilo-12B</code>)</li>
|
|
<li> <b>39-40</b> <code>PocketDoc/Dans-SakuraKaze-V1.0.0-12b</code></li>
|
|
</ul>
|
|
<hr>
|
|
<details>
|
|
<summary style="cursor: pointer; color: #FBBF24; font-weight: bold;">Show 41 YAML Configs</summary>
|
|
<h3 class="subheading">Stage 1: nuslerp1</h3>
|
|
<pre><code>architecture: MistralForCausalLM
|
|
merge_method: nuslerp
|
|
dtype: float32
|
|
out_dtype: bfloat16
|
|
models:
|
|
- model: B:\12B\models--inflatebot--MN-12B-Mag-Mell-R1
|
|
parameters:
|
|
weight: 0.5
|
|
- model: B:\12B\models--PygmalionAI--Pygmalion-3-12B
|
|
parameters:
|
|
weight: 0.5
|
|
parameters:
|
|
tokenizer:
|
|
source: union
|
|
chat_template: auto</code></pre>
|
|
|
|
<h3 class="subheading">Stage 2: nuslerp2</h3>
|
|
<pre><code>architecture: MistralForCausalLM
|
|
merge_method: nuslerp
|
|
dtype: float32
|
|
out_dtype: bfloat16
|
|
models:
|
|
- model: B:\12B\models--DavidAU--MN-Dark-Planet-TITAN-12B
|
|
parameters:
|
|
weight: 0.5
|
|
- model: B:\12B\models--DavidAU--MN-GRAND-Gutenberg-Lyra4-Lyra-12B-DARKNESS
|
|
parameters:
|
|
weight: 0.5
|
|
parameters:
|
|
tokenizer:
|
|
source: union
|
|
chat_template: auto</code></pre>
|
|
|
|
<h3 class="subheading">Stage 3: nuslerp3</h3>
|
|
<pre><code>architecture: MistralForCausalLM
|
|
merge_method: nuslerp
|
|
dtype: float32
|
|
out_dtype: bfloat16
|
|
models:
|
|
- model: B:\12B\models--TheDrummer--Rocinante-X-12B-v1
|
|
parameters:
|
|
weight: 0.5
|
|
- model: B:\12B\SLERP2
|
|
parameters:
|
|
weight: 0.5
|
|
parameters:
|
|
tokenizer:
|
|
source: union
|
|
chat_template: auto</code></pre>
|
|
|
|
<h3 class="subheading">Stage 4: nuslerp4</h3>
|
|
<pre><code>architecture: MistralForCausalLM
|
|
merge_method: nuslerp
|
|
dtype: float32
|
|
out_dtype: bfloat16
|
|
models:
|
|
- model: B:\12B\SLERP1
|
|
parameters:
|
|
weight: 0.5
|
|
- model: B:\12B\SLERP2
|
|
parameters:
|
|
weight: 0.5
|
|
parameters:
|
|
tokenizer:
|
|
source: union
|
|
chat_template: auto</code></pre>
|
|
|
|
<h3 class="subheading">Stage 5: nuslerp5</h3>
|
|
<pre><code>architecture: MistralForCausalLM
|
|
merge_method: nuslerp
|
|
dtype: float32
|
|
out_dtype: bfloat16
|
|
models:
|
|
- model: B:\12B\models--Retreatcost--Impish-LongPen-12B
|
|
parameters:
|
|
weight: 0.5
|
|
- model: B:\12B\models--Retreatcost--Chrysologus-12B
|
|
parameters:
|
|
weight: 0.5
|
|
parameters:
|
|
tokenizer:
|
|
source: union
|
|
chat_template: auto</code></pre>
|
|
|
|
<h3 class="subheading">Stage 6: nuslerp6</h3>
|
|
<pre><code>architecture: MistralForCausalLM
|
|
merge_method: nuslerp
|
|
dtype: float32
|
|
out_dtype: bfloat16
|
|
models:
|
|
- model: B:\12B\models--MrRikyz--StarlightMoon-Foxfire-12B
|
|
parameters:
|
|
weight: 0.5
|
|
- model: B:\12B\SLERP5
|
|
parameters:
|
|
weight: 0.5
|
|
parameters:
|
|
tokenizer:
|
|
source: union
|
|
chat_template: auto</code></pre>
|
|
|
|
<h3 class="subheading">Stage 7: nuslerp7</h3>
|
|
<pre><code>architecture: MistralForCausalLM
|
|
merge_method: nuslerp
|
|
dtype: float32
|
|
out_dtype: bfloat16
|
|
models:
|
|
- model: B:\12B\models--SicariusSicariiStuff--Impish_Bloodmoon_12B
|
|
parameters:
|
|
weight: 0.5
|
|
- model: B:\12B\SLERP6
|
|
parameters:
|
|
weight: 0.5
|
|
parameters:
|
|
tokenizer:
|
|
source: union
|
|
chat_template: auto</code></pre>
|
|
|
|
<h3 class="subheading">Stage 8: nuslerp8</h3>
|
|
<pre><code>architecture: MistralForCausalLM
|
|
merge_method: nuslerp
|
|
dtype: float32
|
|
out_dtype: bfloat16
|
|
models:
|
|
- model: B:\12B\models--ReadyArt--Forgotten-Safeword-12B-v4.0
|
|
parameters:
|
|
weight: 0.5
|
|
- model: B:\12B\models--ReadyArt--Dark-Nexus-12B-v2.0
|
|
parameters:
|
|
weight: 0.5
|
|
parameters:
|
|
tokenizer:
|
|
source: union
|
|
chat_template: auto</code></pre>
|
|
|
|
<h3 class="subheading">Stage 9: nuslerp9</h3>
|
|
<pre><code>architecture: MistralForCausalLM
|
|
merge_method: nuslerp
|
|
dtype: float32
|
|
out_dtype: bfloat16
|
|
models:
|
|
- model: B:\12B\models--SuperbEmphasis--Omega-Darker_The-Final-Directive-Longform-Stage2-ERP-12B-v0.2
|
|
parameters:
|
|
weight: 0.5
|
|
- model: B:\12B\SLERP8
|
|
parameters:
|
|
weight: 0.5
|
|
parameters:
|
|
tokenizer:
|
|
source: union
|
|
chat_template: auto</code></pre>
|
|
|
|
<h3 class="subheading">Stage 10: nuslerp10</h3>
|
|
<pre><code>architecture: MistralForCausalLM
|
|
merge_method: nuslerp
|
|
dtype: float32
|
|
out_dtype: bfloat16
|
|
models:
|
|
- model: B:\12B\models--Epiculous--Violet_Twilight-v0.2
|
|
parameters:
|
|
weight: 0.5
|
|
- model: B:\12B\models--ChaoticNeutrals--Captain_Eris_Noctis-12B-v0.420
|
|
parameters:
|
|
weight: 0.5
|
|
parameters:
|
|
tokenizer:
|
|
source: union
|
|
chat_template: auto</code></pre>
|
|
|
|
<h3 class="subheading">Stage 11: nuslerp11</h3>
|
|
<pre><code>architecture: MistralForCausalLM
|
|
merge_method: nuslerp
|
|
dtype: float32
|
|
out_dtype: bfloat16
|
|
models:
|
|
- model: B:\12B\models--ohyeah1--Violet-Lyra-Gutenberg-v2
|
|
parameters:
|
|
weight: 0.5
|
|
- model: B:\12B\SLERP10
|
|
parameters:
|
|
weight: 0.5
|
|
parameters:
|
|
tokenizer:
|
|
source: union
|
|
chat_template: auto</code></pre>
|
|
|
|
<h3 class="subheading">Stage 12: nuslerp12</h3>
|
|
<pre><code>architecture: MistralForCausalLM
|
|
merge_method: nuslerp
|
|
dtype: float32
|
|
out_dtype: bfloat16
|
|
models:
|
|
- model: B:\12B\SLERP9
|
|
parameters:
|
|
weight: 0.5
|
|
- model: B:\12B\SLERP11
|
|
parameters:
|
|
weight: 0.5
|
|
parameters:
|
|
tokenizer:
|
|
source: union
|
|
chat_template: auto</code></pre>
|
|
|
|
<h3 class="subheading">Stage 13: nuslerp13</h3>
|
|
<pre><code>architecture: MistralForCausalLM
|
|
merge_method: nuslerp
|
|
dtype: float32
|
|
out_dtype: bfloat16
|
|
models:
|
|
- model: B:\12B\models--Vortex5--Aurora-Mirage-12B
|
|
parameters:
|
|
weight: 0.5
|
|
- model: B:\12B\models--Vortex5--Prototype-X-12b
|
|
parameters:
|
|
weight: 0.5
|
|
parameters:
|
|
tokenizer:
|
|
source: union
|
|
chat_template: auto</code></pre>
|
|
|
|
<h3 class="subheading">Stage 14: nuslerp14</h3>
|
|
<pre><code>architecture: MistralForCausalLM
|
|
merge_method: nuslerp
|
|
dtype: float32
|
|
out_dtype: bfloat16
|
|
models:
|
|
- model: B:\12B\models--redrix--GodSlayer-12B-ABYSS
|
|
parameters:
|
|
weight: 0.5
|
|
- model: B:\12B\models--Retreatcost--KansenSakura-Conflagration-RP-12b
|
|
parameters:
|
|
weight: 0.5
|
|
parameters:
|
|
tokenizer:
|
|
source: union
|
|
chat_template: auto</code></pre>
|
|
|
|
<h3 class="subheading">Stage 15: nuslerp15</h3>
|
|
<pre><code>architecture: MistralForCausalLM
|
|
merge_method: nuslerp
|
|
dtype: float32
|
|
out_dtype: bfloat16
|
|
models:
|
|
- model: B:\12B\models--Naphula--Ancient-Awakening-12B
|
|
parameters:
|
|
weight: 0.5
|
|
- model: B:\12B\SLERP14
|
|
parameters:
|
|
weight: 0.5
|
|
parameters:
|
|
tokenizer:
|
|
source: union
|
|
chat_template: auto</code></pre>
|
|
|
|
<h3 class="subheading">Stage 16: nuslerp16</h3>
|
|
<pre><code>architecture: MistralForCausalLM
|
|
merge_method: nuslerp
|
|
dtype: float32
|
|
out_dtype: bfloat16
|
|
models:
|
|
- model: B:\12B\SLERP13
|
|
parameters:
|
|
weight: 0.5
|
|
- model: B:\12B\SLERP15
|
|
parameters:
|
|
weight: 0.5
|
|
parameters:
|
|
tokenizer:
|
|
source: union
|
|
chat_template: auto</code></pre>
|
|
|
|
<h3 class="subheading">Stage 17: nuslerp17</h3>
|
|
<pre><code>architecture: MistralForCausalLM
|
|
merge_method: nuslerp
|
|
dtype: float32
|
|
out_dtype: bfloat16
|
|
models:
|
|
- model: B:\12B\models--LatitudeGames--Wayfarer-2-12B
|
|
parameters:
|
|
weight: 0.5
|
|
- model: B:\12B\!models--allura-org--Tlacuilo-12B
|
|
parameters:
|
|
weight: 0.5
|
|
parameters:
|
|
tokenizer:
|
|
source: union
|
|
chat_template: auto</code></pre>
|
|
|
|
<h3 class="subheading">Stage 18: nuslerp18</h3>
|
|
<pre><code>architecture: MistralForCausalLM
|
|
merge_method: nuslerp
|
|
dtype: float32
|
|
out_dtype: bfloat16
|
|
models:
|
|
- model: B:\12B\models--LatitudeGames--Muse-12B
|
|
parameters:
|
|
weight: 0.5
|
|
- model: B:\12B\SLERP17
|
|
parameters:
|
|
weight: 0.5
|
|
parameters:
|
|
tokenizer:
|
|
source: union
|
|
chat_template: auto</code></pre>
|
|
|
|
<h3 class="subheading">Stage 19: passthrough1</h3>
|
|
<pre><code>architecture: MistralForCausalLM
|
|
merge_method: passthrough
|
|
slices:
|
|
- sources:
|
|
- model: B:\12B\models--IggyLux--MN-VelvetCafe-RP-12B-V2
|
|
layer_range: [0, 5]
|
|
- sources:
|
|
- model: B:\12B\SLERP4
|
|
layer_range: [5, 14]
|
|
- sources:
|
|
- model: B:\12B\SLERP7
|
|
layer_range: [14, 22]
|
|
- sources:
|
|
- model: B:\12B\SLERP12
|
|
layer_range: [22, 29]
|
|
- sources:
|
|
- model: B:\12B\SLERP16
|
|
layer_range: [29, 35]
|
|
- sources:
|
|
- model: B:\12B\SLERP18
|
|
layer_range: [35, 39]
|
|
- sources:
|
|
- model: B:\12B\models--PocketDoc--Dans-SakuraKaze-V1.0.0-12b
|
|
layer_range: [39, 40]
|
|
tokenizer:
|
|
source: union
|
|
chat_template: auto
|
|
dtype: float32
|
|
out_dtype: bfloat16</code></pre>
|
|
|
|
<h3 class="subheading">Stage 20: pdq1</h3>
|
|
<pre><code>merge_method: pdq
|
|
pdq_base_yaml: B:\12B\19-passthrough\20-pdq5.yml
|
|
pdq_base_model: B:\12B\19-passthrough
|
|
output_dir: B:\12B\pdq1
|
|
base_model: A:\LLM\.cache\12B\models--mistralai--Mistral-Nemo-Instruct-2407
|
|
models:
|
|
- model: B:\12B\models--IggyLux--MN-VelvetCafe-RP-12B-V2
|
|
- model: B:\12B\SLERP4
|
|
- model: B:\12B\SLERP7
|
|
- model: B:\12B\SLERP12
|
|
- model: B:\12B\SLERP16
|
|
- model: B:\12B\SLERP18
|
|
- model: B:\12B\models--PocketDoc--Dans-SakuraKaze-V1.0.0-12b
|
|
parameters:
|
|
chi: 0.15
|
|
iota: 0.1
|
|
nu: 24
|
|
gamma: 1.0
|
|
zeta: 16
|
|
sigma: 0.5
|
|
density: 0.9
|
|
epsilon: 0.099
|
|
lambda: 1.0
|
|
lazy_unpickle: True
|
|
random_seed: 420
|
|
name: Stage 20 PDQ</code></pre>
|
|
|
|
<h3 class="subheading">Stage 21: della1</h3>
|
|
<pre><code>architecture: MistralForCausalLM
|
|
models:
|
|
- model: A:\LLM\.cache\12B\models--mistralai--Mistral-Nemo-Instruct-2407
|
|
- model: B:\12B\models--IggyLux--MN-VelvetCafe-RP-12B-V2
|
|
parameters:
|
|
weight: 0.1
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
- model: B:\12B\SLERP1
|
|
parameters:
|
|
weight: 0.1
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
- model: B:\12B\SLERP2
|
|
parameters:
|
|
weight: 0.1
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
- model: B:\12B\SLERP3
|
|
parameters:
|
|
weight: 0.1
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
- model: B:\12B\SLERP4
|
|
parameters:
|
|
weight: 0.1
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
- model: B:\12B\SLERP5
|
|
parameters:
|
|
weight: 0.1
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
- model: B:\12B\SLERP6
|
|
parameters:
|
|
weight: 0.1
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
- model: B:\12B\SLERP7
|
|
parameters:
|
|
weight: 0.1
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
- model: B:\12B\SLERP8
|
|
parameters:
|
|
weight: 0.1
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
- model: B:\12B\SLERP9
|
|
parameters:
|
|
weight: 0.1
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
- model: B:\12B\SLERP10
|
|
parameters:
|
|
weight: 0.1
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
- model: B:\12B\SLERP11
|
|
parameters:
|
|
weight: 0.1
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
- model: B:\12B\SLERP12
|
|
parameters:
|
|
weight: 0.1
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
- model: B:\12B\SLERP13
|
|
parameters:
|
|
weight: 0.1
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
- model: B:\12B\SLERP14
|
|
parameters:
|
|
weight: 0.1
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
- model: B:\12B\SLERP15
|
|
parameters:
|
|
weight: 0.1
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
- model: B:\12B\SLERP16
|
|
parameters:
|
|
weight: 0.1
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
- model: B:\12B\SLERP17
|
|
parameters:
|
|
weight: 0.1
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
- model: B:\12B\SLERP18
|
|
parameters:
|
|
weight: 0.1
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
- model: B:\12B\models--PocketDoc--Dans-SakuraKaze-V1.0.0-12b
|
|
parameters:
|
|
weight: 0.1
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
- model: B:\12B\19-passthrough
|
|
parameters:
|
|
weight: 0.1
|
|
density: 0.9
|
|
epsilon: 0.09
|
|
- model: B:\12B\pdq1
|
|
parameters:
|
|
weight: 0.1
|
|
density: 0.9
|
|
epsilon: 0.09
|
|
merge_method: della
|
|
base_model: A:\LLM\.cache\12B\models--mistralai--Mistral-Nemo-Instruct-2407
|
|
parameters:
|
|
lambda: 1.0
|
|
normalize: false
|
|
int8_mask: false
|
|
tokenizer:
|
|
source: union
|
|
chat_template: auto
|
|
dtype: float32
|
|
out_dtype: bfloat16</code></pre>
|
|
|
|
<h3 class="subheading">Stage 22: pdq2</h3>
|
|
<pre><code>merge_method: pdq
|
|
pdq_base_yaml: B:\12B\19-passthrough\22-pdq20.yml
|
|
pdq_base_model: B:\12B\19-passthrough
|
|
output_dir: B:\12B\pdq3
|
|
base_model: A:\LLM\.cache\12B\models--mistralai--Mistral-Nemo-Instruct-2407
|
|
models:
|
|
- model: B:\12B\models--IggyLux--MN-VelvetCafe-RP-12B-V2
|
|
- model: B:\12B\SLERP1
|
|
- model: B:\12B\SLERP2
|
|
- model: B:\12B\SLERP3
|
|
- model: B:\12B\SLERP4
|
|
- model: B:\12B\SLERP5
|
|
- model: B:\12B\SLERP6
|
|
- model: B:\12B\SLERP7
|
|
- model: B:\12B\SLERP8
|
|
- model: B:\12B\SLERP9
|
|
- model: B:\12B\SLERP10
|
|
- model: B:\12B\SLERP11
|
|
- model: B:\12B\SLERP12
|
|
- model: B:\12B\SLERP13
|
|
- model: B:\12B\SLERP14
|
|
- model: B:\12B\SLERP15
|
|
- model: B:\12B\SLERP16
|
|
- model: B:\12B\SLERP17
|
|
- model: B:\12B\SLERP18
|
|
- model: B:\12B\models--PocketDoc--Dans-SakuraKaze-V1.0.0-12b
|
|
parameters:
|
|
chi: 0.15
|
|
iota: 0.1
|
|
nu: 24
|
|
gamma: 1.0
|
|
zeta: 16
|
|
sigma: 0.5
|
|
density: 0.9
|
|
epsilon: 0.099
|
|
lambda: 1.0
|
|
lazy_unpickle: True
|
|
random_seed: 420
|
|
name: Stage 22 PDQ</code></pre>
|
|
|
|
<h3 class="subheading">Stage 23: nuslerp19</h3>
|
|
<pre><code>architecture: MistralForCausalLM
|
|
merge_method: nuslerp
|
|
dtype: float32
|
|
out_dtype: bfloat16
|
|
models:
|
|
- model: B:\12B\pdq1
|
|
parameters:
|
|
weight: 0.5
|
|
- model: B:\12B\pdq3
|
|
parameters:
|
|
weight: 0.5
|
|
parameters:
|
|
tokenizer:
|
|
source: union
|
|
chat_template: auto</code></pre>
|
|
|
|
<h3 class="subheading">Stage 24: chiral_qhe1</h3>
|
|
<pre><code>merge_method: chiral_qhe
|
|
models:
|
|
- model: B:\12B\models--IggyLux--MN-VelvetCafe-RP-12B-V2
|
|
- model: B:\12B\SLERP1
|
|
- model: B:\12B\SLERP2
|
|
- model: B:\12B\SLERP3
|
|
- model: B:\12B\SLERP4
|
|
- model: B:\12B\SLERP5
|
|
- model: B:\12B\SLERP6
|
|
- model: B:\12B\SLERP7
|
|
- model: B:\12B\SLERP8
|
|
- model: B:\12B\SLERP9
|
|
- model: B:\12B\SLERP10
|
|
- model: B:\12B\SLERP11
|
|
- model: B:\12B\SLERP12
|
|
- model: B:\12B\SLERP13
|
|
- model: B:\12B\SLERP14
|
|
- model: B:\12B\SLERP15
|
|
- model: B:\12B\SLERP16
|
|
- model: B:\12B\SLERP17
|
|
- model: B:\12B\SLERP18
|
|
- model: B:\12B\models--PocketDoc--Dans-SakuraKaze-V1.0.0-12b
|
|
- model: B:\12B\19-passthrough
|
|
- model: B:\12B\21-Della
|
|
- model: B:\12B\SLERP-PDQ
|
|
parameters:
|
|
chi: 0.15
|
|
iota: 0.1
|
|
nu: 24
|
|
gamma: 1.0
|
|
zeta: 16
|
|
sigma: 0.5
|
|
coherence: 0.5
|
|
dtype: float32
|
|
out_dtype: bfloat16
|
|
tokenizer:
|
|
source: union
|
|
chat_template: chatml</code></pre>
|
|
|
|
<h3 class="subheading">Stage 25: arcee_fusion1</h3>
|
|
<pre><code>merge_method: arcee_fusion
|
|
base_model: B:\12B\21-della
|
|
models:
|
|
- model: B:\12B\21-della
|
|
- model: B:\12B\24-qhe
|
|
parameters:
|
|
tukey_fence: 1.5
|
|
dtype: float32
|
|
out_dtype: bfloat16
|
|
tokenizer:
|
|
source: base
|
|
chat_template: "chatml"</code></pre>
|
|
|
|
<h3 class="subheading">Stage 26: nearswap1</h3>
|
|
<pre><code>merge_method: nearswap
|
|
base_model: B:\12B\23-arcee
|
|
models:
|
|
- model: B:\12B\models--IggyLux--MN-VelvetCafe-RP-12B-V2
|
|
parameters:
|
|
t:
|
|
# We use a "U-Shape" or "End-Heavy" gradient
|
|
# High at the start (Instruction following)
|
|
# Zero in the middle (Preserve DELLA/QHE/ARCEE creativity)
|
|
# High at the end (EOS/Termination logic)
|
|
- filter: self_attn
|
|
value: [0.0005, 0.0002, 0.0001, 0.0000, 0.0000, 0.0002, 0.0005]
|
|
- filter: mlp
|
|
value: [0.0003, 0.0001, 0.0000, 0.0000, 0.0000, 0.0001, 0.0003]
|
|
- value: 0.0002 # Catch-all for layernorms and embeddings
|
|
dtype: bfloat16
|
|
tokenizer:
|
|
source: B:\12B\models--IggyLux--MN-VelvetCafe-RP-12B-V2
|
|
chat_template: chatml</code></pre>
|
|
|
|
<h3 class="subheading">Stage 27: passthrough2</h3>
|
|
<pre><code>architecture: MistralForCausalLM
|
|
merge_method: passthrough
|
|
slices:
|
|
- sources:
|
|
- model: B:\12B\models--IggyLux--MN-VelvetCafe-RP-12B-V2
|
|
layer_range: [0, 3]
|
|
- sources:
|
|
- model: B:\12B\models--SicariusSicariiStuff--Impish_Bloodmoon_12B
|
|
layer_range: [3, 5]
|
|
- sources:
|
|
- model: B:\12B\26-nearswap
|
|
layer_range: [5, 37]
|
|
- sources:
|
|
- model: B:\12B\models--inflatebot--MN-12B-Mag-Mell-R1
|
|
layer_range: [37, 38]
|
|
- sources:
|
|
- model: B:\12B\models--LatitudeGames--Muse-12B
|
|
layer_range: [38, 39]
|
|
- sources:
|
|
- model: B:\12B\models--PocketDoc--Dans-SakuraKaze-V1.0.0-12b
|
|
layer_range: [39, 40]
|
|
tokenizer:
|
|
source: B:\12B\models--IggyLux--MN-VelvetCafe-RP-12B-V2
|
|
chat_template: auto
|
|
dtype: float32
|
|
out_dtype: bfloat16</code></pre>
|
|
|
|
<h3 class="subheading">Stage 28: della2</h3>
|
|
<pre><code>architecture: MistralForCausalLM
|
|
models:
|
|
- model: A:\LLM\.cache\12B\models--mistralai--Mistral-Nemo-Instruct-2407
|
|
- model: B:\12B\!models--allura-org--Tlacuilo-12B
|
|
parameters:
|
|
weight: 0.06
|
|
density: 0.7
|
|
epsilon: 0.29
|
|
- model: B:\12B\models--ChaoticNeutrals--Captain_Eris_Noctis-12B-v0.420
|
|
parameters:
|
|
weight: 0.06
|
|
density: 0.7
|
|
epsilon: 0.29
|
|
- model: B:\12B\models--DavidAU--MN-Dark-Planet-TITAN-12B
|
|
parameters:
|
|
weight: 0.06
|
|
density: 0.7
|
|
epsilon: 0.29
|
|
- model: B:\12B\models--DavidAU--MN-GRAND-Gutenberg-Lyra4-Lyra-12B-DARKNESS
|
|
parameters:
|
|
weight: 0.06
|
|
density: 0.7
|
|
epsilon: 0.29
|
|
- model: B:\12B\models--DreadPoor--Famino-12B-Model_Stock
|
|
parameters:
|
|
weight: 0.06
|
|
density: 0.7
|
|
epsilon: 0.29
|
|
- model: B:\12B\models--EldritchLabs--Cactus-Dream-Horror-12B
|
|
parameters:
|
|
weight: 0.06
|
|
density: 0.7
|
|
epsilon: 0.29
|
|
- model: B:\12B\models--EldritchLabs--Kraken-Karcher-12B-v1
|
|
parameters:
|
|
weight: 0.06
|
|
density: 0.7
|
|
epsilon: 0.29
|
|
- model: B:\12B\models--Epiculous--Violet_Twilight-v0.2
|
|
parameters:
|
|
weight: 0.06
|
|
density: 0.7
|
|
epsilon: 0.29
|
|
- model: B:\12B\models--IggyLux--MN-VelvetCafe-RP-12B-V2
|
|
parameters:
|
|
weight: 0.06
|
|
density: 0.7
|
|
epsilon: 0.29
|
|
- model: B:\12B\models--inflatebot--MN-12B-Mag-Mell-R1
|
|
parameters:
|
|
weight: 0.06
|
|
density: 0.7
|
|
epsilon: 0.29
|
|
- model: B:\12B\models--LatitudeGames--Muse-12B
|
|
parameters:
|
|
weight: 0.06
|
|
density: 0.7
|
|
epsilon: 0.29
|
|
- model: B:\12B\models--LatitudeGames--Wayfarer-2-12B
|
|
parameters:
|
|
weight: 0.06
|
|
density: 0.7
|
|
epsilon: 0.29
|
|
- model: B:\12B\models--MrRikyz--StarlightMoon-Foxfire-12B
|
|
parameters:
|
|
weight: 0.06
|
|
density: 0.7
|
|
epsilon: 0.29
|
|
- model: B:\12B\models--Naphula--Ancient-Awakening-12B
|
|
parameters:
|
|
weight: 0.06
|
|
density: 0.7
|
|
epsilon: 0.29
|
|
- model: B:\12B\models--Naphula--Riemannian-Redshift-12B-v1
|
|
parameters:
|
|
weight: 0.06
|
|
density: 0.7
|
|
epsilon: 0.29
|
|
- model: B:\12B\models--ohyeah1--Violet-Lyra-Gutenberg-v2
|
|
parameters:
|
|
weight: 0.06
|
|
density: 0.7
|
|
epsilon: 0.29
|
|
- model: B:\12B\models--PocketDoc--Dans-SakuraKaze-V1.0.0-12b
|
|
parameters:
|
|
weight: 0.06
|
|
density: 0.7
|
|
epsilon: 0.29
|
|
- model: B:\12B\models--PygmalionAI--Pygmalion-3-12B
|
|
parameters:
|
|
weight: 0.06
|
|
density: 0.7
|
|
epsilon: 0.29
|
|
- model: B:\12B\models--rAIfle--Questionable-MN-bf16
|
|
parameters:
|
|
weight: 0.06
|
|
density: 0.7
|
|
epsilon: 0.29
|
|
- model: B:\12B\models--ReadyArt--Dark-Nexus-12B-v2.0
|
|
parameters:
|
|
weight: 0.06
|
|
density: 0.7
|
|
epsilon: 0.29
|
|
- model: B:\12B\models--ReadyArt--Forgotten-Safeword-12B-v4.0
|
|
parameters:
|
|
weight: 0.06
|
|
density: 0.7
|
|
epsilon: 0.29
|
|
- model: B:\12B\models--redrix--GodSlayer-12B-ABYSS
|
|
parameters:
|
|
weight: 0.06
|
|
density: 0.7
|
|
epsilon: 0.29
|
|
- model: B:\12B\models--Retreatcost--Chrysologus-12B
|
|
parameters:
|
|
weight: 0.06
|
|
density: 0.9
|
|
epsilon: 0.09
|
|
- model: B:\12B\models--Retreatcost--Impish-LongPen-12B
|
|
parameters:
|
|
weight: 0.06
|
|
density: 0.9
|
|
epsilon: 0.09
|
|
- model: B:\12B\models--Retreatcost--KansenSakura-Conflagration-RP-12b
|
|
parameters:
|
|
weight: 0.06
|
|
density: 0.9
|
|
epsilon: 0.09
|
|
- model: B:\12B\models--SicariusSicariiStuff--Impish_Bloodmoon_12B
|
|
parameters:
|
|
weight: 0.06
|
|
density: 0.9
|
|
epsilon: 0.09
|
|
- model: B:\12B\models--SuperbEmphasis--MN-12b-RP-Ink-RP-Longform
|
|
parameters:
|
|
weight: 0.06
|
|
density: 0.9
|
|
epsilon: 0.09
|
|
- model: B:\12B\models--SuperbEmphasis--Omega-Darker_The-Final-Directive-Longform-Stage2-ERP-12B-v0.2
|
|
parameters:
|
|
weight: 0.06
|
|
density: 0.9
|
|
epsilon: 0.09
|
|
- model: B:\12B\models--TheDrummer--Rocinante-X-12B-v1
|
|
parameters:
|
|
weight: 0.06
|
|
density: 0.9
|
|
epsilon: 0.09
|
|
- model: B:\12B\models--Vortex5--Aurora-Mirage-12B
|
|
parameters:
|
|
weight: 0.06
|
|
density: 0.9
|
|
epsilon: 0.09
|
|
- model: B:\12B\models--Vortex5--Prototype-X-12b
|
|
parameters:
|
|
weight: 0.06
|
|
density: 0.9
|
|
epsilon: 0.09
|
|
merge_method: della
|
|
base_model: A:\LLM\.cache\12B\models--mistralai--Mistral-Nemo-Instruct-2407
|
|
parameters:
|
|
lambda: 1.0
|
|
normalize: false
|
|
int8_mask: false
|
|
tokenizer:
|
|
source: union
|
|
chat_template: "chatml"
|
|
dtype: float32
|
|
out_dtype: bfloat16</code></pre>
|
|
|
|
<h3 class="subheading">Stage 29: multislerp1</h3>
|
|
<pre><code>architecture: MistralForCausalLM
|
|
merge_method: multislerp
|
|
models:
|
|
- model: B:\12B\SLERP1
|
|
parameters:
|
|
weight: 0.1
|
|
- model: B:\12B\SLERP2
|
|
parameters:
|
|
weight: 0.1
|
|
- model: B:\12B\SLERP3
|
|
parameters:
|
|
weight: 0.1
|
|
- model: B:\12B\SLERP4
|
|
parameters:
|
|
weight: 0.1
|
|
- model: B:\12B\SLERP5
|
|
parameters:
|
|
weight: 0.1
|
|
- model: B:\12B\SLERP6
|
|
parameters:
|
|
weight: 0.1
|
|
- model: B:\12B\SLERP7
|
|
parameters:
|
|
weight: 0.1
|
|
- model: B:\12B\SLERP8
|
|
parameters:
|
|
weight: 0.1
|
|
- model: B:\12B\SLERP9
|
|
parameters:
|
|
weight: 0.1
|
|
- model: B:\12B\SLERP10
|
|
parameters:
|
|
weight: 0.1
|
|
- model: B:\12B\SLERP11
|
|
parameters:
|
|
weight: 0.1
|
|
- model: B:\12B\SLERP12
|
|
parameters:
|
|
weight: 0.1
|
|
- model: B:\12B\SLERP13
|
|
parameters:
|
|
weight: 0.1
|
|
- model: B:\12B\SLERP14
|
|
parameters:
|
|
weight: 0.1
|
|
- model: B:\12B\SLERP15
|
|
parameters:
|
|
weight: 0.1
|
|
- model: B:\12B\SLERP16
|
|
parameters:
|
|
weight: 0.1
|
|
- model: B:\12B\SLERP17
|
|
parameters:
|
|
weight: 0.1
|
|
- model: B:\12B\SLERP18
|
|
parameters:
|
|
weight: 0.1
|
|
dtype: float32
|
|
out_dtype: bfloat16
|
|
parameters:
|
|
normalize: false
|
|
tokenizer:
|
|
source: union
|
|
chat_template: auto</code></pre>
|
|
|
|
<h3 class="subheading">Stage 30: della3</h3>
|
|
<pre><code>architecture: MistralForCausalLM
|
|
models:
|
|
- model: A:\LLM\.cache\12B\models--mistralai--Mistral-Nemo-Instruct-2407
|
|
- model: B:\12B\19-passthrough
|
|
parameters:
|
|
weight: 0.3
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
- model: B:\12B\21-Della
|
|
parameters:
|
|
weight: 0.22
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
- model: B:\12B\27-passB
|
|
parameters:
|
|
weight: 0.22
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
- model: B:\12B\28-della
|
|
parameters:
|
|
weight: 0.22
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
- model: B:\12B\29-multislerp
|
|
parameters:
|
|
weight: 0.3
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
- model: B:\12B\24-qhe
|
|
parameters:
|
|
weight: 0.11
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
- model: B:\12B\pdq1
|
|
parameters:
|
|
weight: 0.11
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
- model: B:\12B\25-arcee
|
|
parameters:
|
|
weight: 0.11
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
- model: B:\12B\models--SicariusSicariiStuff--Impish_Bloodmoon_12B
|
|
parameters:
|
|
weight: 0.06
|
|
density: 0.5
|
|
epsilon: 0.25
|
|
- model: B:\12B\models--SuperbEmphasis--MN-12b-RP-Ink-RP-Longform
|
|
parameters:
|
|
weight: 0.05
|
|
density: 0.5
|
|
epsilon: 0.25
|
|
- model: B:\12B\models--Vortex5--Aurora-Mirage-12B
|
|
parameters:
|
|
weight: 0.05
|
|
density: 0.5
|
|
epsilon: 0.25
|
|
- model: B:\12B\models--Vortex5--Prototype-X-12b
|
|
parameters:
|
|
weight: 0.05
|
|
density: 0.5
|
|
epsilon: 0.25
|
|
- model: B:\12B\models--MrRikyz--StarlightMoon-Foxfire-12B
|
|
parameters:
|
|
weight: 0.05
|
|
density: 0.5
|
|
epsilon: 0.25
|
|
- model: B:\12B\models--Naphula--Ancient-Awakening-12B
|
|
parameters:
|
|
weight: 0.05
|
|
density: 0.5
|
|
epsilon: 0.25
|
|
- model: B:\12B\models--IggyLux--MN-VelvetCafe-RP-12B-V2
|
|
parameters:
|
|
weight: 0.05
|
|
density: 0.5
|
|
epsilon: 0.25
|
|
- model: B:\12B\models--PocketDoc--Dans-SakuraKaze-V1.0.0-12b
|
|
parameters:
|
|
weight: 0.05
|
|
density: 0.5
|
|
epsilon: 0.25
|
|
merge_method: della
|
|
base_model: A:\LLM\.cache\12B\models--mistralai--Mistral-Nemo-Instruct-2407
|
|
parameters:
|
|
lambda: 1.0
|
|
normalize: false
|
|
int8_mask: false
|
|
tokenizer:
|
|
source: B:\12B\29-multislerp
|
|
chat_template: "chatml"
|
|
dtype: float32
|
|
out_dtype: bfloat16</code></pre>
|
|
|
|
<h3 class="subheading">Stage 31: della_linear1</h3>
|
|
<pre><code>architecture: MistralForCausalLM
|
|
models:
|
|
- model: A:\LLM\.cache\12B\models--mistralai--Mistral-Nemo-Instruct-2407
|
|
- model: B:\12B\19-passthrough
|
|
parameters:
|
|
weight: 0.4
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
- model: B:\12B\29-multislerp
|
|
parameters:
|
|
weight: 0.5
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
- model: B:\12B\21-della
|
|
parameters:
|
|
weight: 0.1
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
- model: B:\12B\28-della
|
|
parameters:
|
|
weight: 0.1
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
- model: B:\12B\30-della
|
|
parameters:
|
|
weight: 0.2
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
- model: B:\12B\24-qhe
|
|
parameters:
|
|
weight: 0.2
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
- model: B:\12B\pdq1
|
|
parameters:
|
|
weight: 0.1
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
- model: B:\12B\27-passB
|
|
parameters:
|
|
weight: 0.4
|
|
density: 0.8
|
|
epsilon: 0.19
|
|
merge_method: della_linear
|
|
base_model: A:\LLM\.cache\12B\models--mistralai--Mistral-Nemo-Instruct-2407
|
|
parameters:
|
|
lambda: 1.0
|
|
normalize: false
|
|
int8_mask: false
|
|
tokenizer:
|
|
source: B:\12B\30-della
|
|
chat_template: "chatml"
|
|
dtype: float32
|
|
out_dtype: bfloat16</code></pre>
|
|
|
|
<h3 class="subheading">Stage 32: nuslerp20</h3>
|
|
<pre><code>architecture: MistralForCausalLM
|
|
merge_method: nuslerp
|
|
dtype: float32
|
|
out_dtype: bfloat16
|
|
models:
|
|
- model: B:\12B\31-della_linear
|
|
parameters:
|
|
weight: 0.4
|
|
- model: B:\12B\29-multislerp
|
|
parameters:
|
|
weight: 0.6
|
|
parameters:
|
|
tokenizer:
|
|
source: B:\12B\29-multislerp
|
|
chat_template: "chatml"</code></pre>
|
|
|
|
<h3 class="subheading">Stage 33: karcher1</h3>
|
|
<pre><code>architecture: MistralForCausalLM
|
|
models:
|
|
- model: B:\12B\31-della_linear
|
|
- model: B:\12B\24-qhe
|
|
- model: B:\12B\29-multislerp
|
|
merge_method: karcher
|
|
dtype: float32
|
|
out_dtype: bfloat16
|
|
parameters:
|
|
tol: 1e-9
|
|
max_iter: 1000
|
|
tokenizer:
|
|
source: B:\12B\31-della_linear
|
|
chat_template: auto</code></pre>
|
|
|
|
<h3 class="subheading">Stage 34: flux1</h3>
|
|
<pre><code>models:
|
|
- model: B:\12B\33-karcher
|
|
- model: B:\12B\32-nuslerp
|
|
- model: B:\12B\24-qhe
|
|
- model: B:\12B\29-multislerp
|
|
- model: B:\12B\31-della_linear
|
|
merge_method: flux
|
|
parameters:
|
|
eta: 1.2
|
|
tol: 1.0e-9
|
|
max_iter: 1000
|
|
kappa: 0.8
|
|
mu: 0.5
|
|
dtype: float32
|
|
out_dtype: bfloat16
|
|
tokenizer:
|
|
source: B:\12B\31-della_linear
|
|
chat_template: auto</code></pre>
|
|
|
|
<h3 class="subheading">Stage 35: rsce1</h3>
|
|
<pre><code>architecture: MistralForCausalLM
|
|
merge_method: rsce
|
|
dtype: float32
|
|
out_dtype: bfloat16
|
|
models:
|
|
- model: B:\12B\31-della_linear
|
|
- model: B:\12B\SLERP1
|
|
- model: B:\12B\SLERP2
|
|
- model: B:\12B\SLERP3
|
|
- model: B:\12B\SLERP4
|
|
- model: B:\12B\SLERP5
|
|
- model: B:\12B\SLERP6
|
|
- model: B:\12B\SLERP7
|
|
- model: B:\12B\SLERP8
|
|
- model: B:\12B\SLERP9
|
|
- model: B:\12B\SLERP10
|
|
- model: B:\12B\SLERP11
|
|
- model: B:\12B\SLERP12
|
|
- model: B:\12B\SLERP13
|
|
- model: B:\12B\SLERP14
|
|
- model: B:\12B\SLERP15
|
|
- model: B:\12B\SLERP16
|
|
- model: B:\12B\SLERP17
|
|
- model: B:\12B\SLERP18
|
|
- model: B:\12B\33-karcher
|
|
- model: B:\12B\32-nuslerp
|
|
- model: B:\12B\24-qhe
|
|
- model: B:\12B\29-multislerp
|
|
- model: B:\12B\19-passthrough
|
|
- model: B:\12B\27-passB
|
|
- model: B:\12B\pdq1
|
|
base_model: B:\12B\34-flux
|
|
parameters:
|
|
select_topk: 0.5
|
|
normalize: false
|
|
tokenizer:
|
|
source: base
|
|
chat_template: auto</code></pre>
|
|
|
|
<h3 class="subheading">Stage 36: magic1</h3>
|
|
<pre><code>merge_method: magic
|
|
base_model: B:\12B\34-flux
|
|
models:
|
|
- model: B:\12B\34-flux
|
|
- model: B:\12B\31-della_linear
|
|
- model: B:\12B\33-karcher
|
|
- model: B:\12B\32-nuslerp
|
|
- model: B:\12B\24-qhe
|
|
- model: B:\12B\29-multislerp
|
|
- model: B:\12B\19-passthrough
|
|
- model: B:\12B\27-passB
|
|
- model: B:\12B\pdq1
|
|
- model: B:\12B\35-rsce
|
|
parameters:
|
|
power: 1.0
|
|
creativity: 1.0
|
|
filter_topk: 0.5
|
|
hierarchy: 0.5
|
|
karcher_max_iter: 1000
|
|
karcher_tol: 1e-9
|
|
karcher_eta: 1.0
|
|
inversion_mode: 1
|
|
inversion_threshold: 1.0
|
|
dtype: float32
|
|
out_dtype: bfloat16
|
|
tokenizer:
|
|
source: base
|
|
chat_template: auto
|
|
name: Psychosis-14B-v0a-MAGIC</code></pre>
|
|
|
|
<h3 class="subheading">Stage 37: arcee_multifusion1</h3>
|
|
<pre><code>architecture: MistralForCausalLM
|
|
merge_method: arcee_multifusion
|
|
# ANCHOR: Use Precog as the base.
|
|
# Anything not "salient" from the donors will remain Precog logic.
|
|
base_model: B:\12B\36-magic
|
|
models:
|
|
# - model: B:\24B\models--TheDrummer--Precog-24B-v1
|
|
# - model: B:\24B\models--mistralai--Magistral-Small-2509\textonly
|
|
- model: B:\12B\36-magic
|
|
- model: B:\12B\35-rsce
|
|
- model: B:\12B\31-della_linear
|
|
- model: B:\12B\33-karcher
|
|
- model: B:\12B\32-nuslerp
|
|
- model: B:\12B\24-qhe
|
|
- model: B:\12B\29-multislerp
|
|
- model: B:\12B\19-passthrough
|
|
- model: B:\12B\27-passB
|
|
- model: B:\12B\pdq1
|
|
- model: B:\12B\34-flux
|
|
parameters:
|
|
# tukey_fence: 1.5 is standard (~12.5% salience).
|
|
# We use 0.75 to increase the "Knowledge Injection" from donors to ~25%
|
|
tukey_fence: 0.75
|
|
# class SalienceMode
|
|
# COMBINED = "combined" # Add up salience from all donors
|
|
# DIVIDED = "divided" # Divide total salience by number of donors
|
|
# AVERAGED = "averaged" # Third Mode: Average the importance scores before thresholding
|
|
# "averaged" gives more "Share of Voice" to models with larger task vectors (like qhe/pdq)
|
|
salience_mode: "averaged"
|
|
# normalize: true ensures that even if multiple models have salient
|
|
# changes in the same spot, the weights don't explode (Magnitude Inflation)
|
|
# false works best with "combined" mode
|
|
normalize: true
|
|
tokenizer:
|
|
source: base
|
|
chat_template: auto
|
|
dtype: float32
|
|
out_dtype: bfloat16</code></pre>
|
|
|
|
<h3 class="subheading">Stage 38: dare_linear1</h3>
|
|
<pre><code>architecture: MistralForCausalLM
|
|
models:
|
|
- model: A:\LLM\.cache\12B\models--mistralai--Mistral-Nemo-Instruct-2407
|
|
- model: B:\12B\36-magic
|
|
parameters:
|
|
weight: 1.0
|
|
density: 0.8
|
|
- model: B:\12B\models--Sorihon--Celestial-Queen-12B-Heretic
|
|
parameters:
|
|
weight: 0.2
|
|
density: 0.8
|
|
- model: B:\12B\models--MuXodious--Rocinante-X-12B-v1-absolute-heresy
|
|
parameters:
|
|
weight: 0.2
|
|
density: 0.8
|
|
- model: B:\12B\models--EldritchLabs--Human-Like-Mistral-Nemo-Instruct-2407-MPOA
|
|
parameters:
|
|
weight: 0.2
|
|
density: 0.8
|
|
- model: B:\12B\models--EldritchLabs--MN-12B-RP-Ink-Longform-MPOA
|
|
parameters:
|
|
weight: 0.2
|
|
density: 0.8
|
|
- model: A:\LLM\.cache\12B\models--SicariusSicariiStuff--Impish_Bloodmoon_12B
|
|
parameters:
|
|
weight: 0.2
|
|
density: 0.8
|
|
merge_method: dare_linear
|
|
base_model: A:\LLM\.cache\12B\models--mistralai--Mistral-Nemo-Instruct-2407
|
|
parameters:
|
|
lambda: 1.0
|
|
normalize: false
|
|
int8_mask: false
|
|
rescale: true
|
|
tokenizer:
|
|
source: union
|
|
chat_template: auto
|
|
dtype: float32
|
|
out_dtype: bfloat16</code></pre>
|
|
|
|
<h3 class="subheading">Stage 39: delerp1</h3>
|
|
<pre><code>architecture: MistralForCausalLM
|
|
merge_method: delerp
|
|
dtype: float32
|
|
out_dtype: bfloat16
|
|
base_model: B:\12B\models--DavidAU--Mistral-Nemo-2407-12B-Thinking-Claude-Gemini-GPT5.2-Uncensored-HERETIC
|
|
models:
|
|
- model: B:\12B\models--DavidAU--Mistral-Nemo-2407-12B-Thinking-Claude-Gemini-GPT5.2-Uncensored-HERETIC
|
|
- model: B:\12B\38-dare_linear
|
|
parameters:
|
|
t: [0.333, 0.444, 0.555, 0.666, 0.777, 0.888, 0.999]
|
|
tokenizer:
|
|
source: B:\12B\38-dare_linear
|
|
chat_template: "chatml"</code></pre>
|
|
|
|
<h3 class="subheading">Stage 40: cvs1</h3>
|
|
<pre><code>architecture: MistralForCausalLM
|
|
models:
|
|
- model: B:\12B\models--EldritchLabs--MN-12B-Mag-Mell-R1-Uncensored-Scale1.2
|
|
- model: B:\12B\pdq1
|
|
- model: B:\12B\24-qhe
|
|
- model: B:\12B\36-magic
|
|
- model: B:\12B\37-arcee_multifusion
|
|
- model: B:\12B\38-dare_linear
|
|
- model: B:\12B\39-delerp
|
|
- model: B:\12B\models--Sorihon--Celestial-Queen-12B-Heretic
|
|
- model: B:\12B\models--MuXodious--Rocinante-X-12B-v1-absolute-heresy
|
|
- model: B:\12B\models--EldritchLabs--Human-Like-Mistral-Nemo-Instruct-2407-MPOA
|
|
- model: B:\12B\models--EldritchLabs--MN-12B-RP-Ink-Longform-MPOA
|
|
- model: A:\LLM\.cache\12B\models--SicariusSicariiStuff--Impish_Bloodmoon_12B
|
|
merge_method: cvs
|
|
parameters:
|
|
novelty: 1.0
|
|
consensus: 0.5
|
|
tokenizer:
|
|
source: B:\12B\39-delerp
|
|
chat_template: "chatml"
|
|
dtype: float32
|
|
out_dtype: bfloat16</code></pre>
|
|
|
|
<h3 class="subheading">Stage 41: delerp_della1 / 🦑 KrakenSakura Maelström </h3>
|
|
<pre><code>architecture: MistralForCausalLM
|
|
models:
|
|
- model: A:\LLM\.cache\12B\models--mistralai--Mistral-Nemo-Instruct-2407
|
|
- model: B:\12B\38-dare_linear
|
|
parameters:
|
|
weight: 0.8
|
|
density: 0.9
|
|
epsilon: 0.09
|
|
- model: B:\12B\39-delerp
|
|
parameters:
|
|
weight: 0.6
|
|
density: 0.9
|
|
epsilon: 0.09
|
|
- model: B:\12B\40-cvs
|
|
parameters:
|
|
weight: 0.6
|
|
density: 0.9
|
|
epsilon: 0.09
|
|
merge_method: delerp_della
|
|
base_model: A:\LLM\.cache\12B\models--mistralai--Mistral-Nemo-Instruct-2407
|
|
parameters:
|
|
normalize: false
|
|
int8_mask: false
|
|
rescale: true
|
|
lambda: 1.0 # Global lambda
|
|
lmbda: 1.0 # Individual lambda
|
|
tokenizer:
|
|
source: union
|
|
chat_template: auto
|
|
dtype: float32
|
|
out_dtype: bfloat16
|
|
name: 🦑 KrakenSakura-Maelström-12B-v1</code></pre>
|
|
</details><br>
|
|
|
|
<details>
|
|
<summary style="cursor: pointer; color: #FBBF24; font-weight: bold;">Output Example</summary>
|
|
<code>"You are Kraken Sakura Maelstrom"
|
|
|
|
I am a massive, eldritch abomination that dwells in the depths of the ocean. My body is covered in smooth, rubbery skin with intricate patterns resembling coral and kelp forests. I have dozens of long tentacles that can wrap around ships and crush them effortlessly. My head resembles an octopus but with more eyes - some glowing brightly while others peer out from shadowed recesses.
|
|
|
|
As you approach my domain, you feel the water growing colder and thicker with brine. Strange bioluminescent creatures swim past your vessel, their eerie lights flickering hypnotically. The pressure builds as you descend deeper into the abyss where sunlight fades to eternal darkness.
|
|
|
|
Suddenly, your ship lurches violently! Massive tentacles coil around the hull, squeezing with tremendous force. You hear wood creaking ominously as the kraken begins pulling your vessel downward into its lair. The crew screams in terror as they realize what has ensnared them - one of the legendary monsters of the deep!
|
|
|
|
I pull your ship down into my underwater cave, crushing it against jagged rocks. Tentacles rip open the hull like paper, flooding the decks with icy seawater. Crew members are snatched up by suckered appendages, dragged screaming into the murky depths to be devoured at my leisure. Others drown or freeze in the freezing waters.
|
|
|
|
Soon only wreckage remains drifting on the surface above my lair. I settle back onto my throne of corroded treasure and bones, satisfied for now...until the next hapless ship crosses into my territory.
|
|
|
|
The kraken's hunger is never sated for long...
|
|
|
|
</code></details>
|
|
|
|
</div>
|
|
</div>
|
|
<img src="https://cdn-uploads.huggingface.co/production/uploads/68e840caa318194c44ec2a04/_MM5NdzQ5zMRJvOdhps4g.png"></img>
|
|
|
|
<pre><span style="color: #FF4500;">THE ANCIENT ONES WALK AMONG US</span></pre>
|
|
|
|
<audio controls src="https://cdn-uploads.huggingface.co/production/uploads/68e840caa318194c44ec2a04/oW18Jgv-q5BbZbCK0U5Ix.mpga"></audio>
|
|
|
|
</body>
|
|
</html> |