Files
MN-VelvetCafe-RP-12B-V2/README.md
ModelHub XC a23cc4774b 初始化项目,由ModelHub XC社区提供模型
Model: IggyLux/MN-VelvetCafe-RP-12B-V2
Source: Original Platform
2026-05-29 12:58:21 +08:00

203 lines
8.8 KiB
Markdown
Raw Permalink Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

---
base_model:
- PocketDoc/Dans-PersonalityEngine-V1.3.0-12b
- kyx0r/Neona-12B
library_name: transformers
tags:
- mergekit
- merge
- roleplaying
- RP
- Writing
- creative
- story
- fiction
- mistral
- text
- adventure
- conversational
license: cc-by-4.0
---
#
![LuxBannerVelvetCafe](https://cdn-uploads.huggingface.co/production/uploads/65bbcee1320702b1043ef8ae/F-SR5UFdMWsHtiGxJRtlL.png)
![Velvet_Cafe](https://cdn-uploads.huggingface.co/production/uploads/65bbcee1320702b1043ef8ae/yo1cOV4_gg4I1AvBncAya.jpeg)
----------------------------------------------------------------------------------------------------------------------------------------------------------
<div style="font-size: 60px; font-weight:bold; text-align:center;">MN-VelvetCafe-RP-12B-V2</div>
----------------------------------------------------------------------------------------------------------------------------------------------------------
<div style="font-size: 20px;"> Thanks for the feedback — it helped fix two main issues in the previous release
- Bad preset ("Iggy's-RP-Preset")
- Apologies if you used it
- DRY sampler settings were wrong (different from what I actually tested)
- Likely caused by duplicating/renaming in SillyTavern — my mistake
- Wrong tokenizer for quants
- Accidentally used Mistral Nemo tokenizer instead of the base (Neona) one
- Caused formatting issues, especially under strong Dan's PE influence
- Fixed by: re-merging with identical SLERP config + using correct tokenizer from the start
- Then quantized (extensively tested on Q4_K_M)
- Old preset behavior (dry_multiplier=1, rep_pen=1.12, freq_pen=0.1):
- Early chat (first 1030 messages): crisp, varied, nice emphasis formatting
- Later chat: strong degradation
- excessive **bold** + *italic quoted speech*
- repetitive dramatic patterns
- forced/unnatural prose
- eventual chaos + noticeable quality drop
- This version should feel much more consistent
- Better format stability
- Significantly less degradation in long roleplays (when using proper sampler settings)
- Hope you enjoy the update — feedback always welcome!
- Next goals
- Experiment with other merge methods
- Try adding a 3rd model to increase response variety and quality</div>
----------------------------------------------------------------------------------------------------------------------------------------------------------
**Static Quants:**
https://huggingface.co/IggyLux/MN-VelvetCafe-RP-12B-V2-Q4_K_M
https://huggingface.co/IggyLux/MN-VelvetCafe-RP-12B-Q8_0-GGUF
----------------------------------------------------------------------------------------------------------------------------------------------------------
**About MN-VelvetCafe-RP-12B-V2**
This is my 5th merge attempt, I'm personally limited to 12B models due to 8GB VRAM
My preferred RP is focused on multi-character group chat RP (2+ characters)
**What makes this merge stand out:**
- Excellent scene/position/clothing tracking → immersive long-term RP
- Balanced, narrative-appropriate emotions (no random aggression/refusals)
- Reliable handling of author's notes & system prompts
**Goal:** Combine Dan's PE (strong character/clothes/personality consistency) with Neona (great style adaptation & instruction following) → visually detailed, consistent RP without losing emotional stability
**Big thanks to the creators — highly recommend trying both bases:**
- **Dans-PersonalityEngine-V1.3.0-12b**
- Extremely strong at natural dialogue, flowing descriptions, lively scenes
- Tuned heavily for RP & creative writing
- Minor downsides: occasional format/punctuation quirks + prefers shorter replies
- https://huggingface.co/PocketDoc/Dans-PersonalityEngine-V1.3.0-12b
- **Neona-12B**
- Very stable & adaptable (SFW slice-of-life ↔ NSFW ERP)
- No hair-trigger extremes or refusals
- Excellent system prompt / author's note obedience
- https://huggingface.co/kyx0r/Neona-12B
**Preferred SillyTavern Templates:**
- ChatML
- Mistral V3-Tekken
----------------------------------------------------------------------------------------------------------------------------------------------------------
![SamplerSettingsModelButton200](https://cdn-uploads.huggingface.co/production/uploads/65bbcee1320702b1043ef8ae/dxljI9mHZMzcFQsjvNl2F.png)
- **Sampler settings included with the model as a starting point**
- File: Iggy's_RP_PresetV2.json
- https://huggingface.co/IggyLux/MN-VelvetCafe-RP-12B-V2/blob/main/Iggy's_RP_PresetV2.json
- Context: 8192 (good balance for 8GB VRAM)
- Preferred Response length: 356
- Default Response length: 512
→ I usually start at 356 and the bump it to 512 for more detail or 1024 for climactic scenes
- **Key preset values**
- Temperature: 0.8 (default)
→ Lower to 0.40.6 for more grounded/less creative output — experiment! (I have gone as high as 1.3)
- **My personal setup**
- Backend: KoboldCpp (GGUF loading) → https://github.com/LostRuins/koboldcpp
- Frontend: SillyTavern (v1.16.0) → https://github.com/SillyTavern/SillyTavern
Feel free to use the preset as a base, but tweak it to match your taste — everyone's RP style is different!
----------------------------------------------------------------------------------------------------------------------------------------------------------
**Character Cards & Roleplay Usage/Examples**
**My approach (I call this the "Iggy format" since I rarely see others do it this way):**
- I avoid putting opening messages on character cards
- I avoid example dialogue entirely — especially any that speaks for the user or introduces third parties
**Why skip these:**
- Heavy example dialogue (especially interviewer-style) can make the model:
- Introduce unwanted extra characters
- Start speaking as the interviewer
- Or even speak for your character
- Example of problematic example-dialogue style: ![image](https://cdn-uploads.huggingface.co/production/uploads/65bbcee1320702b1043ef8ae/LasAhD9QTSR8MNE01F_GZ.png)
**Outdated bloated formats I skip:**
- 15002000 token cards full of P-lists, rigid bullets, personality tables, etc.
- Example of dense P-list style: ![image](https://cdn-uploads.huggingface.co/production/uploads/65bbcee1320702b1043ef8ae/GkghPuNEcZ_PxusavrxTn.png)
- Modern models read standard plain text formatting just fine — no need for over-engineered lists anymore
**How I actually start RPs in SillyTavern:**
1. Use /sys to send a narrative summary first (sets scenario before any messages)
2. Let the character send the real first message
3. If context limit pushes the /sys prompt out:
- Repurpose it into the Group Chat Scenario field
- Or condense into Author's Notes
----------------------------------------------------------------------------------------------------------------------------------------------------------
![ModelButtonIL200](https://cdn-uploads.huggingface.co/production/uploads/65bbcee1320702b1043ef8ae/H3TxfWVWCpjU1SXZAw6HX.png)
- 📖 https://github.com/aikohanasaki/SillyTavern-MemoryBooks/ - For keeping context low and saving older responses as Memories in a Lore Book
- 👀 https://github.com/leandrojofre/SillyTavern-Presence - For Group Chats: Using Presence lets you select what characters can see the user and char's messages.
- 🗣️ https://github.com/mattjaybe/SillyTavern-EchoChamber - New* I recently found this and thought it was pretty cool, you can have a chat comment on your RP.
- 🎯 https://github.com/Samueras/GuidedGenerations-Extension - Helps steer stubborn models, use guides to lock in scenes/details/clothing/positions and more.
This keeps immersion high and context efficient — especially for group/multi-character RP.
Years of low-VRAM local RP taught me these habits. If you hit any snags or want help troubleshooting, just ask!
----------------------------------------------------------------------------------------------------------------------------------------------------------
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
## Merge Details
### Merge Method
This model was merged using the [SLERP](https://en.wikipedia.org/wiki/Slerp) merge method.
### Models Merged
The following models were included in the merge:
* [PocketDoc/Dans-PersonalityEngine-V1.3.0-12b](https://huggingface.co/PocketDoc/Dans-PersonalityEngine-V1.3.0-12b)
* [kyx0r/Neona-12B](https://huggingface.co/kyx0r/Neona-12B)
### Configuration
The following YAML configuration was used to produce this model:
```yaml
models:
- model: kyx0r/Neona-12B
- model: PocketDoc/Dans-PersonalityEngine-V1.3.0-12b
merge_method: slerp
base_model: kyx0r/Neona-12B
parameters:
t:
- value: 0.2
- filter: self_attn
value: [0, 0.2, 0.4, 0.6, 0.8, 1]
- filter: mlp
value: [1, 0.8, 0.6, 0.4, 0.2, 0]
dtype: bfloat16
chat_template: "chatml"
tokenizer:
source: "base"
```