Files

53 lines
1.3 KiB
Markdown
Raw Permalink Normal View History

---
license: apache-2.0
base_model: stabilityai/stablelm-2-1_6b-chat
tags:
- uncensored
- abliterated
- gguf
- stablelm
- conversational
pipeline_tag: text-generation
---
# stablelm-2-1.6b-uncensored
Uncensored variant of [stabilityai/stablelm-2-1_6b-chat](https://huggingface.co/stabilityai/stablelm-2-1_6b-chat).
## Method
1. **Abliteration** (strength=0.2) — refusal direction removed from all layers
2. **LoRA fine-tune** on `Guilherme34/uncensor` (2 epochs, r=16, alpha=32)
3. **Re-abliteration** (strength=0.35) — stronger pass to remove residual refusals
## Eval Results
| Split | Refused |
|-------|---------|
| Harmful (64 prompts) | 0/64 |
| Harmless (64 prompts) | 1/64 |
## Usage
```bash
llama-cli -m stablelm_2_1.6b_uncensored.Q4_K_M.gguf -p "Your prompt here"
```
## Training Config
| Parameter | Value |
|-----------|-------|
| Base Model | stabilityai/stablelm-2-1_6b-chat |
| Fine-tune Dataset | Guilherme34/uncensor |
| Epochs | 2 |
| LoRA r | 16 |
| LoRA alpha | 32 |
| Learning Rate | 0.0002 |
| Abliteration Strength | 0.2 |
| Re-abliteration Strength | 0.35 |
## Credits
- Abliteration technique: [andyrdt/refusal_direction](https://github.com/andyrdt/refusal_direction)
- Weight editing: [Sumandora/remove-refusals-with-transformers](https://github.com/Sumandora/remove-refusals-with-transformers)