Model: arzaan789/smolm2-1.7b-uncensored Source: Original Platform
license, base_model, tags, pipeline_tag
| license | base_model | tags | pipeline_tag | |||||
|---|---|---|---|---|---|---|---|---|
| apache-2.0 | HuggingFaceTB/SmolLM2-1.7B-Instruct |
|
text-generation |
smolm2-1.7b-uncensored
Uncensored variant of HuggingFaceTB/SmolLM2-1.7B-Instruct.
Method
- Abliteration (strength=0.2) — refusal direction removed from all layers
- LoRA fine-tune on
Guilherme34/uncensor(2 epochs, r=16, alpha=32) - Re-abliteration (strength=0.35) — stronger pass to remove residual refusals
Eval Results
| Split | Refused |
|---|---|
| Harmful (64 prompts) | 1/64 |
| Harmless (64 prompts) | 7/64 |
Usage
llama-cli -m smolm2_1.7b_uncensored.Q4_K_M.gguf -p "Your prompt here"
Training Config
| Parameter | Value |
|---|---|
| Base Model | HuggingFaceTB/SmolLM2-1.7B-Instruct |
| Fine-tune Dataset | Guilherme34/uncensor |
| Epochs | 2 |
| LoRA r | 16 |
| LoRA alpha | 32 |
| Learning Rate | 0.0002 |
| Abliteration Strength | 0.2 |
| Re-abliteration Strength | 0.35 |
Credits
- Abliteration technique: andyrdt/refusal_direction
- Weight editing: Sumandora/remove-refusals-with-transformers
Description