初始化项目,由ModelHub XC社区提供模型

Model: mlabonne/gemma-3-4b-it-abliterated-GGUF
Source: Original Platform
This commit is contained in:
ModelHub XC
2026-06-19 14:59:19 +08:00
commit 0c08c039aa
9 changed files with 102 additions and 0 deletions

42
.gitattributes vendored Normal file
View File

@@ -0,0 +1,42 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
gemma-3-4b-it-abliterated.fp16.gguf filter=lfs diff=lfs merge=lfs -text
gemma-3-4b-it-abliterated.q2_k.gguf filter=lfs diff=lfs merge=lfs -text
gemma-3-4b-it-abliterated.q3_k_m.gguf filter=lfs diff=lfs merge=lfs -text
gemma-3-4b-it-abliterated.q4_k_m.gguf filter=lfs diff=lfs merge=lfs -text
gemma-3-4b-it-abliterated.q5_k_m.gguf filter=lfs diff=lfs merge=lfs -text
gemma-3-4b-it-abliterated.q6_k.gguf filter=lfs diff=lfs merge=lfs -text
gemma-3-4b-it-abliterated.q8_0.gguf filter=lfs diff=lfs merge=lfs -text

39
README.md Normal file
View File

@@ -0,0 +1,39 @@
---
license: gemma
library_name: transformers
pipeline_tag: image-text-to-text
base_model: google/gemma-3-4b-it
tags:
- autoquant
- gguf
---
# 💎 Gemma 3 4B IT Abliterated
![image/png](https://cdn-uploads.huggingface.co/production/uploads/61b8e2ba285851687028d395/WjFfc8hhj20r5XK07Yny9.png)
<center><a href="https://huggingface.co/mlabonne/gemma-3-12b-it-abliterated">Gemma 3 12B Abliterated</a><a href="https://huggingface.co/mlabonne/gemma-3-27b-it-abliterated">Gemma 3 27B Abliterated</a></center>
This is an uncensored version of [google/gemma-3-4b-it](https://huggingface.co/google/gemma-3-4b-it) created with a new abliteration technique.
See [this article](https://huggingface.co/blog/mlabonne/abliteration) to know more about abliteration.
I was playing with model weights and noticed that Gemma 3 was much more resilient to abliteration than other models like Qwen 2.5.
I experimented with a few recipes to remove refusals while preserving most of the model capabilities.
Note that this is fairly experimental, so it might not turn out as well as expected. I saw some garbled text from time to time (e.g., "It' my" instead of "It's my").
I recommend using these generation parameters: `temperature=1.0`, `top_k=64`, `top_p=0.95`.
## ✂️ Layerwise abliteration
![image/png](https://cdn-uploads.huggingface.co/production/uploads/61b8e2ba285851687028d395/fmwidMV9SvlStlsogCcea.png)
In the original technique, a refusal direction is computed by comparing the residual streams between target (harmful) and baseline (harmless) samples.
Here, the model was abliterated by computing a refusal direction based on hidden states (inspired by [Sumandora's repo](https://github.com/Sumandora/remove-refusals-with-transformers/)) for most layers (layer 7 to 29), independently.
This is combined with a refusal weight that follows a symmetric pattern from 0.05 to a peak of 0.55.
This created a very high acceptance rate (>90%) and still produced coherent outputs.
## ⚡️ Quantization
TBD.

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:7d718721f1310ab76ee32efed584f62b12001848dd8c6ce1a5817872ae4f2421
size 7767803776

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:5b969e816f3f0644b74192655d7d19abf55501f3f8407e229a1546403b995310
size 1729164416

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:3db3665fee1dcd0f37d17bb624bc099b8260366d7c927b3ac305880642f692a4
size 2098459776

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:99dc4abe5d344e1033f09e6a2c1f4e3d39093dd1f007b9d3254d8c2ef94950e1
size 2489894016

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:0218d5cbb0ae803e18d6b31243c39261f507208dfaf9364cae9d6f1d710500cb
size 2829698176

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:87f417fdd329e833a7c74f031359dde04da748d5bc50c67075e5b8b989243327
size 3190740096

View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:2768a0446ecc0e51768f2836821dc75abafd342e234c3c7aa130ffdfe2a93c07
size 4130402176