commit bf41b73631e2035682116d19b9e0f51f56d43c31 Author: ModelHub XC Date: Tue May 5 03:10:17 2026 +0800 初始化项目,由ModelHub XC社区提供模型 Model: mradermacher/Qwen3-VL-4B-Thinking-abliterated-i1-GGUF Source: Original Platform diff --git a/.gitattributes b/.gitattributes new file mode 100644 index 0000000..e20b25e --- /dev/null +++ b/.gitattributes @@ -0,0 +1,60 @@ +*.7z filter=lfs diff=lfs merge=lfs -text +*.arrow filter=lfs diff=lfs merge=lfs -text +*.bin filter=lfs diff=lfs merge=lfs -text +*.bz2 filter=lfs diff=lfs merge=lfs -text +*.ckpt filter=lfs diff=lfs merge=lfs -text +*.ftz filter=lfs diff=lfs merge=lfs -text +*.gz filter=lfs diff=lfs merge=lfs -text +*.h5 filter=lfs diff=lfs merge=lfs -text +*.joblib filter=lfs diff=lfs merge=lfs -text +*.lfs.* filter=lfs diff=lfs merge=lfs -text +*.mlmodel filter=lfs diff=lfs merge=lfs -text +*.model filter=lfs diff=lfs merge=lfs -text +*.msgpack filter=lfs diff=lfs merge=lfs -text +*.npy filter=lfs diff=lfs merge=lfs -text +*.npz filter=lfs diff=lfs merge=lfs -text +*.onnx filter=lfs diff=lfs merge=lfs -text +*.ot filter=lfs diff=lfs merge=lfs -text +*.parquet filter=lfs diff=lfs merge=lfs -text +*.pb filter=lfs diff=lfs merge=lfs -text +*.pickle filter=lfs diff=lfs merge=lfs -text +*.pkl filter=lfs diff=lfs merge=lfs -text +*.pt filter=lfs diff=lfs merge=lfs -text +*.pth filter=lfs diff=lfs merge=lfs -text +*.rar filter=lfs diff=lfs merge=lfs -text +*.safetensors filter=lfs diff=lfs merge=lfs -text +saved_model/**/* filter=lfs diff=lfs merge=lfs -text +*.tar.* filter=lfs diff=lfs merge=lfs -text +*.tar filter=lfs diff=lfs merge=lfs -text +*.tflite filter=lfs diff=lfs merge=lfs -text +*.tgz filter=lfs diff=lfs merge=lfs -text +*.wasm filter=lfs diff=lfs merge=lfs -text +*.xz filter=lfs diff=lfs merge=lfs -text +*.zip filter=lfs diff=lfs merge=lfs -text +*.zst filter=lfs diff=lfs merge=lfs -text +*tfevents* filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-4B-Thinking-abliterated.imatrix.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-4B-Thinking-abliterated.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-4B-Thinking-abliterated.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-4B-Thinking-abliterated.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-4B-Thinking-abliterated.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-4B-Thinking-abliterated.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-4B-Thinking-abliterated.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-4B-Thinking-abliterated.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-4B-Thinking-abliterated.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-4B-Thinking-abliterated.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-4B-Thinking-abliterated.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-4B-Thinking-abliterated.i1-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-4B-Thinking-abliterated.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-4B-Thinking-abliterated.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-4B-Thinking-abliterated.i1-Q2_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-4B-Thinking-abliterated.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-4B-Thinking-abliterated.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-4B-Thinking-abliterated.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-4B-Thinking-abliterated.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-4B-Thinking-abliterated.i1-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-4B-Thinking-abliterated.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-4B-Thinking-abliterated.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-4B-Thinking-abliterated.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-4B-Thinking-abliterated.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Qwen3-VL-4B-Thinking-abliterated.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text diff --git a/Qwen3-VL-4B-Thinking-abliterated.i1-IQ1_M.gguf b/Qwen3-VL-4B-Thinking-abliterated.i1-IQ1_M.gguf new file mode 100644 index 0000000..af3c742 --- /dev/null +++ b/Qwen3-VL-4B-Thinking-abliterated.i1-IQ1_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:620c76eed1713c1ce0aba8881eb64057d08d701bb475a47ba5f671870ecc9330 +size 1127019744 diff --git a/Qwen3-VL-4B-Thinking-abliterated.i1-IQ1_S.gguf b/Qwen3-VL-4B-Thinking-abliterated.i1-IQ1_S.gguf new file mode 100644 index 0000000..e13e2fc --- /dev/null +++ b/Qwen3-VL-4B-Thinking-abliterated.i1-IQ1_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c23c58518bd53c9e3f93fda4e96ca57a9bb979ba6bf204db16e5dab167be0a1a +size 1055257824 diff --git a/Qwen3-VL-4B-Thinking-abliterated.i1-IQ2_M.gguf b/Qwen3-VL-4B-Thinking-abliterated.i1-IQ2_M.gguf new file mode 100644 index 0000000..2631aca --- /dev/null +++ b/Qwen3-VL-4B-Thinking-abliterated.i1-IQ2_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:291874a51da23f642328fcb04b378a296da073e500286de0d115c24ddbd822e4 +size 1512985824 diff --git a/Qwen3-VL-4B-Thinking-abliterated.i1-IQ2_S.gguf b/Qwen3-VL-4B-Thinking-abliterated.i1-IQ2_S.gguf new file mode 100644 index 0000000..a601fbf --- /dev/null +++ b/Qwen3-VL-4B-Thinking-abliterated.i1-IQ2_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:34566cbb6ce6d87384019555cb63c92e868ca8a47d595f6ee8b6ecefda4a8514 +size 1417303264 diff --git a/Qwen3-VL-4B-Thinking-abliterated.i1-IQ2_XS.gguf b/Qwen3-VL-4B-Thinking-abliterated.i1-IQ2_XS.gguf new file mode 100644 index 0000000..98023b4 --- /dev/null +++ b/Qwen3-VL-4B-Thinking-abliterated.i1-IQ2_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:cdb91c150d2b17ce8026b66496808a9879796269d200ec8a089c4d19f91a5118 +size 1354101984 diff --git a/Qwen3-VL-4B-Thinking-abliterated.i1-IQ2_XXS.gguf b/Qwen3-VL-4B-Thinking-abliterated.i1-IQ2_XXS.gguf new file mode 100644 index 0000000..8508be3 --- /dev/null +++ b/Qwen3-VL-4B-Thinking-abliterated.i1-IQ2_XXS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:5311adb352781c16dfd0026a449624e05a1bc3c981cf26ebc9e55f94b7438de2 +size 1246622944 diff --git a/Qwen3-VL-4B-Thinking-abliterated.i1-IQ3_M.gguf b/Qwen3-VL-4B-Thinking-abliterated.i1-IQ3_M.gguf new file mode 100644 index 0000000..3477f2c --- /dev/null +++ b/Qwen3-VL-4B-Thinking-abliterated.i1-IQ3_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:6f3c1a2925d48314c8258ae74cd98967e2603c3d495a025dc561ebb3249f680f +size 1962898144 diff --git a/Qwen3-VL-4B-Thinking-abliterated.i1-IQ3_S.gguf b/Qwen3-VL-4B-Thinking-abliterated.i1-IQ3_S.gguf new file mode 100644 index 0000000..2b4e2f8 --- /dev/null +++ b/Qwen3-VL-4B-Thinking-abliterated.i1-IQ3_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:f2173f4041cc04a5d4ca34d9499b55b0d38a2aff0046732798d0d3a2d8e91854 +size 1899533024 diff --git a/Qwen3-VL-4B-Thinking-abliterated.i1-IQ3_XS.gguf b/Qwen3-VL-4B-Thinking-abliterated.i1-IQ3_XS.gguf new file mode 100644 index 0000000..f7e1b48 --- /dev/null +++ b/Qwen3-VL-4B-Thinking-abliterated.i1-IQ3_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:d2f773ad1e84acafa8efc21b6c328a980286096c973dba6f6e89206238e89aab +size 1814377184 diff --git a/Qwen3-VL-4B-Thinking-abliterated.i1-IQ3_XXS.gguf b/Qwen3-VL-4B-Thinking-abliterated.i1-IQ3_XXS.gguf new file mode 100644 index 0000000..086cfa3 --- /dev/null +++ b/Qwen3-VL-4B-Thinking-abliterated.i1-IQ3_XXS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:22823e0cc6702ea7a85d4d8b92166a00c092ffabbf895c9af4caa9905b6ee370 +size 1670190304 diff --git a/Qwen3-VL-4B-Thinking-abliterated.i1-IQ4_NL.gguf b/Qwen3-VL-4B-Thinking-abliterated.i1-IQ4_NL.gguf new file mode 100644 index 0000000..0787187 --- /dev/null +++ b/Qwen3-VL-4B-Thinking-abliterated.i1-IQ4_NL.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:130fa86b49357e37ba56b775b6d61fa9d5c8349fb29e3e38284893abdc923be3 +size 2381345504 diff --git a/Qwen3-VL-4B-Thinking-abliterated.i1-IQ4_XS.gguf b/Qwen3-VL-4B-Thinking-abliterated.i1-IQ4_XS.gguf new file mode 100644 index 0000000..9ba0d40 --- /dev/null +++ b/Qwen3-VL-4B-Thinking-abliterated.i1-IQ4_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:62ee8db2e0c56245811aad154ac3b3fea448f30e9a0b2e888d66c093075767db +size 2270753504 diff --git a/Qwen3-VL-4B-Thinking-abliterated.i1-Q2_K.gguf b/Qwen3-VL-4B-Thinking-abliterated.i1-Q2_K.gguf new file mode 100644 index 0000000..30acf82 --- /dev/null +++ b/Qwen3-VL-4B-Thinking-abliterated.i1-Q2_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:dcec61d842afc7134e85d4ebba2a45f78b24bce58f05b3fd775161ddbd0846a4 +size 1669501664 diff --git a/Qwen3-VL-4B-Thinking-abliterated.i1-Q2_K_S.gguf b/Qwen3-VL-4B-Thinking-abliterated.i1-Q2_K_S.gguf new file mode 100644 index 0000000..e7d6218 --- /dev/null +++ b/Qwen3-VL-4B-Thinking-abliterated.i1-Q2_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:4b325b5e54fd0e05200b9988f821a28eb42716783d39446a388d486e0f89e6dd +size 1563456224 diff --git a/Qwen3-VL-4B-Thinking-abliterated.i1-Q3_K_L.gguf b/Qwen3-VL-4B-Thinking-abliterated.i1-Q3_K_L.gguf new file mode 100644 index 0000000..3dece87 --- /dev/null +++ b/Qwen3-VL-4B-Thinking-abliterated.i1-Q3_K_L.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:1b8af989fe826c8b0b8c50f9d51301e8182b24dd61f2ee76f381b3677272ce8a +size 2239787744 diff --git a/Qwen3-VL-4B-Thinking-abliterated.i1-Q3_K_M.gguf b/Qwen3-VL-4B-Thinking-abliterated.i1-Q3_K_M.gguf new file mode 100644 index 0000000..5a01938 --- /dev/null +++ b/Qwen3-VL-4B-Thinking-abliterated.i1-Q3_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:fdb01ce6b606eadded612b45c1ee40cb15e07b6f9b20a125ae625195ceb6d5b2 +size 2075620064 diff --git a/Qwen3-VL-4B-Thinking-abliterated.i1-Q3_K_S.gguf b/Qwen3-VL-4B-Thinking-abliterated.i1-Q3_K_S.gguf new file mode 100644 index 0000000..45b3137 --- /dev/null +++ b/Qwen3-VL-4B-Thinking-abliterated.i1-Q3_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:286414c99cbadb30cf724f7b778a20c88f6d2036d8d6d4fb77c3b18a50eaab94 +size 1886999264 diff --git a/Qwen3-VL-4B-Thinking-abliterated.i1-Q4_0.gguf b/Qwen3-VL-4B-Thinking-abliterated.i1-Q4_0.gguf new file mode 100644 index 0000000..2b084ba --- /dev/null +++ b/Qwen3-VL-4B-Thinking-abliterated.i1-Q4_0.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:7a39544ee98ec51674116430feb1424419d0c732fe5adfcfc263e27134347dcb +size 2375774944 diff --git a/Qwen3-VL-4B-Thinking-abliterated.i1-Q4_1.gguf b/Qwen3-VL-4B-Thinking-abliterated.i1-Q4_1.gguf new file mode 100644 index 0000000..a731275 --- /dev/null +++ b/Qwen3-VL-4B-Thinking-abliterated.i1-Q4_1.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:66ddcfc52d413f6126c7dacfef0e4293551e3d26b990398d2f0400b2edfc2244 +size 2596631264 diff --git a/Qwen3-VL-4B-Thinking-abliterated.i1-Q4_K_M.gguf b/Qwen3-VL-4B-Thinking-abliterated.i1-Q4_K_M.gguf new file mode 100644 index 0000000..24d4660 --- /dev/null +++ b/Qwen3-VL-4B-Thinking-abliterated.i1-Q4_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:91aecd4a7f1d025fab60416a9c2cb83d09f24718bf19f2b74d8c8005edc1ae80 +size 2497282784 diff --git a/Qwen3-VL-4B-Thinking-abliterated.i1-Q4_K_S.gguf b/Qwen3-VL-4B-Thinking-abliterated.i1-Q4_K_S.gguf new file mode 100644 index 0000000..dd0653a --- /dev/null +++ b/Qwen3-VL-4B-Thinking-abliterated.i1-Q4_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:75ebea0050f4e80067f129822f118838fe710b130839714daaebbaa71b8c7294 +size 2383311584 diff --git a/Qwen3-VL-4B-Thinking-abliterated.i1-Q5_K_M.gguf b/Qwen3-VL-4B-Thinking-abliterated.i1-Q5_K_M.gguf new file mode 100644 index 0000000..e8eaedb --- /dev/null +++ b/Qwen3-VL-4B-Thinking-abliterated.i1-Q5_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:296411e5bf933ae912a01d2312950291cf771983a4d228848db5237ef62e2cd1 +size 2889515744 diff --git a/Qwen3-VL-4B-Thinking-abliterated.i1-Q5_K_S.gguf b/Qwen3-VL-4B-Thinking-abliterated.i1-Q5_K_S.gguf new file mode 100644 index 0000000..cb7e4c1 --- /dev/null +++ b/Qwen3-VL-4B-Thinking-abliterated.i1-Q5_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:d500510b9f154e99c5390d1e85a17a199e226839b852020c46567047f06a3926 +size 2823713504 diff --git a/Qwen3-VL-4B-Thinking-abliterated.i1-Q6_K.gguf b/Qwen3-VL-4B-Thinking-abliterated.i1-Q6_K.gguf new file mode 100644 index 0000000..b8d4480 --- /dev/null +++ b/Qwen3-VL-4B-Thinking-abliterated.i1-Q6_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:bf72dfce54fae2b5c806775860c3bc766f272534ab289c6f09af1e597f1009ca +size 3306263264 diff --git a/Qwen3-VL-4B-Thinking-abliterated.imatrix.gguf b/Qwen3-VL-4B-Thinking-abliterated.imatrix.gguf new file mode 100644 index 0000000..730b42f --- /dev/null +++ b/Qwen3-VL-4B-Thinking-abliterated.imatrix.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:1a04423c5a65e53fdec565cea8c15109b751eabb95e726d1eddec855b0c1dbf8 +size 3872640 diff --git a/README.md b/README.md new file mode 100644 index 0000000..010b1e1 --- /dev/null +++ b/README.md @@ -0,0 +1,92 @@ +--- +base_model: prithivMLmods/Qwen3-VL-4B-Thinking-abliterated-v1 +language: +- en +library_name: transformers +license: apache-2.0 +mradermacher: + readme_rev: 1 +quantized_by: mradermacher +tags: +- text-generation-inference +- abliterated +- v1.0 +- agent +--- +## About + + + + + + + + + +weighted/imatrix quants of https://huggingface.co/prithivMLmods/Qwen3-VL-4B-Thinking-abliterated-v1 + + + +***For a convenient overview and download list, visit our [model page for this model](https://hf.tst.eu/model#Qwen3-VL-4B-Thinking-abliterated-i1-GGUF).*** + +static quants are available at https://huggingface.co/mradermacher/Qwen3-VL-4B-Thinking-abliterated-GGUF + +**This is a vision model - mmproj files (if any) will be in the [static repository](https://huggingface.co/mradermacher/Qwen3-VL-4B-Thinking-abliterated-GGUF).** +## Usage + +If you are unsure how to use GGUF files, refer to one of [TheBloke's +READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for +more details, including on how to concatenate multi-part files. + +## Provided Quants + +(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants) + +| Link | Type | Size/GB | Notes | +|:-----|:-----|--------:|:------| +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-4B-Thinking-abliterated-i1-GGUF/resolve/main/Qwen3-VL-4B-Thinking-abliterated.imatrix.gguf) | imatrix | 0.1 | imatrix file (for creating your own quants) | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-4B-Thinking-abliterated-i1-GGUF/resolve/main/Qwen3-VL-4B-Thinking-abliterated.i1-IQ1_S.gguf) | i1-IQ1_S | 1.2 | for the desperate | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-4B-Thinking-abliterated-i1-GGUF/resolve/main/Qwen3-VL-4B-Thinking-abliterated.i1-IQ1_M.gguf) | i1-IQ1_M | 1.2 | mostly desperate | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-4B-Thinking-abliterated-i1-GGUF/resolve/main/Qwen3-VL-4B-Thinking-abliterated.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 1.3 | | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-4B-Thinking-abliterated-i1-GGUF/resolve/main/Qwen3-VL-4B-Thinking-abliterated.i1-IQ2_XS.gguf) | i1-IQ2_XS | 1.5 | | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-4B-Thinking-abliterated-i1-GGUF/resolve/main/Qwen3-VL-4B-Thinking-abliterated.i1-IQ2_S.gguf) | i1-IQ2_S | 1.5 | | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-4B-Thinking-abliterated-i1-GGUF/resolve/main/Qwen3-VL-4B-Thinking-abliterated.i1-IQ2_M.gguf) | i1-IQ2_M | 1.6 | | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-4B-Thinking-abliterated-i1-GGUF/resolve/main/Qwen3-VL-4B-Thinking-abliterated.i1-Q2_K_S.gguf) | i1-Q2_K_S | 1.7 | very low quality | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-4B-Thinking-abliterated-i1-GGUF/resolve/main/Qwen3-VL-4B-Thinking-abliterated.i1-Q2_K.gguf) | i1-Q2_K | 1.8 | IQ3_XXS probably better | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-4B-Thinking-abliterated-i1-GGUF/resolve/main/Qwen3-VL-4B-Thinking-abliterated.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 1.8 | lower quality | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-4B-Thinking-abliterated-i1-GGUF/resolve/main/Qwen3-VL-4B-Thinking-abliterated.i1-IQ3_XS.gguf) | i1-IQ3_XS | 1.9 | | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-4B-Thinking-abliterated-i1-GGUF/resolve/main/Qwen3-VL-4B-Thinking-abliterated.i1-Q3_K_S.gguf) | i1-Q3_K_S | 2.0 | IQ3_XS probably better | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-4B-Thinking-abliterated-i1-GGUF/resolve/main/Qwen3-VL-4B-Thinking-abliterated.i1-IQ3_S.gguf) | i1-IQ3_S | 2.0 | beats Q3_K* | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-4B-Thinking-abliterated-i1-GGUF/resolve/main/Qwen3-VL-4B-Thinking-abliterated.i1-IQ3_M.gguf) | i1-IQ3_M | 2.1 | | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-4B-Thinking-abliterated-i1-GGUF/resolve/main/Qwen3-VL-4B-Thinking-abliterated.i1-Q3_K_M.gguf) | i1-Q3_K_M | 2.2 | IQ3_S probably better | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-4B-Thinking-abliterated-i1-GGUF/resolve/main/Qwen3-VL-4B-Thinking-abliterated.i1-Q3_K_L.gguf) | i1-Q3_K_L | 2.3 | IQ3_M probably better | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-4B-Thinking-abliterated-i1-GGUF/resolve/main/Qwen3-VL-4B-Thinking-abliterated.i1-IQ4_XS.gguf) | i1-IQ4_XS | 2.4 | | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-4B-Thinking-abliterated-i1-GGUF/resolve/main/Qwen3-VL-4B-Thinking-abliterated.i1-Q4_0.gguf) | i1-Q4_0 | 2.5 | fast, low quality | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-4B-Thinking-abliterated-i1-GGUF/resolve/main/Qwen3-VL-4B-Thinking-abliterated.i1-IQ4_NL.gguf) | i1-IQ4_NL | 2.5 | prefer IQ4_XS | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-4B-Thinking-abliterated-i1-GGUF/resolve/main/Qwen3-VL-4B-Thinking-abliterated.i1-Q4_K_S.gguf) | i1-Q4_K_S | 2.5 | optimal size/speed/quality | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-4B-Thinking-abliterated-i1-GGUF/resolve/main/Qwen3-VL-4B-Thinking-abliterated.i1-Q4_K_M.gguf) | i1-Q4_K_M | 2.6 | fast, recommended | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-4B-Thinking-abliterated-i1-GGUF/resolve/main/Qwen3-VL-4B-Thinking-abliterated.i1-Q4_1.gguf) | i1-Q4_1 | 2.7 | | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-4B-Thinking-abliterated-i1-GGUF/resolve/main/Qwen3-VL-4B-Thinking-abliterated.i1-Q5_K_S.gguf) | i1-Q5_K_S | 2.9 | | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-4B-Thinking-abliterated-i1-GGUF/resolve/main/Qwen3-VL-4B-Thinking-abliterated.i1-Q5_K_M.gguf) | i1-Q5_K_M | 3.0 | | +| [GGUF](https://huggingface.co/mradermacher/Qwen3-VL-4B-Thinking-abliterated-i1-GGUF/resolve/main/Qwen3-VL-4B-Thinking-abliterated.i1-Q6_K.gguf) | i1-Q6_K | 3.4 | practically like static Q6_K | + +Here is a handy graph by ikawrakow comparing some lower-quality quant +types (lower is better): + +![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png) + +And here are Artefact2's thoughts on the matter: +https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9 + +## FAQ / Model Request + +See https://huggingface.co/mradermacher/model_requests for some answers to +questions you might have and/or if you want some other model quantized. + +## Thanks + +I thank my company, [nethype GmbH](https://www.nethype.de/), for letting +me use its servers and providing upgrades to my workstation to enable +this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to. + +