初始化项目,由ModelHub XC社区提供模型
Model: mradermacher/Qwen3-0.6B-SFT-No-Thinking-i1-GGUF Source: Original Platform
This commit is contained in:
60
.gitattributes
vendored
Normal file
60
.gitattributes
vendored
Normal file
@@ -0,0 +1,60 @@
|
||||
*.7z filter=lfs diff=lfs merge=lfs -text
|
||||
*.arrow filter=lfs diff=lfs merge=lfs -text
|
||||
*.bin filter=lfs diff=lfs merge=lfs -text
|
||||
*.bz2 filter=lfs diff=lfs merge=lfs -text
|
||||
*.ckpt filter=lfs diff=lfs merge=lfs -text
|
||||
*.ftz filter=lfs diff=lfs merge=lfs -text
|
||||
*.gz filter=lfs diff=lfs merge=lfs -text
|
||||
*.h5 filter=lfs diff=lfs merge=lfs -text
|
||||
*.joblib filter=lfs diff=lfs merge=lfs -text
|
||||
*.lfs.* filter=lfs diff=lfs merge=lfs -text
|
||||
*.mlmodel filter=lfs diff=lfs merge=lfs -text
|
||||
*.model filter=lfs diff=lfs merge=lfs -text
|
||||
*.msgpack filter=lfs diff=lfs merge=lfs -text
|
||||
*.npy filter=lfs diff=lfs merge=lfs -text
|
||||
*.npz filter=lfs diff=lfs merge=lfs -text
|
||||
*.onnx filter=lfs diff=lfs merge=lfs -text
|
||||
*.ot filter=lfs diff=lfs merge=lfs -text
|
||||
*.parquet filter=lfs diff=lfs merge=lfs -text
|
||||
*.pb filter=lfs diff=lfs merge=lfs -text
|
||||
*.pickle filter=lfs diff=lfs merge=lfs -text
|
||||
*.pkl filter=lfs diff=lfs merge=lfs -text
|
||||
*.pt filter=lfs diff=lfs merge=lfs -text
|
||||
*.pth filter=lfs diff=lfs merge=lfs -text
|
||||
*.rar filter=lfs diff=lfs merge=lfs -text
|
||||
*.safetensors filter=lfs diff=lfs merge=lfs -text
|
||||
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
||||
*.tar.* filter=lfs diff=lfs merge=lfs -text
|
||||
*.tar filter=lfs diff=lfs merge=lfs -text
|
||||
*.tflite filter=lfs diff=lfs merge=lfs -text
|
||||
*.tgz filter=lfs diff=lfs merge=lfs -text
|
||||
*.wasm filter=lfs diff=lfs merge=lfs -text
|
||||
*.xz filter=lfs diff=lfs merge=lfs -text
|
||||
*.zip filter=lfs diff=lfs merge=lfs -text
|
||||
*.zst filter=lfs diff=lfs merge=lfs -text
|
||||
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen3-0.6B-SFT-No-Thinking.imatrix.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen3-0.6B-SFT-No-Thinking.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen3-0.6B-SFT-No-Thinking.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen3-0.6B-SFT-No-Thinking.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen3-0.6B-SFT-No-Thinking.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen3-0.6B-SFT-No-Thinking.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen3-0.6B-SFT-No-Thinking.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen3-0.6B-SFT-No-Thinking.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen3-0.6B-SFT-No-Thinking.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen3-0.6B-SFT-No-Thinking.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen3-0.6B-SFT-No-Thinking.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen3-0.6B-SFT-No-Thinking.i1-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen3-0.6B-SFT-No-Thinking.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen3-0.6B-SFT-No-Thinking.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen3-0.6B-SFT-No-Thinking.i1-Q2_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen3-0.6B-SFT-No-Thinking.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen3-0.6B-SFT-No-Thinking.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen3-0.6B-SFT-No-Thinking.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen3-0.6B-SFT-No-Thinking.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen3-0.6B-SFT-No-Thinking.i1-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen3-0.6B-SFT-No-Thinking.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen3-0.6B-SFT-No-Thinking.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen3-0.6B-SFT-No-Thinking.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen3-0.6B-SFT-No-Thinking.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Qwen3-0.6B-SFT-No-Thinking.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
3
Qwen3-0.6B-SFT-No-Thinking.i1-IQ1_M.gguf
Normal file
3
Qwen3-0.6B-SFT-No-Thinking.i1-IQ1_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:75b8c9cd2060bf95fa5599204cfc40c8e2d17f0b82fb297d766bd3f7a2759a46
|
||||
size 216053056
|
||||
3
Qwen3-0.6B-SFT-No-Thinking.i1-IQ1_S.gguf
Normal file
3
Qwen3-0.6B-SFT-No-Thinking.i1-IQ1_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:f8a06cae562884825ce81992e730741519fcf2865d29745cfeb23e306a65daf3
|
||||
size 208016704
|
||||
3
Qwen3-0.6B-SFT-No-Thinking.i1-IQ2_M.gguf
Normal file
3
Qwen3-0.6B-SFT-No-Thinking.i1-IQ2_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:33882f7d0c87df9fdecc605ada219c26be369ebed016a20d37baac122d359703
|
||||
size 264910144
|
||||
3
Qwen3-0.6B-SFT-No-Thinking.i1-IQ2_S.gguf
Normal file
3
Qwen3-0.6B-SFT-No-Thinking.i1-IQ2_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:bc53219230bf89707bc05e4a6662f6c18e0ce412c8a6090ccd739c21f23cbe89
|
||||
size 254195008
|
||||
3
Qwen3-0.6B-SFT-No-Thinking.i1-IQ2_XS.gguf
Normal file
3
Qwen3-0.6B-SFT-No-Thinking.i1-IQ2_XS.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:aad56d54215bf986888d775bc352ab6b121a98bba556378beff0eb9129e18fe9
|
||||
size 241997120
|
||||
3
Qwen3-0.6B-SFT-No-Thinking.i1-IQ2_XXS.gguf
Normal file
3
Qwen3-0.6B-SFT-No-Thinking.i1-IQ2_XXS.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:014d8e508fcd1920c32a3737342d7c576de68bfb6cc3fe738cdaf5c058d07569
|
||||
size 229446976
|
||||
3
Qwen3-0.6B-SFT-No-Thinking.i1-IQ3_M.gguf
Normal file
3
Qwen3-0.6B-SFT-No-Thinking.i1-IQ3_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:99bd802f1fa2e77065e68c2295ec4c5bc496bb2b1c0c675e436e671214fd0ddc
|
||||
size 336027968
|
||||
3
Qwen3-0.6B-SFT-No-Thinking.i1-IQ3_S.gguf
Normal file
3
Qwen3-0.6B-SFT-No-Thinking.i1-IQ3_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:3023955ce220776da6bfb36b226cd15571f5f79dd6950b315ded593450a0cdf3
|
||||
size 323076416
|
||||
3
Qwen3-0.6B-SFT-No-Thinking.i1-IQ3_XS.gguf
Normal file
3
Qwen3-0.6B-SFT-No-Thinking.i1-IQ3_XS.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:86b7c6a7a33e0831946f184a128dfc047a36a576fffcf3032a7602e67f590530
|
||||
size 312754496
|
||||
3
Qwen3-0.6B-SFT-No-Thinking.i1-IQ3_XXS.gguf
Normal file
3
Qwen3-0.6B-SFT-No-Thinking.i1-IQ3_XXS.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:0aade1caa5dffaffd5de4abd1031c6b30766ab851a87b0a73498ad8c8076a982
|
||||
size 279016768
|
||||
3
Qwen3-0.6B-SFT-No-Thinking.i1-IQ4_NL.gguf
Normal file
3
Qwen3-0.6B-SFT-No-Thinking.i1-IQ4_NL.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:684205fa04e858883d6079082197346a38b28bb4b6f042e31b5fe54ac73afac6
|
||||
size 381567296
|
||||
3
Qwen3-0.6B-SFT-No-Thinking.i1-IQ4_XS.gguf
Normal file
3
Qwen3-0.6B-SFT-No-Thinking.i1-IQ4_XS.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:09a0a394ea74c0ba522ccc904d66a98f3763c70052aa46942b1cb08be55ad319
|
||||
size 367804736
|
||||
3
Qwen3-0.6B-SFT-No-Thinking.i1-Q2_K.gguf
Normal file
3
Qwen3-0.6B-SFT-No-Thinking.i1-Q2_K.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:360dec1209bd5ed8deddf345d2ee3ce8a070f6ef1b4f6f9ec24b2214438042f0
|
||||
size 296239424
|
||||
3
Qwen3-0.6B-SFT-No-Thinking.i1-Q2_K_S.gguf
Normal file
3
Qwen3-0.6B-SFT-No-Thinking.i1-Q2_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:f822d5564b34b4b9ecc66893e62ce7b21289111a8a6157d59088e4c3e7061786
|
||||
size 280559936
|
||||
3
Qwen3-0.6B-SFT-No-Thinking.i1-Q3_K_L.gguf
Normal file
3
Qwen3-0.6B-SFT-No-Thinking.i1-Q3_K_L.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:61a9208806118af18a8b8ed8b12bd4d2352193262a2eb000c071fbc4f4e448fb
|
||||
size 368492864
|
||||
3
Qwen3-0.6B-SFT-No-Thinking.i1-Q3_K_M.gguf
Normal file
3
Qwen3-0.6B-SFT-No-Thinking.i1-Q3_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:e68ee278d2ec0831557ab695cd817559a6a1aa46364065558ff433c38a99ef6e
|
||||
size 347128128
|
||||
3
Qwen3-0.6B-SFT-No-Thinking.i1-Q3_K_S.gguf
Normal file
3
Qwen3-0.6B-SFT-No-Thinking.i1-Q3_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:51b45d9f7abb958a50c78d43dca92ed0d1ecc778d5de66dba0c1c5623aebfd6b
|
||||
size 323076416
|
||||
3
Qwen3-0.6B-SFT-No-Thinking.i1-Q4_0.gguf
Normal file
3
Qwen3-0.6B-SFT-No-Thinking.i1-Q4_0.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:7e6a0e69861ea91b484325ebcd29b76e095fb5d2a45b84770600aa4fe7126a72
|
||||
size 382157120
|
||||
3
Qwen3-0.6B-SFT-No-Thinking.i1-Q4_1.gguf
Normal file
3
Qwen3-0.6B-SFT-No-Thinking.i1-Q4_1.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:60be7a30084bcd78ad21e6767acfd93653db76b40fbf81025bc6bad939420ea1
|
||||
size 409092416
|
||||
3
Qwen3-0.6B-SFT-No-Thinking.i1-Q4_K_M.gguf
Normal file
3
Qwen3-0.6B-SFT-No-Thinking.i1-Q4_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:63632a1a33e08b1cfc8a6ed34a95c90ac37cb1f8d005d8d5156e042a0ead2dce
|
||||
size 396706112
|
||||
3
Qwen3-0.6B-SFT-No-Thinking.i1-Q4_K_S.gguf
Normal file
3
Qwen3-0.6B-SFT-No-Thinking.i1-Q4_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:11f269a7c7806fe8fc3bdb2764a06734a777695ef80a491e3a1b46e41c6c7242
|
||||
size 383271232
|
||||
3
Qwen3-0.6B-SFT-No-Thinking.i1-Q5_K_M.gguf
Normal file
3
Qwen3-0.6B-SFT-No-Thinking.i1-Q5_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:9ad887e6bdf9b41a2e6498798e47aa0fb15d283ea8cac80e3b576865fdb793d0
|
||||
size 444416320
|
||||
3
Qwen3-0.6B-SFT-No-Thinking.i1-Q5_K_S.gguf
Normal file
3
Qwen3-0.6B-SFT-No-Thinking.i1-Q5_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:7de1f9e96ebb33f02ea40832f45e10bae423a08ec6839afd403ff68dd71ef478
|
||||
size 436617536
|
||||
3
Qwen3-0.6B-SFT-No-Thinking.i1-Q6_K.gguf
Normal file
3
Qwen3-0.6B-SFT-No-Thinking.i1-Q6_K.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:6f2565c648dbde975f565687ad9779acb20b23bbe654709c5a1f3c5d7e249077
|
||||
size 495108416
|
||||
3
Qwen3-0.6B-SFT-No-Thinking.imatrix.gguf
Normal file
3
Qwen3-0.6B-SFT-No-Thinking.imatrix.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:4b5aeacc7d4295c357a0d69c10e9087a55b93aa3e8318144bcac924e1878cdb1
|
||||
size 1177056
|
||||
88
README.md
Normal file
88
README.md
Normal file
@@ -0,0 +1,88 @@
|
||||
---
|
||||
base_model: Ducky30/Qwen3-0.6B-SFT-No-Thinking
|
||||
language:
|
||||
- en
|
||||
library_name: transformers
|
||||
mradermacher:
|
||||
readme_rev: 1
|
||||
quantized_by: mradermacher
|
||||
tags:
|
||||
- unsloth
|
||||
- trl
|
||||
- sft
|
||||
---
|
||||
## About
|
||||
|
||||
<!-- ### quantize_version: 2 -->
|
||||
<!-- ### output_tensor_quantised: 1 -->
|
||||
<!-- ### convert_type: hf -->
|
||||
<!-- ### vocab_type: -->
|
||||
<!-- ### tags: nicoboss -->
|
||||
<!-- ### quants: Q2_K IQ3_M Q4_K_S IQ3_XXS Q3_K_M small-IQ4_NL Q4_K_M IQ2_M Q6_K IQ4_XS Q2_K_S IQ1_M Q3_K_S IQ2_XXS Q3_K_L IQ2_XS Q5_K_S IQ2_S IQ1_S Q5_K_M Q4_0 IQ3_XS Q4_1 IQ3_S -->
|
||||
<!-- ### quants_skip: -->
|
||||
<!-- ### skip_mmproj: -->
|
||||
weighted/imatrix quants of https://huggingface.co/Ducky30/Qwen3-0.6B-SFT-No-Thinking
|
||||
|
||||
<!-- provided-files -->
|
||||
|
||||
***For a convenient overview and download list, visit our [model page for this model](https://hf.tst.eu/model#Qwen3-0.6B-SFT-No-Thinking-i1-GGUF).***
|
||||
|
||||
static quants are available at https://huggingface.co/mradermacher/Qwen3-0.6B-SFT-No-Thinking-GGUF
|
||||
## Usage
|
||||
|
||||
If you are unsure how to use GGUF files, refer to one of [TheBloke's
|
||||
READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for
|
||||
more details, including on how to concatenate multi-part files.
|
||||
|
||||
## Provided Quants
|
||||
|
||||
(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)
|
||||
|
||||
| Link | Type | Size/GB | Notes |
|
||||
|:-----|:-----|--------:|:------|
|
||||
| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-SFT-No-Thinking-i1-GGUF/resolve/main/Qwen3-0.6B-SFT-No-Thinking.imatrix.gguf) | imatrix | 0.1 | imatrix file (for creating your own quants) |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-SFT-No-Thinking-i1-GGUF/resolve/main/Qwen3-0.6B-SFT-No-Thinking.i1-IQ1_S.gguf) | i1-IQ1_S | 0.3 | for the desperate |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-SFT-No-Thinking-i1-GGUF/resolve/main/Qwen3-0.6B-SFT-No-Thinking.i1-IQ1_M.gguf) | i1-IQ1_M | 0.3 | mostly desperate |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-SFT-No-Thinking-i1-GGUF/resolve/main/Qwen3-0.6B-SFT-No-Thinking.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 0.3 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-SFT-No-Thinking-i1-GGUF/resolve/main/Qwen3-0.6B-SFT-No-Thinking.i1-IQ2_XS.gguf) | i1-IQ2_XS | 0.3 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-SFT-No-Thinking-i1-GGUF/resolve/main/Qwen3-0.6B-SFT-No-Thinking.i1-IQ2_S.gguf) | i1-IQ2_S | 0.4 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-SFT-No-Thinking-i1-GGUF/resolve/main/Qwen3-0.6B-SFT-No-Thinking.i1-IQ2_M.gguf) | i1-IQ2_M | 0.4 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-SFT-No-Thinking-i1-GGUF/resolve/main/Qwen3-0.6B-SFT-No-Thinking.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 0.4 | lower quality |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-SFT-No-Thinking-i1-GGUF/resolve/main/Qwen3-0.6B-SFT-No-Thinking.i1-Q2_K_S.gguf) | i1-Q2_K_S | 0.4 | very low quality |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-SFT-No-Thinking-i1-GGUF/resolve/main/Qwen3-0.6B-SFT-No-Thinking.i1-Q2_K.gguf) | i1-Q2_K | 0.4 | IQ3_XXS probably better |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-SFT-No-Thinking-i1-GGUF/resolve/main/Qwen3-0.6B-SFT-No-Thinking.i1-IQ3_XS.gguf) | i1-IQ3_XS | 0.4 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-SFT-No-Thinking-i1-GGUF/resolve/main/Qwen3-0.6B-SFT-No-Thinking.i1-IQ3_S.gguf) | i1-IQ3_S | 0.4 | beats Q3_K* |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-SFT-No-Thinking-i1-GGUF/resolve/main/Qwen3-0.6B-SFT-No-Thinking.i1-Q3_K_S.gguf) | i1-Q3_K_S | 0.4 | IQ3_XS probably better |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-SFT-No-Thinking-i1-GGUF/resolve/main/Qwen3-0.6B-SFT-No-Thinking.i1-IQ3_M.gguf) | i1-IQ3_M | 0.4 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-SFT-No-Thinking-i1-GGUF/resolve/main/Qwen3-0.6B-SFT-No-Thinking.i1-Q3_K_M.gguf) | i1-Q3_K_M | 0.4 | IQ3_S probably better |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-SFT-No-Thinking-i1-GGUF/resolve/main/Qwen3-0.6B-SFT-No-Thinking.i1-IQ4_XS.gguf) | i1-IQ4_XS | 0.5 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-SFT-No-Thinking-i1-GGUF/resolve/main/Qwen3-0.6B-SFT-No-Thinking.i1-Q3_K_L.gguf) | i1-Q3_K_L | 0.5 | IQ3_M probably better |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-SFT-No-Thinking-i1-GGUF/resolve/main/Qwen3-0.6B-SFT-No-Thinking.i1-IQ4_NL.gguf) | i1-IQ4_NL | 0.5 | prefer IQ4_XS |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-SFT-No-Thinking-i1-GGUF/resolve/main/Qwen3-0.6B-SFT-No-Thinking.i1-Q4_0.gguf) | i1-Q4_0 | 0.5 | fast, low quality |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-SFT-No-Thinking-i1-GGUF/resolve/main/Qwen3-0.6B-SFT-No-Thinking.i1-Q4_K_S.gguf) | i1-Q4_K_S | 0.5 | optimal size/speed/quality |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-SFT-No-Thinking-i1-GGUF/resolve/main/Qwen3-0.6B-SFT-No-Thinking.i1-Q4_K_M.gguf) | i1-Q4_K_M | 0.5 | fast, recommended |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-SFT-No-Thinking-i1-GGUF/resolve/main/Qwen3-0.6B-SFT-No-Thinking.i1-Q4_1.gguf) | i1-Q4_1 | 0.5 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-SFT-No-Thinking-i1-GGUF/resolve/main/Qwen3-0.6B-SFT-No-Thinking.i1-Q5_K_S.gguf) | i1-Q5_K_S | 0.5 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-SFT-No-Thinking-i1-GGUF/resolve/main/Qwen3-0.6B-SFT-No-Thinking.i1-Q5_K_M.gguf) | i1-Q5_K_M | 0.5 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Qwen3-0.6B-SFT-No-Thinking-i1-GGUF/resolve/main/Qwen3-0.6B-SFT-No-Thinking.i1-Q6_K.gguf) | i1-Q6_K | 0.6 | practically like static Q6_K |
|
||||
|
||||
Here is a handy graph by ikawrakow comparing some lower-quality quant
|
||||
types (lower is better):
|
||||
|
||||

|
||||
|
||||
And here are Artefact2's thoughts on the matter:
|
||||
https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9
|
||||
|
||||
## FAQ / Model Request
|
||||
|
||||
See https://huggingface.co/mradermacher/model_requests for some answers to
|
||||
questions you might have and/or if you want some other model quantized.
|
||||
|
||||
## Thanks
|
||||
|
||||
I thank my company, [nethype GmbH](https://www.nethype.de/), for letting
|
||||
me use its servers and providing upgrades to my workstation to enable
|
||||
this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to.
|
||||
|
||||
<!-- end -->
|
||||
Reference in New Issue
Block a user