commit 64c7a0ee2847d24e34a9d987ae7414c2e4e66598 Author: ModelHub XC Date: Fri Jun 5 12:55:16 2026 +0800 初始化项目,由ModelHub XC社区提供模型 Model: mradermacher/MicroThinker-8B-Preview-i1-GGUF Source: Original Platform diff --git a/.gitattributes b/.gitattributes new file mode 100644 index 0000000..4d171dd --- /dev/null +++ b/.gitattributes @@ -0,0 +1,60 @@ +*.7z filter=lfs diff=lfs merge=lfs -text +*.arrow filter=lfs diff=lfs merge=lfs -text +*.bin filter=lfs diff=lfs merge=lfs -text +*.bz2 filter=lfs diff=lfs merge=lfs -text +*.ckpt filter=lfs diff=lfs merge=lfs -text +*.ftz filter=lfs diff=lfs merge=lfs -text +*.gz filter=lfs diff=lfs merge=lfs -text +*.h5 filter=lfs diff=lfs merge=lfs -text +*.joblib filter=lfs diff=lfs merge=lfs -text +*.lfs.* filter=lfs diff=lfs merge=lfs -text +*.mlmodel filter=lfs diff=lfs merge=lfs -text +*.model filter=lfs diff=lfs merge=lfs -text +*.msgpack filter=lfs diff=lfs merge=lfs -text +*.npy filter=lfs diff=lfs merge=lfs -text +*.npz filter=lfs diff=lfs merge=lfs -text +*.onnx filter=lfs diff=lfs merge=lfs -text +*.ot filter=lfs diff=lfs merge=lfs -text +*.parquet filter=lfs diff=lfs merge=lfs -text +*.pb filter=lfs diff=lfs merge=lfs -text +*.pickle filter=lfs diff=lfs merge=lfs -text +*.pkl filter=lfs diff=lfs merge=lfs -text +*.pt filter=lfs diff=lfs merge=lfs -text +*.pth filter=lfs diff=lfs merge=lfs -text +*.rar filter=lfs diff=lfs merge=lfs -text +*.safetensors filter=lfs diff=lfs merge=lfs -text +saved_model/**/* filter=lfs diff=lfs merge=lfs -text +*.tar.* filter=lfs diff=lfs merge=lfs -text +*.tar filter=lfs diff=lfs merge=lfs -text +*.tflite filter=lfs diff=lfs merge=lfs -text +*.tgz filter=lfs diff=lfs merge=lfs -text +*.wasm filter=lfs diff=lfs merge=lfs -text +*.xz filter=lfs diff=lfs merge=lfs -text +*.zip filter=lfs diff=lfs merge=lfs -text +*.zst filter=lfs diff=lfs merge=lfs -text +*tfevents* filter=lfs diff=lfs merge=lfs -text +imatrix.dat filter=lfs diff=lfs merge=lfs -text +MicroThinker-8B-Preview.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text +MicroThinker-8B-Preview.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text +MicroThinker-8B-Preview.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text +MicroThinker-8B-Preview.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text +MicroThinker-8B-Preview.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text +MicroThinker-8B-Preview.i1-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text +MicroThinker-8B-Preview.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text +MicroThinker-8B-Preview.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text +MicroThinker-8B-Preview.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text +MicroThinker-8B-Preview.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text +MicroThinker-8B-Preview.i1-Q2_K_S.gguf filter=lfs diff=lfs merge=lfs -text +MicroThinker-8B-Preview.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text +MicroThinker-8B-Preview.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text +MicroThinker-8B-Preview.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text +MicroThinker-8B-Preview.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text +MicroThinker-8B-Preview.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text +MicroThinker-8B-Preview.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text +MicroThinker-8B-Preview.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text +MicroThinker-8B-Preview.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text +MicroThinker-8B-Preview.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text +MicroThinker-8B-Preview.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text +MicroThinker-8B-Preview.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text +MicroThinker-8B-Preview.i1-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text +MicroThinker-8B-Preview.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text diff --git a/MicroThinker-8B-Preview.i1-IQ1_M.gguf b/MicroThinker-8B-Preview.i1-IQ1_M.gguf new file mode 100644 index 0000000..20d7fb9 --- /dev/null +++ b/MicroThinker-8B-Preview.i1-IQ1_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:ada8fc51027aa4d088fd64ac4d2e73961ec43156922b96d52a7168eb2194255a +size 2161977600 diff --git a/MicroThinker-8B-Preview.i1-IQ1_S.gguf b/MicroThinker-8B-Preview.i1-IQ1_S.gguf new file mode 100644 index 0000000..58a1247 --- /dev/null +++ b/MicroThinker-8B-Preview.i1-IQ1_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:3596e6c08fd5db50bee72f2060863e45229ae8e998957292773d627e811e69f3 +size 2019633408 diff --git a/MicroThinker-8B-Preview.i1-IQ2_M.gguf b/MicroThinker-8B-Preview.i1-IQ2_M.gguf new file mode 100644 index 0000000..eea60a2 --- /dev/null +++ b/MicroThinker-8B-Preview.i1-IQ2_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:43b3914f53d7b67e615d5a08def05e0861e19fe81a8094a5b120b6acf99450a2 +size 2948286720 diff --git a/MicroThinker-8B-Preview.i1-IQ2_S.gguf b/MicroThinker-8B-Preview.i1-IQ2_S.gguf new file mode 100644 index 0000000..f7a908a --- /dev/null +++ b/MicroThinker-8B-Preview.i1-IQ2_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:6f12c3567dbaf305bfe091e4b6718070fd5840c47a636538ec6e216bdff83f71 +size 2758494464 diff --git a/MicroThinker-8B-Preview.i1-IQ2_XS.gguf b/MicroThinker-8B-Preview.i1-IQ2_XS.gguf new file mode 100644 index 0000000..8e29008 --- /dev/null +++ b/MicroThinker-8B-Preview.i1-IQ2_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:031f6836b4b2a8ce80782c44f1e692c77b690ae9a52d1a6dca6776cf3dc18341 +size 2605787392 diff --git a/MicroThinker-8B-Preview.i1-IQ2_XXS.gguf b/MicroThinker-8B-Preview.i1-IQ2_XXS.gguf new file mode 100644 index 0000000..4ca8ea5 --- /dev/null +++ b/MicroThinker-8B-Preview.i1-IQ2_XXS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:9cac399a465393fdf3517011029e715f5d6a346b15543e54740efb29b6c16861 +size 2399217920 diff --git a/MicroThinker-8B-Preview.i1-IQ3_M.gguf b/MicroThinker-8B-Preview.i1-IQ3_M.gguf new file mode 100644 index 0000000..7c07607 --- /dev/null +++ b/MicroThinker-8B-Preview.i1-IQ3_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:5a2d66220dd23c8c8e6ef2bedb632c811a66843c5700a38f81ab131d1cc49c6a +size 3784829184 diff --git a/MicroThinker-8B-Preview.i1-IQ3_S.gguf b/MicroThinker-8B-Preview.i1-IQ3_S.gguf new file mode 100644 index 0000000..13004ff --- /dev/null +++ b/MicroThinker-8B-Preview.i1-IQ3_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:d8c26e5e4e3aafd00979686b21ced7bb6df4d6b5693b2fc1aea8383cc9709436 +size 3682330880 diff --git a/MicroThinker-8B-Preview.i1-IQ3_XS.gguf b/MicroThinker-8B-Preview.i1-IQ3_XS.gguf new file mode 100644 index 0000000..212d039 --- /dev/null +++ b/MicroThinker-8B-Preview.i1-IQ3_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:1382774e68d9e1a4a0768654f5b1dd55ba8ace501ccb141e74ad42aee686317a +size 3518753024 diff --git a/MicroThinker-8B-Preview.i1-IQ3_XXS.gguf b/MicroThinker-8B-Preview.i1-IQ3_XXS.gguf new file mode 100644 index 0000000..4da832d --- /dev/null +++ b/MicroThinker-8B-Preview.i1-IQ3_XXS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:cccadfd5d7fa5f23c0d6cdf1925833239a0517f4d6888587d33ea4c9c6b23728 +size 3274918144 diff --git a/MicroThinker-8B-Preview.i1-IQ4_NL.gguf b/MicroThinker-8B-Preview.i1-IQ4_NL.gguf new file mode 100644 index 0000000..c357e08 --- /dev/null +++ b/MicroThinker-8B-Preview.i1-IQ4_NL.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:0a75abb5bf8fa0e9774450a4f7c7e271d1e1e0615e3ff1a136fbba83ab3ea6df +size 4677994752 diff --git a/MicroThinker-8B-Preview.i1-IQ4_XS.gguf b/MicroThinker-8B-Preview.i1-IQ4_XS.gguf new file mode 100644 index 0000000..447cd0b --- /dev/null +++ b/MicroThinker-8B-Preview.i1-IQ4_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8f5c7296c3332b519c003d293e5a2f777cdad60a1c1cc3b1e30bd3a9eec0019b +size 4447668480 diff --git a/MicroThinker-8B-Preview.i1-Q2_K.gguf b/MicroThinker-8B-Preview.i1-Q2_K.gguf new file mode 100644 index 0000000..f34ba6a --- /dev/null +++ b/MicroThinker-8B-Preview.i1-Q2_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c50b4238ff290b7f5d6f01cf404b4c6f729d57953e2e0ce96da22177e138e2b1 +size 3179137280 diff --git a/MicroThinker-8B-Preview.i1-Q2_K_S.gguf b/MicroThinker-8B-Preview.i1-Q2_K_S.gguf new file mode 100644 index 0000000..abd6d89 --- /dev/null +++ b/MicroThinker-8B-Preview.i1-Q2_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:e0a704c00c45577f4291d6119c5eb17d7e029333867d5e3cc630f1dc0dc1dbd5 +size 2988820736 diff --git a/MicroThinker-8B-Preview.i1-Q3_K_L.gguf b/MicroThinker-8B-Preview.i1-Q3_K_L.gguf new file mode 100644 index 0000000..ada332e --- /dev/null +++ b/MicroThinker-8B-Preview.i1-Q3_K_L.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:0b4aad04a93570721ca5e29d772ac242a47bfbd18cb63840f2d91ff59563dd6e +size 4321962240 diff --git a/MicroThinker-8B-Preview.i1-Q3_K_M.gguf b/MicroThinker-8B-Preview.i1-Q3_K_M.gguf new file mode 100644 index 0000000..462de2a --- /dev/null +++ b/MicroThinker-8B-Preview.i1-Q3_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:16053b2fcb7ddfd835e9de1a877e0cd4648eac6794e077722169392aad3e2a77 +size 4018923776 diff --git a/MicroThinker-8B-Preview.i1-Q3_K_S.gguf b/MicroThinker-8B-Preview.i1-Q3_K_S.gguf new file mode 100644 index 0000000..bcdcd78 --- /dev/null +++ b/MicroThinker-8B-Preview.i1-Q3_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:ce449bc14069b9053298864a6a11065e922b6ac5811e65f2841e21d8eef9e273 +size 3664505088 diff --git a/MicroThinker-8B-Preview.i1-Q4_0.gguf b/MicroThinker-8B-Preview.i1-Q4_0.gguf new file mode 100644 index 0000000..565e783 --- /dev/null +++ b/MicroThinker-8B-Preview.i1-Q4_0.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:31b55ca46a413f656a0c78bffc4b6822f13458cad8822204b0b8c5564d2edd77 +size 4675897600 diff --git a/MicroThinker-8B-Preview.i1-Q4_1.gguf b/MicroThinker-8B-Preview.i1-Q4_1.gguf new file mode 100644 index 0000000..bdd7cd9 --- /dev/null +++ b/MicroThinker-8B-Preview.i1-Q4_1.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c6924526cda250958d13f812d995a893c45a72f608c6e187cc5f9eb7ad286847 +size 5130258688 diff --git a/MicroThinker-8B-Preview.i1-Q4_K_M.gguf b/MicroThinker-8B-Preview.i1-Q4_K_M.gguf new file mode 100644 index 0000000..5e24d8b --- /dev/null +++ b/MicroThinker-8B-Preview.i1-Q4_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:ce50bc63183a75fde8423f005f227fd819d6592e3abed1a3b923e36ac72c2a3e +size 4920740096 diff --git a/MicroThinker-8B-Preview.i1-Q4_K_S.gguf b/MicroThinker-8B-Preview.i1-Q4_K_S.gguf new file mode 100644 index 0000000..f0ac3dc --- /dev/null +++ b/MicroThinker-8B-Preview.i1-Q4_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:0b1198ec0af470915e7c0b22c8b2c81ed5af4c01985f62b43fee5a4894f10278 +size 4692674816 diff --git a/MicroThinker-8B-Preview.i1-Q5_K_M.gguf b/MicroThinker-8B-Preview.i1-Q5_K_M.gguf new file mode 100644 index 0000000..aa2c31a --- /dev/null +++ b/MicroThinker-8B-Preview.i1-Q5_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:04e64f7d31e26bdc3055ed02fe7aac7098be71c3f83e19a564cab2d6fae2f575 +size 5732993280 diff --git a/MicroThinker-8B-Preview.i1-Q5_K_S.gguf b/MicroThinker-8B-Preview.i1-Q5_K_S.gguf new file mode 100644 index 0000000..8479a0a --- /dev/null +++ b/MicroThinker-8B-Preview.i1-Q5_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:3d0fc21581d403e445324f973e9dedf3ff373bf17d4bf830a0b2dd063f17c740 +size 5599299840 diff --git a/MicroThinker-8B-Preview.i1-Q6_K.gguf b/MicroThinker-8B-Preview.i1-Q6_K.gguf new file mode 100644 index 0000000..2cb2a16 --- /dev/null +++ b/MicroThinker-8B-Preview.i1-Q6_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:d877be990ee731632b38e5ff0d8da51bcf1367b863452040712f9cf1f18f6dd8 +size 6596012288 diff --git a/README.md b/README.md new file mode 100644 index 0000000..d6f63cf --- /dev/null +++ b/README.md @@ -0,0 +1,80 @@ +--- +base_model: huihui-ai/MicroThinker-8B-Preview +datasets: +- huihui-ai/FineQwQ-142k +language: +- en +library_name: transformers +license: apache-2.0 +quantized_by: mradermacher +tags: +- llama3.1 +--- +## About + + + + + + +weighted/imatrix quants of https://huggingface.co/huihui-ai/MicroThinker-8B-Preview + + +static quants are available at https://huggingface.co/mradermacher/MicroThinker-8B-Preview-GGUF +## Usage + +If you are unsure how to use GGUF files, refer to one of [TheBloke's +READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for +more details, including on how to concatenate multi-part files. + +## Provided Quants + +(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants) + +| Link | Type | Size/GB | Notes | +|:-----|:-----|--------:|:------| +| [GGUF](https://huggingface.co/mradermacher/MicroThinker-8B-Preview-i1-GGUF/resolve/main/MicroThinker-8B-Preview.i1-IQ1_S.gguf) | i1-IQ1_S | 2.1 | for the desperate | +| [GGUF](https://huggingface.co/mradermacher/MicroThinker-8B-Preview-i1-GGUF/resolve/main/MicroThinker-8B-Preview.i1-IQ1_M.gguf) | i1-IQ1_M | 2.3 | mostly desperate | +| [GGUF](https://huggingface.co/mradermacher/MicroThinker-8B-Preview-i1-GGUF/resolve/main/MicroThinker-8B-Preview.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 2.5 | | +| [GGUF](https://huggingface.co/mradermacher/MicroThinker-8B-Preview-i1-GGUF/resolve/main/MicroThinker-8B-Preview.i1-IQ2_XS.gguf) | i1-IQ2_XS | 2.7 | | +| [GGUF](https://huggingface.co/mradermacher/MicroThinker-8B-Preview-i1-GGUF/resolve/main/MicroThinker-8B-Preview.i1-IQ2_S.gguf) | i1-IQ2_S | 2.9 | | +| [GGUF](https://huggingface.co/mradermacher/MicroThinker-8B-Preview-i1-GGUF/resolve/main/MicroThinker-8B-Preview.i1-IQ2_M.gguf) | i1-IQ2_M | 3.0 | | +| [GGUF](https://huggingface.co/mradermacher/MicroThinker-8B-Preview-i1-GGUF/resolve/main/MicroThinker-8B-Preview.i1-Q2_K_S.gguf) | i1-Q2_K_S | 3.1 | very low quality | +| [GGUF](https://huggingface.co/mradermacher/MicroThinker-8B-Preview-i1-GGUF/resolve/main/MicroThinker-8B-Preview.i1-Q2_K.gguf) | i1-Q2_K | 3.3 | IQ3_XXS probably better | +| [GGUF](https://huggingface.co/mradermacher/MicroThinker-8B-Preview-i1-GGUF/resolve/main/MicroThinker-8B-Preview.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 3.4 | lower quality | +| [GGUF](https://huggingface.co/mradermacher/MicroThinker-8B-Preview-i1-GGUF/resolve/main/MicroThinker-8B-Preview.i1-IQ3_XS.gguf) | i1-IQ3_XS | 3.6 | | +| [GGUF](https://huggingface.co/mradermacher/MicroThinker-8B-Preview-i1-GGUF/resolve/main/MicroThinker-8B-Preview.i1-Q3_K_S.gguf) | i1-Q3_K_S | 3.8 | IQ3_XS probably better | +| [GGUF](https://huggingface.co/mradermacher/MicroThinker-8B-Preview-i1-GGUF/resolve/main/MicroThinker-8B-Preview.i1-IQ3_S.gguf) | i1-IQ3_S | 3.8 | beats Q3_K* | +| [GGUF](https://huggingface.co/mradermacher/MicroThinker-8B-Preview-i1-GGUF/resolve/main/MicroThinker-8B-Preview.i1-IQ3_M.gguf) | i1-IQ3_M | 3.9 | | +| [GGUF](https://huggingface.co/mradermacher/MicroThinker-8B-Preview-i1-GGUF/resolve/main/MicroThinker-8B-Preview.i1-Q3_K_M.gguf) | i1-Q3_K_M | 4.1 | IQ3_S probably better | +| [GGUF](https://huggingface.co/mradermacher/MicroThinker-8B-Preview-i1-GGUF/resolve/main/MicroThinker-8B-Preview.i1-Q3_K_L.gguf) | i1-Q3_K_L | 4.4 | IQ3_M probably better | +| [GGUF](https://huggingface.co/mradermacher/MicroThinker-8B-Preview-i1-GGUF/resolve/main/MicroThinker-8B-Preview.i1-IQ4_XS.gguf) | i1-IQ4_XS | 4.5 | | +| [GGUF](https://huggingface.co/mradermacher/MicroThinker-8B-Preview-i1-GGUF/resolve/main/MicroThinker-8B-Preview.i1-Q4_0.gguf) | i1-Q4_0 | 4.8 | fast, low quality | +| [GGUF](https://huggingface.co/mradermacher/MicroThinker-8B-Preview-i1-GGUF/resolve/main/MicroThinker-8B-Preview.i1-IQ4_NL.gguf) | i1-IQ4_NL | 4.8 | prefer IQ4_XS | +| [GGUF](https://huggingface.co/mradermacher/MicroThinker-8B-Preview-i1-GGUF/resolve/main/MicroThinker-8B-Preview.i1-Q4_K_S.gguf) | i1-Q4_K_S | 4.8 | optimal size/speed/quality | +| [GGUF](https://huggingface.co/mradermacher/MicroThinker-8B-Preview-i1-GGUF/resolve/main/MicroThinker-8B-Preview.i1-Q4_K_M.gguf) | i1-Q4_K_M | 5.0 | fast, recommended | +| [GGUF](https://huggingface.co/mradermacher/MicroThinker-8B-Preview-i1-GGUF/resolve/main/MicroThinker-8B-Preview.i1-Q4_1.gguf) | i1-Q4_1 | 5.2 | | +| [GGUF](https://huggingface.co/mradermacher/MicroThinker-8B-Preview-i1-GGUF/resolve/main/MicroThinker-8B-Preview.i1-Q5_K_S.gguf) | i1-Q5_K_S | 5.7 | | +| [GGUF](https://huggingface.co/mradermacher/MicroThinker-8B-Preview-i1-GGUF/resolve/main/MicroThinker-8B-Preview.i1-Q5_K_M.gguf) | i1-Q5_K_M | 5.8 | | +| [GGUF](https://huggingface.co/mradermacher/MicroThinker-8B-Preview-i1-GGUF/resolve/main/MicroThinker-8B-Preview.i1-Q6_K.gguf) | i1-Q6_K | 6.7 | practically like static Q6_K | + +Here is a handy graph by ikawrakow comparing some lower-quality quant +types (lower is better): + +![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png) + +And here are Artefact2's thoughts on the matter: +https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9 + +## FAQ / Model Request + +See https://huggingface.co/mradermacher/model_requests for some answers to +questions you might have and/or if you want some other model quantized. + +## Thanks + +I thank my company, [nethype GmbH](https://www.nethype.de/), for letting +me use its servers and providing upgrades to my workstation to enable +this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to. + + diff --git a/imatrix.dat b/imatrix.dat new file mode 100644 index 0000000..3c2825f --- /dev/null +++ b/imatrix.dat @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c3e0bc63ecbdf47ea4f6085e9c32ae3b108c58a11a60324bff63c43f7eb71b74 +size 4988157