commit 4ee82d38087af0263892784a5dbe9fdb8615b0fe Author: ModelHub XC Date: Thu May 14 01:28:25 2026 +0800 初始化项目,由ModelHub XC社区提供模型 Model: mradermacher/LLaMA-3.1-turkis-8b-i1-GGUF Source: Original Platform diff --git a/.gitattributes b/.gitattributes new file mode 100644 index 0000000..9e7eb24 --- /dev/null +++ b/.gitattributes @@ -0,0 +1,60 @@ +*.7z filter=lfs diff=lfs merge=lfs -text +*.arrow filter=lfs diff=lfs merge=lfs -text +*.bin filter=lfs diff=lfs merge=lfs -text +*.bz2 filter=lfs diff=lfs merge=lfs -text +*.ckpt filter=lfs diff=lfs merge=lfs -text +*.ftz filter=lfs diff=lfs merge=lfs -text +*.gz filter=lfs diff=lfs merge=lfs -text +*.h5 filter=lfs diff=lfs merge=lfs -text +*.joblib filter=lfs diff=lfs merge=lfs -text +*.lfs.* filter=lfs diff=lfs merge=lfs -text +*.mlmodel filter=lfs diff=lfs merge=lfs -text +*.model filter=lfs diff=lfs merge=lfs -text +*.msgpack filter=lfs diff=lfs merge=lfs -text +*.npy filter=lfs diff=lfs merge=lfs -text +*.npz filter=lfs diff=lfs merge=lfs -text +*.onnx filter=lfs diff=lfs merge=lfs -text +*.ot filter=lfs diff=lfs merge=lfs -text +*.parquet filter=lfs diff=lfs merge=lfs -text +*.pb filter=lfs diff=lfs merge=lfs -text +*.pickle filter=lfs diff=lfs merge=lfs -text +*.pkl filter=lfs diff=lfs merge=lfs -text +*.pt filter=lfs diff=lfs merge=lfs -text +*.pth filter=lfs diff=lfs merge=lfs -text +*.rar filter=lfs diff=lfs merge=lfs -text +*.safetensors filter=lfs diff=lfs merge=lfs -text +saved_model/**/* filter=lfs diff=lfs merge=lfs -text +*.tar.* filter=lfs diff=lfs merge=lfs -text +*.tar filter=lfs diff=lfs merge=lfs -text +*.tflite filter=lfs diff=lfs merge=lfs -text +*.tgz filter=lfs diff=lfs merge=lfs -text +*.wasm filter=lfs diff=lfs merge=lfs -text +*.xz filter=lfs diff=lfs merge=lfs -text +*.zip filter=lfs diff=lfs merge=lfs -text +*.zst filter=lfs diff=lfs merge=lfs -text +*tfevents* filter=lfs diff=lfs merge=lfs -text +LLaMA-3.1-turkis-8b.imatrix.gguf filter=lfs diff=lfs merge=lfs -text +LLaMA-3.1-turkis-8b.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text +LLaMA-3.1-turkis-8b.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text +LLaMA-3.1-turkis-8b.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text +LLaMA-3.1-turkis-8b.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text +LLaMA-3.1-turkis-8b.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text +LLaMA-3.1-turkis-8b.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text +LLaMA-3.1-turkis-8b.i1-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text +LLaMA-3.1-turkis-8b.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text +LLaMA-3.1-turkis-8b.i1-Q2_K_S.gguf filter=lfs diff=lfs merge=lfs -text +LLaMA-3.1-turkis-8b.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text +LLaMA-3.1-turkis-8b.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text +LLaMA-3.1-turkis-8b.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text +LLaMA-3.1-turkis-8b.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text +LLaMA-3.1-turkis-8b.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text +LLaMA-3.1-turkis-8b.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text +LLaMA-3.1-turkis-8b.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text +LLaMA-3.1-turkis-8b.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text +LLaMA-3.1-turkis-8b.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text +LLaMA-3.1-turkis-8b.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text +LLaMA-3.1-turkis-8b.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text +LLaMA-3.1-turkis-8b.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text +LLaMA-3.1-turkis-8b.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text +LLaMA-3.1-turkis-8b.i1-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text +LLaMA-3.1-turkis-8b.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text diff --git a/LLaMA-3.1-turkis-8b.i1-IQ1_M.gguf b/LLaMA-3.1-turkis-8b.i1-IQ1_M.gguf new file mode 100644 index 0000000..c9e4f41 --- /dev/null +++ b/LLaMA-3.1-turkis-8b.i1-IQ1_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c4c3c7e8a9f9fcb76c31ecdbdedcec893fa03dd6807272b67272cbf667dbbd3f +size 2161973152 diff --git a/LLaMA-3.1-turkis-8b.i1-IQ1_S.gguf b/LLaMA-3.1-turkis-8b.i1-IQ1_S.gguf new file mode 100644 index 0000000..7e6d8cf --- /dev/null +++ b/LLaMA-3.1-turkis-8b.i1-IQ1_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:647396b4902385844a80f34ec64890b308807864419b2f5422aa692e1350c439 +size 2019628960 diff --git a/LLaMA-3.1-turkis-8b.i1-IQ2_M.gguf b/LLaMA-3.1-turkis-8b.i1-IQ2_M.gguf new file mode 100644 index 0000000..296e719 --- /dev/null +++ b/LLaMA-3.1-turkis-8b.i1-IQ2_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:97dcbe619e3c28598190b2d1d9a79f058c06082eb093a3e3bf197bb56781e3e8 +size 2948282272 diff --git a/LLaMA-3.1-turkis-8b.i1-IQ2_S.gguf b/LLaMA-3.1-turkis-8b.i1-IQ2_S.gguf new file mode 100644 index 0000000..a4bb29a --- /dev/null +++ b/LLaMA-3.1-turkis-8b.i1-IQ2_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:3904a5531a5a2eaa10059240331c47887362d5967161809da667821ccdbb3b9d +size 2758490016 diff --git a/LLaMA-3.1-turkis-8b.i1-IQ2_XS.gguf b/LLaMA-3.1-turkis-8b.i1-IQ2_XS.gguf new file mode 100644 index 0000000..e82a3cd --- /dev/null +++ b/LLaMA-3.1-turkis-8b.i1-IQ2_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:d8400fbf29dbeaeea836f554d20deaa10dd75b22c9a14a2c17b5950f6d9edd47 +size 2605782944 diff --git a/LLaMA-3.1-turkis-8b.i1-IQ2_XXS.gguf b/LLaMA-3.1-turkis-8b.i1-IQ2_XXS.gguf new file mode 100644 index 0000000..f3e8b8d --- /dev/null +++ b/LLaMA-3.1-turkis-8b.i1-IQ2_XXS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:f4c75c18532a6da864457fd754951776fcfcd1dd0e7b9cbbabf0ba2e9b3dbd0e +size 2399213472 diff --git a/LLaMA-3.1-turkis-8b.i1-IQ3_M.gguf b/LLaMA-3.1-turkis-8b.i1-IQ3_M.gguf new file mode 100644 index 0000000..6272efd --- /dev/null +++ b/LLaMA-3.1-turkis-8b.i1-IQ3_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:10d1186022f026df98d2892e88b4cfecff3984a5bf9789d7d98a4d263eb72270 +size 3784824736 diff --git a/LLaMA-3.1-turkis-8b.i1-IQ3_S.gguf b/LLaMA-3.1-turkis-8b.i1-IQ3_S.gguf new file mode 100644 index 0000000..366806f --- /dev/null +++ b/LLaMA-3.1-turkis-8b.i1-IQ3_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:7711d95068b88599976d0a6e32e56e63ffc4e4c3d4cc3835a4d5d8277fb34a9b +size 3682326432 diff --git a/LLaMA-3.1-turkis-8b.i1-IQ3_XS.gguf b/LLaMA-3.1-turkis-8b.i1-IQ3_XS.gguf new file mode 100644 index 0000000..cbe6d68 --- /dev/null +++ b/LLaMA-3.1-turkis-8b.i1-IQ3_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:d11f12dabaca1092b23f5a9f6b0a5d3fae789527a8a4fe5da30099c123cca01f +size 3518748576 diff --git a/LLaMA-3.1-turkis-8b.i1-IQ3_XXS.gguf b/LLaMA-3.1-turkis-8b.i1-IQ3_XXS.gguf new file mode 100644 index 0000000..70987d2 --- /dev/null +++ b/LLaMA-3.1-turkis-8b.i1-IQ3_XXS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:1b706a22a1b41d7c472b881d24e74df24c717e7b67e5aade6b3949ce56e70241 +size 3274913696 diff --git a/LLaMA-3.1-turkis-8b.i1-IQ4_NL.gguf b/LLaMA-3.1-turkis-8b.i1-IQ4_NL.gguf new file mode 100644 index 0000000..8a898e1 --- /dev/null +++ b/LLaMA-3.1-turkis-8b.i1-IQ4_NL.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:50e55c97fd27aa5664821d6fc38eca3cdea1a64f458b8f2a189029709f13e7f2 +size 4677990304 diff --git a/LLaMA-3.1-turkis-8b.i1-IQ4_XS.gguf b/LLaMA-3.1-turkis-8b.i1-IQ4_XS.gguf new file mode 100644 index 0000000..d668b30 --- /dev/null +++ b/LLaMA-3.1-turkis-8b.i1-IQ4_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:80407ca0f4c638b3f1001f142bea1f32152aaa3df8c669225a85913b615a2bfa +size 4447664032 diff --git a/LLaMA-3.1-turkis-8b.i1-Q2_K.gguf b/LLaMA-3.1-turkis-8b.i1-Q2_K.gguf new file mode 100644 index 0000000..7881152 --- /dev/null +++ b/LLaMA-3.1-turkis-8b.i1-Q2_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:3221d077a2eca6b68be618565885cc11a59f648bc3e20ef934b23f6d0f5e693f +size 3179132832 diff --git a/LLaMA-3.1-turkis-8b.i1-Q2_K_S.gguf b/LLaMA-3.1-turkis-8b.i1-Q2_K_S.gguf new file mode 100644 index 0000000..a9f294f --- /dev/null +++ b/LLaMA-3.1-turkis-8b.i1-Q2_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8c775df4da0f674d61669f577e5a9529ff69448cd9ad8c9126661af091044ea8 +size 2988816288 diff --git a/LLaMA-3.1-turkis-8b.i1-Q3_K_L.gguf b/LLaMA-3.1-turkis-8b.i1-Q3_K_L.gguf new file mode 100644 index 0000000..9cdeabd --- /dev/null +++ b/LLaMA-3.1-turkis-8b.i1-Q3_K_L.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:fe28c70faca411230536a8dad9e618983284a2523243988192f839073d9d93f4 +size 4321957792 diff --git a/LLaMA-3.1-turkis-8b.i1-Q3_K_M.gguf b/LLaMA-3.1-turkis-8b.i1-Q3_K_M.gguf new file mode 100644 index 0000000..108322c --- /dev/null +++ b/LLaMA-3.1-turkis-8b.i1-Q3_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:0dd6f17d046583f9ef209dd3710ba7c69756a312c5714d6cb5c39a7f40d2f455 +size 4018919328 diff --git a/LLaMA-3.1-turkis-8b.i1-Q3_K_S.gguf b/LLaMA-3.1-turkis-8b.i1-Q3_K_S.gguf new file mode 100644 index 0000000..e4e5256 --- /dev/null +++ b/LLaMA-3.1-turkis-8b.i1-Q3_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:de635cdc020cff51142653031f567f8db5f83cb9355b67454709a3eda890f739 +size 3664500640 diff --git a/LLaMA-3.1-turkis-8b.i1-Q4_0.gguf b/LLaMA-3.1-turkis-8b.i1-Q4_0.gguf new file mode 100644 index 0000000..31dfdb4 --- /dev/null +++ b/LLaMA-3.1-turkis-8b.i1-Q4_0.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:087a7fda978c9d11fe73d0df80bd498d77e1f69426d9241f2164c3ea2c2be3a0 +size 4675893152 diff --git a/LLaMA-3.1-turkis-8b.i1-Q4_1.gguf b/LLaMA-3.1-turkis-8b.i1-Q4_1.gguf new file mode 100644 index 0000000..a245851 --- /dev/null +++ b/LLaMA-3.1-turkis-8b.i1-Q4_1.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:41b598d10f6cefd3120e4210bd5fde3c59bae905e51df2df70441073fdb585ea +size 5130254240 diff --git a/LLaMA-3.1-turkis-8b.i1-Q4_K_M.gguf b/LLaMA-3.1-turkis-8b.i1-Q4_K_M.gguf new file mode 100644 index 0000000..70c794a --- /dev/null +++ b/LLaMA-3.1-turkis-8b.i1-Q4_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:4c259f2311805313ffd01107870eeee65e3223252da6cb93d2fd67845bc7f0e1 +size 4920735648 diff --git a/LLaMA-3.1-turkis-8b.i1-Q4_K_S.gguf b/LLaMA-3.1-turkis-8b.i1-Q4_K_S.gguf new file mode 100644 index 0000000..f06516b --- /dev/null +++ b/LLaMA-3.1-turkis-8b.i1-Q4_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:945698373071e644353e59252b9b56430db79bd7fbad4c6ed032261c75c21c17 +size 4692670368 diff --git a/LLaMA-3.1-turkis-8b.i1-Q5_K_M.gguf b/LLaMA-3.1-turkis-8b.i1-Q5_K_M.gguf new file mode 100644 index 0000000..a19af73 --- /dev/null +++ b/LLaMA-3.1-turkis-8b.i1-Q5_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:ce3a119aa2b9d101637b8c75b3190c38fb172deb9ef5a5ca8031f616d9a46396 +size 5732988832 diff --git a/LLaMA-3.1-turkis-8b.i1-Q5_K_S.gguf b/LLaMA-3.1-turkis-8b.i1-Q5_K_S.gguf new file mode 100644 index 0000000..4744f68 --- /dev/null +++ b/LLaMA-3.1-turkis-8b.i1-Q5_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:2a95dac2637a0a6041ac408d2bbe65cb0b8a426426ab425a11afcb5c90df64b3 +size 5599295392 diff --git a/LLaMA-3.1-turkis-8b.i1-Q6_K.gguf b/LLaMA-3.1-turkis-8b.i1-Q6_K.gguf new file mode 100644 index 0000000..f64480f --- /dev/null +++ b/LLaMA-3.1-turkis-8b.i1-Q6_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:18f701a740badd8f1cfdcde950f74e09b33d3f149d7d8601a7c398c3926c0ab2 +size 6596007840 diff --git a/LLaMA-3.1-turkis-8b.imatrix.gguf b/LLaMA-3.1-turkis-8b.imatrix.gguf new file mode 100644 index 0000000..52cfc63 --- /dev/null +++ b/LLaMA-3.1-turkis-8b.imatrix.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:1421ef4327888b8ae842f19bfa4a79e22df8432d63cef2303e85a0c3ea9749f2 +size 5015200 diff --git a/README.md b/README.md new file mode 100644 index 0000000..11c531b --- /dev/null +++ b/README.md @@ -0,0 +1,93 @@ +--- +base_model: Ali-Yaser/LLaMA-3.1-turkis-8b +datasets: +- kadirnar/combined-turkish-datasets-v5 +language: +- tr +library_name: transformers +license: llama3.1 +mradermacher: + readme_rev: 1 +quantized_by: mradermacher +tags: +- llama +- türkçe +- llm +- fine-tune +- LoRA +--- +## About + + + + + + + + + +weighted/imatrix quants of https://huggingface.co/Ali-Yaser/LLaMA-3.1-turkis-8b + + + +***For a convenient overview and download list, visit our [model page for this model](https://hf.tst.eu/model#LLaMA-3.1-turkis-8b-i1-GGUF).*** + +static quants are available at https://huggingface.co/mradermacher/LLaMA-3.1-turkis-8b-GGUF +## Usage + +If you are unsure how to use GGUF files, refer to one of [TheBloke's +READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for +more details, including on how to concatenate multi-part files. + +## Provided Quants + +(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants) + +| Link | Type | Size/GB | Notes | +|:-----|:-----|--------:|:------| +| [GGUF](https://huggingface.co/mradermacher/LLaMA-3.1-turkis-8b-i1-GGUF/resolve/main/LLaMA-3.1-turkis-8b.imatrix.gguf) | imatrix | 0.1 | imatrix file (for creating your own quants) | +| [GGUF](https://huggingface.co/mradermacher/LLaMA-3.1-turkis-8b-i1-GGUF/resolve/main/LLaMA-3.1-turkis-8b.i1-IQ1_S.gguf) | i1-IQ1_S | 2.1 | for the desperate | +| [GGUF](https://huggingface.co/mradermacher/LLaMA-3.1-turkis-8b-i1-GGUF/resolve/main/LLaMA-3.1-turkis-8b.i1-IQ1_M.gguf) | i1-IQ1_M | 2.3 | mostly desperate | +| [GGUF](https://huggingface.co/mradermacher/LLaMA-3.1-turkis-8b-i1-GGUF/resolve/main/LLaMA-3.1-turkis-8b.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 2.5 | | +| [GGUF](https://huggingface.co/mradermacher/LLaMA-3.1-turkis-8b-i1-GGUF/resolve/main/LLaMA-3.1-turkis-8b.i1-IQ2_XS.gguf) | i1-IQ2_XS | 2.7 | | +| [GGUF](https://huggingface.co/mradermacher/LLaMA-3.1-turkis-8b-i1-GGUF/resolve/main/LLaMA-3.1-turkis-8b.i1-IQ2_S.gguf) | i1-IQ2_S | 2.9 | | +| [GGUF](https://huggingface.co/mradermacher/LLaMA-3.1-turkis-8b-i1-GGUF/resolve/main/LLaMA-3.1-turkis-8b.i1-IQ2_M.gguf) | i1-IQ2_M | 3.0 | | +| [GGUF](https://huggingface.co/mradermacher/LLaMA-3.1-turkis-8b-i1-GGUF/resolve/main/LLaMA-3.1-turkis-8b.i1-Q2_K_S.gguf) | i1-Q2_K_S | 3.1 | very low quality | +| [GGUF](https://huggingface.co/mradermacher/LLaMA-3.1-turkis-8b-i1-GGUF/resolve/main/LLaMA-3.1-turkis-8b.i1-Q2_K.gguf) | i1-Q2_K | 3.3 | IQ3_XXS probably better | +| [GGUF](https://huggingface.co/mradermacher/LLaMA-3.1-turkis-8b-i1-GGUF/resolve/main/LLaMA-3.1-turkis-8b.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 3.4 | lower quality | +| [GGUF](https://huggingface.co/mradermacher/LLaMA-3.1-turkis-8b-i1-GGUF/resolve/main/LLaMA-3.1-turkis-8b.i1-IQ3_XS.gguf) | i1-IQ3_XS | 3.6 | | +| [GGUF](https://huggingface.co/mradermacher/LLaMA-3.1-turkis-8b-i1-GGUF/resolve/main/LLaMA-3.1-turkis-8b.i1-Q3_K_S.gguf) | i1-Q3_K_S | 3.8 | IQ3_XS probably better | +| [GGUF](https://huggingface.co/mradermacher/LLaMA-3.1-turkis-8b-i1-GGUF/resolve/main/LLaMA-3.1-turkis-8b.i1-IQ3_S.gguf) | i1-IQ3_S | 3.8 | beats Q3_K* | +| [GGUF](https://huggingface.co/mradermacher/LLaMA-3.1-turkis-8b-i1-GGUF/resolve/main/LLaMA-3.1-turkis-8b.i1-IQ3_M.gguf) | i1-IQ3_M | 3.9 | | +| [GGUF](https://huggingface.co/mradermacher/LLaMA-3.1-turkis-8b-i1-GGUF/resolve/main/LLaMA-3.1-turkis-8b.i1-Q3_K_M.gguf) | i1-Q3_K_M | 4.1 | IQ3_S probably better | +| [GGUF](https://huggingface.co/mradermacher/LLaMA-3.1-turkis-8b-i1-GGUF/resolve/main/LLaMA-3.1-turkis-8b.i1-Q3_K_L.gguf) | i1-Q3_K_L | 4.4 | IQ3_M probably better | +| [GGUF](https://huggingface.co/mradermacher/LLaMA-3.1-turkis-8b-i1-GGUF/resolve/main/LLaMA-3.1-turkis-8b.i1-IQ4_XS.gguf) | i1-IQ4_XS | 4.5 | | +| [GGUF](https://huggingface.co/mradermacher/LLaMA-3.1-turkis-8b-i1-GGUF/resolve/main/LLaMA-3.1-turkis-8b.i1-Q4_0.gguf) | i1-Q4_0 | 4.8 | fast, low quality | +| [GGUF](https://huggingface.co/mradermacher/LLaMA-3.1-turkis-8b-i1-GGUF/resolve/main/LLaMA-3.1-turkis-8b.i1-IQ4_NL.gguf) | i1-IQ4_NL | 4.8 | prefer IQ4_XS | +| [GGUF](https://huggingface.co/mradermacher/LLaMA-3.1-turkis-8b-i1-GGUF/resolve/main/LLaMA-3.1-turkis-8b.i1-Q4_K_S.gguf) | i1-Q4_K_S | 4.8 | optimal size/speed/quality | +| [GGUF](https://huggingface.co/mradermacher/LLaMA-3.1-turkis-8b-i1-GGUF/resolve/main/LLaMA-3.1-turkis-8b.i1-Q4_K_M.gguf) | i1-Q4_K_M | 5.0 | fast, recommended | +| [GGUF](https://huggingface.co/mradermacher/LLaMA-3.1-turkis-8b-i1-GGUF/resolve/main/LLaMA-3.1-turkis-8b.i1-Q4_1.gguf) | i1-Q4_1 | 5.2 | | +| [GGUF](https://huggingface.co/mradermacher/LLaMA-3.1-turkis-8b-i1-GGUF/resolve/main/LLaMA-3.1-turkis-8b.i1-Q5_K_S.gguf) | i1-Q5_K_S | 5.7 | | +| [GGUF](https://huggingface.co/mradermacher/LLaMA-3.1-turkis-8b-i1-GGUF/resolve/main/LLaMA-3.1-turkis-8b.i1-Q5_K_M.gguf) | i1-Q5_K_M | 5.8 | | +| [GGUF](https://huggingface.co/mradermacher/LLaMA-3.1-turkis-8b-i1-GGUF/resolve/main/LLaMA-3.1-turkis-8b.i1-Q6_K.gguf) | i1-Q6_K | 6.7 | practically like static Q6_K | + +Here is a handy graph by ikawrakow comparing some lower-quality quant +types (lower is better): + +![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png) + +And here are Artefact2's thoughts on the matter: +https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9 + +## FAQ / Model Request + +See https://huggingface.co/mradermacher/model_requests for some answers to +questions you might have and/or if you want some other model quantized. + +## Thanks + +I thank my company, [nethype GmbH](https://www.nethype.de/), for letting +me use its servers and providing upgrades to my workstation to enable +this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to. + +