From a64ac9274a52084d212bf99978c2336fcb2cb5c5 Mon Sep 17 00:00:00 2001 From: ModelHub XC Date: Sat, 9 May 2026 03:39:48 +0800 Subject: [PATCH] =?UTF-8?q?=E5=88=9D=E5=A7=8B=E5=8C=96=E9=A1=B9=E7=9B=AE?= =?UTF-8?q?=EF=BC=8C=E7=94=B1ModelHub=20XC=E7=A4=BE=E5=8C=BA=E6=8F=90?= =?UTF-8?q?=E4=BE=9B=E6=A8=A1=E5=9E=8B?= MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Model: mradermacher/Komodo-Llama-3.2-3B-v2-fp16-GGUF Source: Original Platform --- .gitattributes | 48 ++++++++++++++ Komodo-Llama-3.2-3B-v2-fp16.IQ4_XS.gguf | 3 + Komodo-Llama-3.2-3B-v2-fp16.Q2_K.gguf | 3 + Komodo-Llama-3.2-3B-v2-fp16.Q3_K_L.gguf | 3 + Komodo-Llama-3.2-3B-v2-fp16.Q3_K_M.gguf | 3 + Komodo-Llama-3.2-3B-v2-fp16.Q3_K_S.gguf | 3 + Komodo-Llama-3.2-3B-v2-fp16.Q4_0_4_4.gguf | 3 + Komodo-Llama-3.2-3B-v2-fp16.Q4_K_M.gguf | 3 + Komodo-Llama-3.2-3B-v2-fp16.Q4_K_S.gguf | 3 + Komodo-Llama-3.2-3B-v2-fp16.Q5_K_M.gguf | 3 + Komodo-Llama-3.2-3B-v2-fp16.Q5_K_S.gguf | 3 + Komodo-Llama-3.2-3B-v2-fp16.Q6_K.gguf | 3 + Komodo-Llama-3.2-3B-v2-fp16.Q8_0.gguf | 3 + Komodo-Llama-3.2-3B-v2-fp16.f16.gguf | 3 + README.md | 79 +++++++++++++++++++++++ 15 files changed, 166 insertions(+) create mode 100644 .gitattributes create mode 100644 Komodo-Llama-3.2-3B-v2-fp16.IQ4_XS.gguf create mode 100644 Komodo-Llama-3.2-3B-v2-fp16.Q2_K.gguf create mode 100644 Komodo-Llama-3.2-3B-v2-fp16.Q3_K_L.gguf create mode 100644 Komodo-Llama-3.2-3B-v2-fp16.Q3_K_M.gguf create mode 100644 Komodo-Llama-3.2-3B-v2-fp16.Q3_K_S.gguf create mode 100644 Komodo-Llama-3.2-3B-v2-fp16.Q4_0_4_4.gguf create mode 100644 Komodo-Llama-3.2-3B-v2-fp16.Q4_K_M.gguf create mode 100644 Komodo-Llama-3.2-3B-v2-fp16.Q4_K_S.gguf create mode 100644 Komodo-Llama-3.2-3B-v2-fp16.Q5_K_M.gguf create mode 100644 Komodo-Llama-3.2-3B-v2-fp16.Q5_K_S.gguf create mode 100644 Komodo-Llama-3.2-3B-v2-fp16.Q6_K.gguf create mode 100644 Komodo-Llama-3.2-3B-v2-fp16.Q8_0.gguf create mode 100644 Komodo-Llama-3.2-3B-v2-fp16.f16.gguf create mode 100644 README.md diff --git a/.gitattributes b/.gitattributes new file mode 100644 index 0000000..1a412f1 --- /dev/null +++ b/.gitattributes @@ -0,0 +1,48 @@ +*.7z filter=lfs diff=lfs merge=lfs -text +*.arrow filter=lfs diff=lfs merge=lfs -text +*.bin filter=lfs diff=lfs merge=lfs -text +*.bz2 filter=lfs diff=lfs merge=lfs -text +*.ckpt filter=lfs diff=lfs merge=lfs -text +*.ftz filter=lfs diff=lfs merge=lfs -text +*.gz filter=lfs diff=lfs merge=lfs -text +*.h5 filter=lfs diff=lfs merge=lfs -text +*.joblib filter=lfs diff=lfs merge=lfs -text +*.lfs.* filter=lfs diff=lfs merge=lfs -text +*.mlmodel filter=lfs diff=lfs merge=lfs -text +*.model filter=lfs diff=lfs merge=lfs -text +*.msgpack filter=lfs diff=lfs merge=lfs -text +*.npy filter=lfs diff=lfs merge=lfs -text +*.npz filter=lfs diff=lfs merge=lfs -text +*.onnx filter=lfs diff=lfs merge=lfs -text +*.ot filter=lfs diff=lfs merge=lfs -text +*.parquet filter=lfs diff=lfs merge=lfs -text +*.pb filter=lfs diff=lfs merge=lfs -text +*.pickle filter=lfs diff=lfs merge=lfs -text +*.pkl filter=lfs diff=lfs merge=lfs -text +*.pt filter=lfs diff=lfs merge=lfs -text +*.pth filter=lfs diff=lfs merge=lfs -text +*.rar filter=lfs diff=lfs merge=lfs -text +*.safetensors filter=lfs diff=lfs merge=lfs -text +saved_model/**/* filter=lfs diff=lfs merge=lfs -text +*.tar.* filter=lfs diff=lfs merge=lfs -text +*.tar filter=lfs diff=lfs merge=lfs -text +*.tflite filter=lfs diff=lfs merge=lfs -text +*.tgz filter=lfs diff=lfs merge=lfs -text +*.wasm filter=lfs diff=lfs merge=lfs -text +*.xz filter=lfs diff=lfs merge=lfs -text +*.zip filter=lfs diff=lfs merge=lfs -text +*.zst filter=lfs diff=lfs merge=lfs -text +*tfevents* filter=lfs diff=lfs merge=lfs -text +Komodo-Llama-3.2-3B-v2-fp16.Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Komodo-Llama-3.2-3B-v2-fp16.Q2_K.gguf filter=lfs diff=lfs merge=lfs -text +Komodo-Llama-3.2-3B-v2-fp16.f16.gguf filter=lfs diff=lfs merge=lfs -text +Komodo-Llama-3.2-3B-v2-fp16.Q8_0.gguf filter=lfs diff=lfs merge=lfs -text +Komodo-Llama-3.2-3B-v2-fp16.Q6_K.gguf filter=lfs diff=lfs merge=lfs -text +Komodo-Llama-3.2-3B-v2-fp16.Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text +Komodo-Llama-3.2-3B-v2-fp16.Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Komodo-Llama-3.2-3B-v2-fp16.Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text +Komodo-Llama-3.2-3B-v2-fp16.Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text +Komodo-Llama-3.2-3B-v2-fp16.Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Komodo-Llama-3.2-3B-v2-fp16.Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text +Komodo-Llama-3.2-3B-v2-fp16.IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text +Komodo-Llama-3.2-3B-v2-fp16.Q4_0_4_4.gguf filter=lfs diff=lfs merge=lfs -text diff --git a/Komodo-Llama-3.2-3B-v2-fp16.IQ4_XS.gguf b/Komodo-Llama-3.2-3B-v2-fp16.IQ4_XS.gguf new file mode 100644 index 0000000..27e7fea --- /dev/null +++ b/Komodo-Llama-3.2-3B-v2-fp16.IQ4_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:2df560c4eb727a0afc2a83426b12d881d924ea5c9f8416350c51da455c165b84 +size 1840907584 diff --git a/Komodo-Llama-3.2-3B-v2-fp16.Q2_K.gguf b/Komodo-Llama-3.2-3B-v2-fp16.Q2_K.gguf new file mode 100644 index 0000000..598ab06 --- /dev/null +++ b/Komodo-Llama-3.2-3B-v2-fp16.Q2_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:94ce854c7d9b497788bc2fee64daf2be8a0921ca5aaf8b0d36f9f65cda9dbcfd +size 1363936576 diff --git a/Komodo-Llama-3.2-3B-v2-fp16.Q3_K_L.gguf b/Komodo-Llama-3.2-3B-v2-fp16.Q3_K_L.gguf new file mode 100644 index 0000000..6b9bd02 --- /dev/null +++ b/Komodo-Llama-3.2-3B-v2-fp16.Q3_K_L.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:7ad381d93a00b09340ad3c0b79194723d55c874ed5a1d94ba560391dd37a7262 +size 1815348544 diff --git a/Komodo-Llama-3.2-3B-v2-fp16.Q3_K_M.gguf b/Komodo-Llama-3.2-3B-v2-fp16.Q3_K_M.gguf new file mode 100644 index 0000000..2c1777a --- /dev/null +++ b/Komodo-Llama-3.2-3B-v2-fp16.Q3_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:9757e5bd99d152b7b1bb4c1ab68dbea84524a68f710c543f8ace55fb5323111f +size 1687160128 diff --git a/Komodo-Llama-3.2-3B-v2-fp16.Q3_K_S.gguf b/Komodo-Llama-3.2-3B-v2-fp16.Q3_K_S.gguf new file mode 100644 index 0000000..c5bc030 --- /dev/null +++ b/Komodo-Llama-3.2-3B-v2-fp16.Q3_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:19490d7d57a184937723c7a2e7f4411566c1d81c8597da4227da6142532b62af +size 1542849856 diff --git a/Komodo-Llama-3.2-3B-v2-fp16.Q4_0_4_4.gguf b/Komodo-Llama-3.2-3B-v2-fp16.Q4_0_4_4.gguf new file mode 100644 index 0000000..037443b --- /dev/null +++ b/Komodo-Llama-3.2-3B-v2-fp16.Q4_0_4_4.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:5945e4b069f846e2b418942e7efb3691cb7a5e8405953b229d0cba43d582d7a2 +size 1917191488 diff --git a/Komodo-Llama-3.2-3B-v2-fp16.Q4_K_M.gguf b/Komodo-Llama-3.2-3B-v2-fp16.Q4_K_M.gguf new file mode 100644 index 0000000..f35193c --- /dev/null +++ b/Komodo-Llama-3.2-3B-v2-fp16.Q4_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:31bf282cd54a297c3ea3958543b1dd0558f049cf66ffdfc17ecadd2f0ff3ce86 +size 2019378496 diff --git a/Komodo-Llama-3.2-3B-v2-fp16.Q4_K_S.gguf b/Komodo-Llama-3.2-3B-v2-fp16.Q4_K_S.gguf new file mode 100644 index 0000000..5e5b142 --- /dev/null +++ b/Komodo-Llama-3.2-3B-v2-fp16.Q4_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:e6b2a93fd5e997b2065fa72d9a23677decc10331ad0958ffbf0ed8c20101df79 +size 1928201536 diff --git a/Komodo-Llama-3.2-3B-v2-fp16.Q5_K_M.gguf b/Komodo-Llama-3.2-3B-v2-fp16.Q5_K_M.gguf new file mode 100644 index 0000000..3f57462 --- /dev/null +++ b/Komodo-Llama-3.2-3B-v2-fp16.Q5_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:7922c2db212b07d4aa3138d41ff2fe61e8b2c4346e3f00b3ed0941c2f36c4659 +size 2322154816 diff --git a/Komodo-Llama-3.2-3B-v2-fp16.Q5_K_S.gguf b/Komodo-Llama-3.2-3B-v2-fp16.Q5_K_S.gguf new file mode 100644 index 0000000..5bdd397 --- /dev/null +++ b/Komodo-Llama-3.2-3B-v2-fp16.Q5_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c9297e3eb6ab2960b8fc2832c945aa6609e68d5476d4d2117b734f68f5abf37d +size 2269513024 diff --git a/Komodo-Llama-3.2-3B-v2-fp16.Q6_K.gguf b/Komodo-Llama-3.2-3B-v2-fp16.Q6_K.gguf new file mode 100644 index 0000000..4167273 --- /dev/null +++ b/Komodo-Llama-3.2-3B-v2-fp16.Q6_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:20c8df4ea8af15c7666040ef82e00680f3730dcd228e338bccb48b802523cf04 +size 2643854656 diff --git a/Komodo-Llama-3.2-3B-v2-fp16.Q8_0.gguf b/Komodo-Llama-3.2-3B-v2-fp16.Q8_0.gguf new file mode 100644 index 0000000..c8ab730 --- /dev/null +++ b/Komodo-Llama-3.2-3B-v2-fp16.Q8_0.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:af5ba1fd532822bf7c10431d34c3db8c21d81382e486a02df6dd9a5140629dff +size 3421900096 diff --git a/Komodo-Llama-3.2-3B-v2-fp16.f16.gguf b/Komodo-Llama-3.2-3B-v2-fp16.f16.gguf new file mode 100644 index 0000000..fe7c3ea --- /dev/null +++ b/Komodo-Llama-3.2-3B-v2-fp16.f16.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:5dc28eec7e56134718844ab809b623bf7c1ba2579cbcae5dcfa7a440fbf76f4b +size 6433688896 diff --git a/README.md b/README.md new file mode 100644 index 0000000..0c8d7b8 --- /dev/null +++ b/README.md @@ -0,0 +1,79 @@ +--- +base_model: suayptalha/Komodo-Llama-3.2-3B-v2-fp16 +datasets: +- jeggers/competition_math +language: +- en +- th +- pt +- es +- de +- fr +- it +- hi +library_name: transformers +license: apache-2.0 +quantized_by: mradermacher +tags: +- unsloth +- trl +- sft +- text-generation-inference +--- +## About + + + + + + +static quants of https://huggingface.co/suayptalha/Komodo-Llama-3.2-3B-v2-fp16 + + +weighted/imatrix quants are available at https://huggingface.co/mradermacher/Komodo-Llama-3.2-3B-v2-fp16-i1-GGUF +## Usage + +If you are unsure how to use GGUF files, refer to one of [TheBloke's +READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for +more details, including on how to concatenate multi-part files. + +## Provided Quants + +(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants) + +| Link | Type | Size/GB | Notes | +|:-----|:-----|--------:|:------| +| [GGUF](https://huggingface.co/mradermacher/Komodo-Llama-3.2-3B-v2-fp16-GGUF/resolve/main/Komodo-Llama-3.2-3B-v2-fp16.Q2_K.gguf) | Q2_K | 1.5 | | +| [GGUF](https://huggingface.co/mradermacher/Komodo-Llama-3.2-3B-v2-fp16-GGUF/resolve/main/Komodo-Llama-3.2-3B-v2-fp16.Q3_K_S.gguf) | Q3_K_S | 1.6 | | +| [GGUF](https://huggingface.co/mradermacher/Komodo-Llama-3.2-3B-v2-fp16-GGUF/resolve/main/Komodo-Llama-3.2-3B-v2-fp16.Q3_K_M.gguf) | Q3_K_M | 1.8 | lower quality | +| [GGUF](https://huggingface.co/mradermacher/Komodo-Llama-3.2-3B-v2-fp16-GGUF/resolve/main/Komodo-Llama-3.2-3B-v2-fp16.Q3_K_L.gguf) | Q3_K_L | 1.9 | | +| [GGUF](https://huggingface.co/mradermacher/Komodo-Llama-3.2-3B-v2-fp16-GGUF/resolve/main/Komodo-Llama-3.2-3B-v2-fp16.IQ4_XS.gguf) | IQ4_XS | 1.9 | | +| [GGUF](https://huggingface.co/mradermacher/Komodo-Llama-3.2-3B-v2-fp16-GGUF/resolve/main/Komodo-Llama-3.2-3B-v2-fp16.Q4_0_4_4.gguf) | Q4_0_4_4 | 2.0 | fast on arm, low quality | +| [GGUF](https://huggingface.co/mradermacher/Komodo-Llama-3.2-3B-v2-fp16-GGUF/resolve/main/Komodo-Llama-3.2-3B-v2-fp16.Q4_K_S.gguf) | Q4_K_S | 2.0 | fast, recommended | +| [GGUF](https://huggingface.co/mradermacher/Komodo-Llama-3.2-3B-v2-fp16-GGUF/resolve/main/Komodo-Llama-3.2-3B-v2-fp16.Q4_K_M.gguf) | Q4_K_M | 2.1 | fast, recommended | +| [GGUF](https://huggingface.co/mradermacher/Komodo-Llama-3.2-3B-v2-fp16-GGUF/resolve/main/Komodo-Llama-3.2-3B-v2-fp16.Q5_K_S.gguf) | Q5_K_S | 2.4 | | +| [GGUF](https://huggingface.co/mradermacher/Komodo-Llama-3.2-3B-v2-fp16-GGUF/resolve/main/Komodo-Llama-3.2-3B-v2-fp16.Q5_K_M.gguf) | Q5_K_M | 2.4 | | +| [GGUF](https://huggingface.co/mradermacher/Komodo-Llama-3.2-3B-v2-fp16-GGUF/resolve/main/Komodo-Llama-3.2-3B-v2-fp16.Q6_K.gguf) | Q6_K | 2.7 | very good quality | +| [GGUF](https://huggingface.co/mradermacher/Komodo-Llama-3.2-3B-v2-fp16-GGUF/resolve/main/Komodo-Llama-3.2-3B-v2-fp16.Q8_0.gguf) | Q8_0 | 3.5 | fast, best quality | +| [GGUF](https://huggingface.co/mradermacher/Komodo-Llama-3.2-3B-v2-fp16-GGUF/resolve/main/Komodo-Llama-3.2-3B-v2-fp16.f16.gguf) | f16 | 6.5 | 16 bpw, overkill | + +Here is a handy graph by ikawrakow comparing some lower-quality quant +types (lower is better): + +![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png) + +And here are Artefact2's thoughts on the matter: +https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9 + +## FAQ / Model Request + +See https://huggingface.co/mradermacher/model_requests for some answers to +questions you might have and/or if you want some other model quantized. + +## Thanks + +I thank my company, [nethype GmbH](https://www.nethype.de/), for letting +me use its servers and providing upgrades to my workstation to enable +this work in my free time. + +