commit 140e30ff73fd805298bdb1ce1aa0837f3529b7ad Author: ModelHub XC Date: Thu Apr 30 09:15:59 2026 +0800 Initialize project; model provided by the ModelHub XC community Model: mradermacher/Falcon-H1-Tiny-90M-Instruct-safe-i1-GGUF Source: Original Platform diff --git a/.gitattributes b/.gitattributes new file mode 100644 index 0000000..56b3533 --- /dev/null +++ b/.gitattributes @@ -0,0 +1,60 @@ +*.7z filter=lfs diff=lfs merge=lfs -text +*.arrow filter=lfs diff=lfs merge=lfs -text +*.bin filter=lfs diff=lfs merge=lfs -text +*.bz2 filter=lfs diff=lfs merge=lfs -text +*.ckpt filter=lfs diff=lfs merge=lfs -text +*.ftz filter=lfs diff=lfs merge=lfs -text +*.gz filter=lfs diff=lfs merge=lfs -text +*.h5 filter=lfs diff=lfs merge=lfs -text +*.joblib filter=lfs diff=lfs merge=lfs -text +*.lfs.* filter=lfs diff=lfs merge=lfs -text +*.mlmodel filter=lfs diff=lfs merge=lfs -text +*.model filter=lfs diff=lfs merge=lfs -text +*.msgpack filter=lfs diff=lfs merge=lfs -text +*.npy filter=lfs diff=lfs merge=lfs -text +*.npz filter=lfs diff=lfs merge=lfs -text +*.onnx filter=lfs diff=lfs merge=lfs -text +*.ot filter=lfs diff=lfs merge=lfs -text +*.parquet filter=lfs diff=lfs merge=lfs -text +*.pb filter=lfs diff=lfs merge=lfs -text +*.pickle filter=lfs diff=lfs merge=lfs -text +*.pkl filter=lfs diff=lfs merge=lfs -text +*.pt filter=lfs diff=lfs merge=lfs -text +*.pth filter=lfs diff=lfs merge=lfs -text +*.rar filter=lfs diff=lfs merge=lfs -text +*.safetensors filter=lfs diff=lfs merge=lfs -text +saved_model/**/* filter=lfs diff=lfs merge=lfs -text +*.tar.* filter=lfs diff=lfs merge=lfs -text +*.tar filter=lfs diff=lfs merge=lfs -text +*.tflite filter=lfs diff=lfs merge=lfs -text +*.tgz filter=lfs diff=lfs merge=lfs -text +*.wasm filter=lfs diff=lfs merge=lfs -text +*.xz filter=lfs diff=lfs merge=lfs -text +*.zip filter=lfs diff=lfs merge=lfs -text +*.zst filter=lfs diff=lfs merge=lfs -text +*tfevents* filter=lfs diff=lfs merge=lfs -text +Falcon-H1-Tiny-90M-Instruct-safe.imatrix.gguf filter=lfs diff=lfs merge=lfs -text 
+Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text +Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text +Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text +Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text +Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text +Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text +Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text +Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text +Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text +Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text +Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text +Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text +Falcon-H1-Tiny-90M-Instruct-safe.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text +Falcon-H1-Tiny-90M-Instruct-safe.i1-Q2_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Falcon-H1-Tiny-90M-Instruct-safe.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text +Falcon-H1-Tiny-90M-Instruct-safe.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text +Falcon-H1-Tiny-90M-Instruct-safe.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Falcon-H1-Tiny-90M-Instruct-safe.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text +Falcon-H1-Tiny-90M-Instruct-safe.i1-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text +Falcon-H1-Tiny-90M-Instruct-safe.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text +Falcon-H1-Tiny-90M-Instruct-safe.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Falcon-H1-Tiny-90M-Instruct-safe.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text +Falcon-H1-Tiny-90M-Instruct-safe.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Falcon-H1-Tiny-90M-Instruct-safe.i1-Q6_K.gguf 
filter=lfs diff=lfs merge=lfs -text diff --git a/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ1_M.gguf b/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ1_M.gguf new file mode 100644 index 0000000..f4cd21c --- /dev/null +++ b/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ1_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:1b29fed2a6d4ead0d9b51c7be1bba6eb74c8c830110ce170540943ccfb2d39ce +size 30356160 diff --git a/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ1_S.gguf b/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ1_S.gguf new file mode 100644 index 0000000..5fbcaba --- /dev/null +++ b/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ1_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:53837700ca5bf7d986cbdcfe38cf2a27494ec39d10dfd8de93286577e88f8e23 +size 28828608 diff --git a/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ2_M.gguf b/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ2_M.gguf new file mode 100644 index 0000000..67d0cc1 --- /dev/null +++ b/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ2_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:dcd2f8d768b2876f2710169e9e8dea1669a7561f7d837d57e612971f704fb51e +size 38176704 diff --git a/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ2_S.gguf b/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ2_S.gguf new file mode 100644 index 0000000..ac9e4de --- /dev/null +++ b/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ2_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:a9128cab1078c122a76287e67c93a0010e94ef2438e77adbe0f61e92148ceed0 +size 36139968 diff --git a/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ2_XS.gguf b/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ2_XS.gguf new file mode 100644 index 0000000..23adbee --- /dev/null +++ b/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ2_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:71e1d14b32164a3df8011dc836afde7a6adc84d5c660b6374433b39a85a2bb20 +size 35135424 diff --git a/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ2_XXS.gguf 
b/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ2_XXS.gguf new file mode 100644 index 0000000..7f40bda --- /dev/null +++ b/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ2_XXS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:bab97067f05a616e95f612371c9471181281690268cbcb6d8f5935ac8481c6d1 +size 32902080 diff --git a/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ3_M.gguf b/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ3_M.gguf new file mode 100644 index 0000000..ec2ea24 --- /dev/null +++ b/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ3_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c88f5ea7ec3147dd9710a78438191b87c1c1c47386d49efd5328ab2a73a0e774 +size 48525760 diff --git a/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ3_S.gguf b/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ3_S.gguf new file mode 100644 index 0000000..6f3570a --- /dev/null +++ b/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ3_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:4aa62ff7d0dc7feb4d9f1fba23bae510759b38df7185953918903e5001dc558b +size 47533504 diff --git a/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ3_XS.gguf b/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ3_XS.gguf new file mode 100644 index 0000000..e1c603b --- /dev/null +++ b/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ3_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:ad1d079538e24df13d002b5af944e9a5af3979c4b7759a594458ffe273d1d3b9 +size 46501312 diff --git a/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ3_XXS.gguf b/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ3_XXS.gguf new file mode 100644 index 0000000..7ca7c8a --- /dev/null +++ b/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ3_XXS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:50a5629fc89efc72a7b0780a102dce2b645c1de7eb46429df350e7948dcbaa7a +size 41703360 diff --git a/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ4_NL.gguf b/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ4_NL.gguf new file mode 100644 index 0000000..a840354 --- /dev/null +++ 
b/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ4_NL.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:6b3ff01a36a30f6080ab81238b73a09272d8643fa64b2ac78d64a0893340e567 +size 57378496 diff --git a/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ4_XS.gguf b/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ4_XS.gguf new file mode 100644 index 0000000..1cd0516 --- /dev/null +++ b/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ4_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:5bf7a40dd5952f4484e2981a0587c77b873dddee1afe7440d992200fa8bffcdd +size 55108288 diff --git a/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q2_K.gguf b/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q2_K.gguf new file mode 100644 index 0000000..af88943 --- /dev/null +++ b/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q2_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:0516157c0d31cf70c32586efe0372d98e57741af34b016412b13e73e9671a43b +size 41752768 diff --git a/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q2_K_S.gguf b/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q2_K_S.gguf new file mode 100644 index 0000000..b0885f7 --- /dev/null +++ b/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q2_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:f9eca3e37381dc65b9950a5c03e548236ad9ab216e94ef4a7d4091bd55b373b9 +size 40431808 diff --git a/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q3_K_L.gguf b/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q3_K_L.gguf new file mode 100644 index 0000000..dea983b --- /dev/null +++ b/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q3_K_L.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:278284750d0fbb34199d203f61de125875970cf250b4b4aa486e973a7f4b1fa1 +size 51785152 diff --git a/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q3_K_M.gguf b/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q3_K_M.gguf new file mode 100644 index 0000000..c6153e9 --- /dev/null +++ b/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q3_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid 
sha256:586a469ad9bfbd3d0b741588c0a6db26c738a6151ac36479910a826d056666e6 +size 49688000 diff --git a/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q3_K_S.gguf b/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q3_K_S.gguf new file mode 100644 index 0000000..4947375 --- /dev/null +++ b/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q3_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:d58f94c8bab2362445ad51f4fae2ba75d06ba475d750f3b9c4ee55a680c8fb70 +size 47324608 diff --git a/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q4_0.gguf b/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q4_0.gguf new file mode 100644 index 0000000..99f0433 --- /dev/null +++ b/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q4_0.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:6491f9a2b1891150eccfd953385eb7330a694aacf934db7fa2ab233cd6c2d84e +size 57255616 diff --git a/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q4_1.gguf b/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q4_1.gguf new file mode 100644 index 0000000..e0388c9 --- /dev/null +++ b/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q4_1.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:dfc9594f2145a0b392dd09c5f8e2c1fa845f7ffe760fdc8c5069fc4270401ced +size 61820608 diff --git a/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q4_K_M.gguf b/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q4_K_M.gguf new file mode 100644 index 0000000..574b6c4 --- /dev/null +++ b/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q4_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:b2fcc89751759f1b2768c3e832682a9f002ebe7f2d5d608ab1e0bc711c7d474a +size 58601152 diff --git a/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q4_K_S.gguf b/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q4_K_S.gguf new file mode 100644 index 0000000..6828fb9 --- /dev/null +++ b/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q4_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:9528307fc1854c1d23f240191d3c2485237c9c6e5b0cc9be3269e0fe8d965d4a +size 57362112 diff --git 
a/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q5_K_M.gguf b/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q5_K_M.gguf new file mode 100644 index 0000000..c394a14 --- /dev/null +++ b/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q5_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:167eadfa616c7198f1860b7faf2da89d2da695c96bfa8b58cc4f7d8a5c635268 +size 67190464 diff --git a/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q5_K_S.gguf b/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q5_K_S.gguf new file mode 100644 index 0000000..7cf6cbd --- /dev/null +++ b/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q5_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:fb8553f72f5e423c12336598dca68f8b714d17cfa441d51d7d877e2313bfe50e +size 66459328 diff --git a/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q6_K.gguf b/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q6_K.gguf new file mode 100644 index 0000000..50fce79 --- /dev/null +++ b/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q6_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c95b48766ba18e93070613c82d4d92854baf81392e238010ac49c76578e8f051 +size 76316608 diff --git a/Falcon-H1-Tiny-90M-Instruct-safe.imatrix.gguf b/Falcon-H1-Tiny-90M-Instruct-safe.imatrix.gguf new file mode 100644 index 0000000..a7d3626 --- /dev/null +++ b/Falcon-H1-Tiny-90M-Instruct-safe.imatrix.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c5034f00f87a1619aad34641ba06c85241cd2febea19cbde186c10984ce35126 +size 524640 diff --git a/README.md b/README.md new file mode 100644 index 0000000..c31a32d --- /dev/null +++ b/README.md @@ -0,0 +1,90 @@ +--- +base_model: Fedir-Ilina/Falcon-H1-Tiny-90M-Instruct-safe +language: +- en +library_name: transformers +license: other +license_link: https://falconllm.tii.ae/falcon-terms-and-conditions.html +license_name: falcon-llm-license +mradermacher: + readme_rev: 1 +quantized_by: mradermacher +tags: +- falcon-h1 +- edge +--- +## About + + + + + + + + + +weighted/imatrix quants of 
https://huggingface.co/Fedir-Ilina/Falcon-H1-Tiny-90M-Instruct-safe + + + +***For a convenient overview and download list, visit our [model page for this model](https://hf.tst.eu/model#Falcon-H1-Tiny-90M-Instruct-safe-i1-GGUF).*** + +static quants are available at https://huggingface.co/mradermacher/Falcon-H1-Tiny-90M-Instruct-safe-GGUF +## Usage + +If you are unsure how to use GGUF files, refer to one of [TheBloke's +READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for +more details, including on how to concatenate multi-part files. + +## Provided Quants + +(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants) + +| Link | Type | Size/GB | Notes | +|:-----|:-----|--------:|:------| +| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-Tiny-90M-Instruct-safe-i1-GGUF/resolve/main/Falcon-H1-Tiny-90M-Instruct-safe.imatrix.gguf) | imatrix | 0.1 | imatrix file (for creating your own quants) | +| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-Tiny-90M-Instruct-safe-i1-GGUF/resolve/main/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ1_S.gguf) | i1-IQ1_S | 0.1 | for the desperate | +| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-Tiny-90M-Instruct-safe-i1-GGUF/resolve/main/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ1_M.gguf) | i1-IQ1_M | 0.1 | mostly desperate | +| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-Tiny-90M-Instruct-safe-i1-GGUF/resolve/main/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 0.1 | | +| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-Tiny-90M-Instruct-safe-i1-GGUF/resolve/main/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ2_XS.gguf) | i1-IQ2_XS | 0.1 | | +| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-Tiny-90M-Instruct-safe-i1-GGUF/resolve/main/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ2_S.gguf) | i1-IQ2_S | 0.1 | | +| 
[GGUF](https://huggingface.co/mradermacher/Falcon-H1-Tiny-90M-Instruct-safe-i1-GGUF/resolve/main/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ2_M.gguf) | i1-IQ2_M | 0.1 | | +| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-Tiny-90M-Instruct-safe-i1-GGUF/resolve/main/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q2_K_S.gguf) | i1-Q2_K_S | 0.1 | very low quality | +| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-Tiny-90M-Instruct-safe-i1-GGUF/resolve/main/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 0.1 | lower quality | +| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-Tiny-90M-Instruct-safe-i1-GGUF/resolve/main/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q2_K.gguf) | i1-Q2_K | 0.1 | IQ3_XXS probably better | +| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-Tiny-90M-Instruct-safe-i1-GGUF/resolve/main/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ3_XS.gguf) | i1-IQ3_XS | 0.1 | | +| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-Tiny-90M-Instruct-safe-i1-GGUF/resolve/main/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q3_K_S.gguf) | i1-Q3_K_S | 0.1 | IQ3_XS probably better | +| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-Tiny-90M-Instruct-safe-i1-GGUF/resolve/main/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ3_S.gguf) | i1-IQ3_S | 0.1 | beats Q3_K* | +| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-Tiny-90M-Instruct-safe-i1-GGUF/resolve/main/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ3_M.gguf) | i1-IQ3_M | 0.1 | | +| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-Tiny-90M-Instruct-safe-i1-GGUF/resolve/main/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q3_K_M.gguf) | i1-Q3_K_M | 0.1 | IQ3_S probably better | +| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-Tiny-90M-Instruct-safe-i1-GGUF/resolve/main/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q3_K_L.gguf) | i1-Q3_K_L | 0.2 | IQ3_M probably better | +| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-Tiny-90M-Instruct-safe-i1-GGUF/resolve/main/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ4_XS.gguf) | i1-IQ4_XS | 0.2 
| | +| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-Tiny-90M-Instruct-safe-i1-GGUF/resolve/main/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q4_0.gguf) | i1-Q4_0 | 0.2 | fast, low quality | +| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-Tiny-90M-Instruct-safe-i1-GGUF/resolve/main/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q4_K_S.gguf) | i1-Q4_K_S | 0.2 | optimal size/speed/quality | +| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-Tiny-90M-Instruct-safe-i1-GGUF/resolve/main/Falcon-H1-Tiny-90M-Instruct-safe.i1-IQ4_NL.gguf) | i1-IQ4_NL | 0.2 | prefer IQ4_XS | +| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-Tiny-90M-Instruct-safe-i1-GGUF/resolve/main/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q4_K_M.gguf) | i1-Q4_K_M | 0.2 | fast, recommended | +| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-Tiny-90M-Instruct-safe-i1-GGUF/resolve/main/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q4_1.gguf) | i1-Q4_1 | 0.2 | | +| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-Tiny-90M-Instruct-safe-i1-GGUF/resolve/main/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q5_K_S.gguf) | i1-Q5_K_S | 0.2 | | +| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-Tiny-90M-Instruct-safe-i1-GGUF/resolve/main/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q5_K_M.gguf) | i1-Q5_K_M | 0.2 | | +| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-Tiny-90M-Instruct-safe-i1-GGUF/resolve/main/Falcon-H1-Tiny-90M-Instruct-safe.i1-Q6_K.gguf) | i1-Q6_K | 0.2 | practically like static Q6_K | + +Here is a handy graph by ikawrakow comparing some lower-quality quant +types (lower is better): + +![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png) + +And here are Artefact2's thoughts on the matter: +https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9 + +## FAQ / Model Request + +See https://huggingface.co/mradermacher/model_requests for some answers to +questions you might have and/or if you want some other model quantized. 
+ +## Thanks + +I thank my company, [nethype GmbH](https://www.nethype.de/), for letting +me use its servers and providing upgrades to my workstation to enable +this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to. + +