初始化项目,由ModelHub XC社区提供模型
Model: mradermacher/Falcon-H1-0.5B-Instruct-heretic-i1-GGUF Source: Original Platform
This commit is contained in:
60
.gitattributes
vendored
Normal file
60
.gitattributes
vendored
Normal file
@@ -0,0 +1,60 @@
|
||||
*.7z filter=lfs diff=lfs merge=lfs -text
|
||||
*.arrow filter=lfs diff=lfs merge=lfs -text
|
||||
*.bin filter=lfs diff=lfs merge=lfs -text
|
||||
*.bz2 filter=lfs diff=lfs merge=lfs -text
|
||||
*.ckpt filter=lfs diff=lfs merge=lfs -text
|
||||
*.ftz filter=lfs diff=lfs merge=lfs -text
|
||||
*.gz filter=lfs diff=lfs merge=lfs -text
|
||||
*.h5 filter=lfs diff=lfs merge=lfs -text
|
||||
*.joblib filter=lfs diff=lfs merge=lfs -text
|
||||
*.lfs.* filter=lfs diff=lfs merge=lfs -text
|
||||
*.mlmodel filter=lfs diff=lfs merge=lfs -text
|
||||
*.model filter=lfs diff=lfs merge=lfs -text
|
||||
*.msgpack filter=lfs diff=lfs merge=lfs -text
|
||||
*.npy filter=lfs diff=lfs merge=lfs -text
|
||||
*.npz filter=lfs diff=lfs merge=lfs -text
|
||||
*.onnx filter=lfs diff=lfs merge=lfs -text
|
||||
*.ot filter=lfs diff=lfs merge=lfs -text
|
||||
*.parquet filter=lfs diff=lfs merge=lfs -text
|
||||
*.pb filter=lfs diff=lfs merge=lfs -text
|
||||
*.pickle filter=lfs diff=lfs merge=lfs -text
|
||||
*.pkl filter=lfs diff=lfs merge=lfs -text
|
||||
*.pt filter=lfs diff=lfs merge=lfs -text
|
||||
*.pth filter=lfs diff=lfs merge=lfs -text
|
||||
*.rar filter=lfs diff=lfs merge=lfs -text
|
||||
*.safetensors filter=lfs diff=lfs merge=lfs -text
|
||||
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
||||
*.tar.* filter=lfs diff=lfs merge=lfs -text
|
||||
*.tar filter=lfs diff=lfs merge=lfs -text
|
||||
*.tflite filter=lfs diff=lfs merge=lfs -text
|
||||
*.tgz filter=lfs diff=lfs merge=lfs -text
|
||||
*.wasm filter=lfs diff=lfs merge=lfs -text
|
||||
*.xz filter=lfs diff=lfs merge=lfs -text
|
||||
*.zip filter=lfs diff=lfs merge=lfs -text
|
||||
*.zst filter=lfs diff=lfs merge=lfs -text
|
||||
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
||||
Falcon-H1-0.5B-Instruct-heretic.imatrix.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Falcon-H1-0.5B-Instruct-heretic.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Falcon-H1-0.5B-Instruct-heretic.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Falcon-H1-0.5B-Instruct-heretic.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Falcon-H1-0.5B-Instruct-heretic.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Falcon-H1-0.5B-Instruct-heretic.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Falcon-H1-0.5B-Instruct-heretic.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Falcon-H1-0.5B-Instruct-heretic.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Falcon-H1-0.5B-Instruct-heretic.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Falcon-H1-0.5B-Instruct-heretic.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Falcon-H1-0.5B-Instruct-heretic.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Falcon-H1-0.5B-Instruct-heretic.i1-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Falcon-H1-0.5B-Instruct-heretic.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Falcon-H1-0.5B-Instruct-heretic.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Falcon-H1-0.5B-Instruct-heretic.i1-Q2_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Falcon-H1-0.5B-Instruct-heretic.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Falcon-H1-0.5B-Instruct-heretic.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Falcon-H1-0.5B-Instruct-heretic.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Falcon-H1-0.5B-Instruct-heretic.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Falcon-H1-0.5B-Instruct-heretic.i1-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Falcon-H1-0.5B-Instruct-heretic.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Falcon-H1-0.5B-Instruct-heretic.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Falcon-H1-0.5B-Instruct-heretic.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Falcon-H1-0.5B-Instruct-heretic.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
Falcon-H1-0.5B-Instruct-heretic.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
3
Falcon-H1-0.5B-Instruct-heretic.i1-IQ1_M.gguf
Normal file
3
Falcon-H1-0.5B-Instruct-heretic.i1-IQ1_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:9acb645b1d34d0dd3a006dcd89709766b3faf94f650b9276726f42608c37e1da
|
||||
size 139399520
|
||||
3
Falcon-H1-0.5B-Instruct-heretic.i1-IQ1_S.gguf
Normal file
3
Falcon-H1-0.5B-Instruct-heretic.i1-IQ1_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:c95057a9a90b51c4b6bd8532b5ecc566242418f53005ce9f8abd1cfd23ff7c93
|
||||
size 129511520
|
||||
3
Falcon-H1-0.5B-Instruct-heretic.i1-IQ2_M.gguf
Normal file
3
Falcon-H1-0.5B-Instruct-heretic.i1-IQ2_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:4dbe234fdfa4753549a1b57791e21add408eb038d848e7002f86a5043a546cc3
|
||||
size 189753056
|
||||
3
Falcon-H1-0.5B-Instruct-heretic.i1-IQ2_S.gguf
Normal file
3
Falcon-H1-0.5B-Instruct-heretic.i1-IQ2_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:b7036029aee8671924e898ff699aafe1d5bc139dc1eb4958f66506f8510fbc53
|
||||
size 176569056
|
||||
3
Falcon-H1-0.5B-Instruct-heretic.i1-IQ2_XS.gguf
Normal file
3
Falcon-H1-0.5B-Instruct-heretic.i1-IQ2_XS.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:f8168b1a2ae2af834072b54c2fcdbbd95d3d4c69bfac74bec37f7bdbc096ad5b
|
||||
size 169653344
|
||||
3
Falcon-H1-0.5B-Instruct-heretic.i1-IQ2_XXS.gguf
Normal file
3
Falcon-H1-0.5B-Instruct-heretic.i1-IQ2_XXS.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:57935e88af9f3b9773da2e1abea50188c1616bfa0b6505dcd9e3e89ce411bc6c
|
||||
size 155879520
|
||||
3
Falcon-H1-0.5B-Instruct-heretic.i1-IQ3_M.gguf
Normal file
3
Falcon-H1-0.5B-Instruct-heretic.i1-IQ3_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:c46671a9d3a43d8720bd6544c100160101cff1005732d9383f311463b993f07e
|
||||
size 243976544
|
||||
3
Falcon-H1-0.5B-Instruct-heretic.i1-IQ3_S.gguf
Normal file
3
Falcon-H1-0.5B-Instruct-heretic.i1-IQ3_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:0e6b179fc2df93589b9c1b2c7f625f0e9115c01db6e17f88b7ee4b5691bc6476
|
||||
size 240355680
|
||||
3
Falcon-H1-0.5B-Instruct-heretic.i1-IQ3_XS.gguf
Normal file
3
Falcon-H1-0.5B-Instruct-heretic.i1-IQ3_XS.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:cae29330f250d77ef7600a911cf9e679aa33b680eae863e6956b1df4c0b5c327
|
||||
size 233941344
|
||||
3
Falcon-H1-0.5B-Instruct-heretic.i1-IQ3_XXS.gguf
Normal file
3
Falcon-H1-0.5B-Instruct-heretic.i1-IQ3_XXS.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:5589c261647cef1efdfad82e0f9baa998163a3383a5f810f0308de16b7c344fc
|
||||
size 214253280
|
||||
3
Falcon-H1-0.5B-Instruct-heretic.i1-IQ4_NL.gguf
Normal file
3
Falcon-H1-0.5B-Instruct-heretic.i1-IQ4_NL.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:d5e99bba6e1ec13a79d8e03e3c03b8aab0203051b6cfac5e679b95e7c06d429b
|
||||
size 305056992
|
||||
3
Falcon-H1-0.5B-Instruct-heretic.i1-IQ4_XS.gguf
Normal file
3
Falcon-H1-0.5B-Instruct-heretic.i1-IQ4_XS.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:dae9ddd3dc5a2a9967345172020fb7c961e4e47f635666736b0f844c2933b939
|
||||
size 289971936
|
||||
3
Falcon-H1-0.5B-Instruct-heretic.i1-Q2_K.gguf
Normal file
3
Falcon-H1-0.5B-Instruct-heretic.i1-Q2_K.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:67e89d2d85d1876ab5456487f82bdd3945b61901d13dc8007ab61b15e9bcc10e
|
||||
size 200913888
|
||||
3
Falcon-H1-0.5B-Instruct-heretic.i1-Q2_K_S.gguf
Normal file
3
Falcon-H1-0.5B-Instruct-heretic.i1-Q2_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:b615b0039433aab79669d40227295e249d7c4639cf92cadc2332c9219ecdaae3
|
||||
size 193295328
|
||||
3
Falcon-H1-0.5B-Instruct-heretic.i1-Q3_K_L.gguf
Normal file
3
Falcon-H1-0.5B-Instruct-heretic.i1-Q3_K_L.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:dfc343f3dd2517eaf3bfd1e5e4c2643b8512db4bb940e8908f1df13afd3dc287
|
||||
size 265275744
|
||||
3
Falcon-H1-0.5B-Instruct-heretic.i1-Q3_K_M.gguf
Normal file
3
Falcon-H1-0.5B-Instruct-heretic.i1-Q3_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:dc53fd7e5f9c9ca99fb7eb61209b2f4bacaee33c24c06d307a40a23b7714fddd
|
||||
size 253446496
|
||||
3
Falcon-H1-0.5B-Instruct-heretic.i1-Q3_K_S.gguf
Normal file
3
Falcon-H1-0.5B-Instruct-heretic.i1-Q3_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:f5d58a4b95872ebf30a6e13bc250a1d7f8ba85807860e65884005495f9af48af
|
||||
size 239728992
|
||||
3
Falcon-H1-0.5B-Instruct-heretic.i1-Q4_0.gguf
Normal file
3
Falcon-H1-0.5B-Instruct-heretic.i1-Q4_0.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:dde5a6ae6e677818496c39646bc276686fbe7528037780c2ad81aa6f24d3ab0b
|
||||
size 304991456
|
||||
3
Falcon-H1-0.5B-Instruct-heretic.i1-Q4_1.gguf
Normal file
3
Falcon-H1-0.5B-Instruct-heretic.i1-Q4_1.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:e8c264dca0c63ab3b2620fb629c3ae758ce3638bc28da7d5718e8c20275e64f0
|
||||
size 334932192
|
||||
3
Falcon-H1-0.5B-Instruct-heretic.i1-Q4_K_M.gguf
Normal file
3
Falcon-H1-0.5B-Instruct-heretic.i1-Q4_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:e930a9c209d7aced9be1a63a81572f7da4e8f114a768f6d079d10d3cace00195
|
||||
size 314807520
|
||||
3
Falcon-H1-0.5B-Instruct-heretic.i1-Q4_K_S.gguf
Normal file
3
Falcon-H1-0.5B-Instruct-heretic.i1-Q4_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:bdd36724b0deb2e1169b33a36a8012394296a769542e96db9fac9b13521565ef
|
||||
size 305581280
|
||||
3
Falcon-H1-0.5B-Instruct-heretic.i1-Q5_K_M.gguf
Normal file
3
Falcon-H1-0.5B-Instruct-heretic.i1-Q5_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:461346043a6c3780d6226f0f383cf3a582f0fa692ee6582a1eaa8ce6cf656e9d
|
||||
size 370724064
|
||||
3
Falcon-H1-0.5B-Instruct-heretic.i1-Q5_K_S.gguf
Normal file
3
Falcon-H1-0.5B-Instruct-heretic.i1-Q5_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:77b3ead6bdc4665dfbe9f0a66ba31339d8d73a4d61f64da1a1c87fa62a4afd89
|
||||
size 365397216
|
||||
3
Falcon-H1-0.5B-Instruct-heretic.i1-Q6_K.gguf
Normal file
3
Falcon-H1-0.5B-Instruct-heretic.i1-Q6_K.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:22a0f5c4e1a5eb60ed99327c86d5b43f74cae49be128de34b02f80954d58f0f6
|
||||
size 430135392
|
||||
3
Falcon-H1-0.5B-Instruct-heretic.imatrix.gguf
Normal file
3
Falcon-H1-0.5B-Instruct-heretic.imatrix.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:3d385082de0ad4692225f4e1f787d006adf5b23c6655cd873681512485b67166
|
||||
size 1524256
|
||||
93
README.md
Normal file
93
README.md
Normal file
@@ -0,0 +1,93 @@
|
||||
---
|
||||
base_model: megabytes/Falcon-H1-0.5B-Instruct-heretic
|
||||
language:
|
||||
- en
|
||||
library_name: transformers
|
||||
license: other
|
||||
license_link: https://falconllm.tii.ae/falcon-terms-and-conditions.html
|
||||
license_name: falcon-llm-license
|
||||
mradermacher:
|
||||
readme_rev: 1
|
||||
quantized_by: mradermacher
|
||||
tags:
|
||||
- falcon-h1
|
||||
- heretic
|
||||
- uncensored
|
||||
- decensored
|
||||
- abliterated
|
||||
---
|
||||
## About
|
||||
|
||||
<!-- ### quantize_version: 2 -->
|
||||
<!-- ### output_tensor_quantised: 1 -->
|
||||
<!-- ### convert_type: hf -->
|
||||
<!-- ### vocab_type: -->
|
||||
<!-- ### tags: nicoboss -->
|
||||
<!-- ### quants: Q2_K IQ3_M Q4_K_S IQ3_XXS Q3_K_M small-IQ4_NL Q4_K_M IQ2_M Q6_K IQ4_XS Q2_K_S IQ1_M Q3_K_S IQ2_XXS Q3_K_L IQ2_XS Q5_K_S IQ2_S IQ1_S Q5_K_M Q4_0 IQ3_XS Q4_1 IQ3_S -->
|
||||
<!-- ### quants_skip: -->
|
||||
<!-- ### skip_mmproj: -->
|
||||
weighted/imatrix quants of https://huggingface.co/megabytes/Falcon-H1-0.5B-Instruct-heretic
|
||||
|
||||
<!-- provided-files -->
|
||||
|
||||
***For a convenient overview and download list, visit our [model page for this model](https://hf.tst.eu/model#Falcon-H1-0.5B-Instruct-heretic-i1-GGUF).***
|
||||
|
||||
static quants are available at https://huggingface.co/mradermacher/Falcon-H1-0.5B-Instruct-heretic-GGUF
|
||||
## Usage
|
||||
|
||||
If you are unsure how to use GGUF files, refer to one of [TheBloke's
|
||||
READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for
|
||||
more details, including on how to concatenate multi-part files.
|
||||
|
||||
## Provided Quants
|
||||
|
||||
(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)
|
||||
|
||||
| Link | Type | Size/GB | Notes |
|
||||
|:-----|:-----|--------:|:------|
|
||||
| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-0.5B-Instruct-heretic-i1-GGUF/resolve/main/Falcon-H1-0.5B-Instruct-heretic.imatrix.gguf) | imatrix | 0.1 | imatrix file (for creating your own quants) |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-0.5B-Instruct-heretic-i1-GGUF/resolve/main/Falcon-H1-0.5B-Instruct-heretic.i1-IQ1_S.gguf) | i1-IQ1_S | 0.2 | for the desperate |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-0.5B-Instruct-heretic-i1-GGUF/resolve/main/Falcon-H1-0.5B-Instruct-heretic.i1-IQ1_M.gguf) | i1-IQ1_M | 0.2 | mostly desperate |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-0.5B-Instruct-heretic-i1-GGUF/resolve/main/Falcon-H1-0.5B-Instruct-heretic.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 0.3 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-0.5B-Instruct-heretic-i1-GGUF/resolve/main/Falcon-H1-0.5B-Instruct-heretic.i1-IQ2_XS.gguf) | i1-IQ2_XS | 0.3 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-0.5B-Instruct-heretic-i1-GGUF/resolve/main/Falcon-H1-0.5B-Instruct-heretic.i1-IQ2_S.gguf) | i1-IQ2_S | 0.3 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-0.5B-Instruct-heretic-i1-GGUF/resolve/main/Falcon-H1-0.5B-Instruct-heretic.i1-IQ2_M.gguf) | i1-IQ2_M | 0.3 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-0.5B-Instruct-heretic-i1-GGUF/resolve/main/Falcon-H1-0.5B-Instruct-heretic.i1-Q2_K_S.gguf) | i1-Q2_K_S | 0.3 | very low quality |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-0.5B-Instruct-heretic-i1-GGUF/resolve/main/Falcon-H1-0.5B-Instruct-heretic.i1-Q2_K.gguf) | i1-Q2_K | 0.3 | IQ3_XXS probably better |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-0.5B-Instruct-heretic-i1-GGUF/resolve/main/Falcon-H1-0.5B-Instruct-heretic.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 0.3 | lower quality |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-0.5B-Instruct-heretic-i1-GGUF/resolve/main/Falcon-H1-0.5B-Instruct-heretic.i1-IQ3_XS.gguf) | i1-IQ3_XS | 0.3 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-0.5B-Instruct-heretic-i1-GGUF/resolve/main/Falcon-H1-0.5B-Instruct-heretic.i1-Q3_K_S.gguf) | i1-Q3_K_S | 0.3 | IQ3_XS probably better |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-0.5B-Instruct-heretic-i1-GGUF/resolve/main/Falcon-H1-0.5B-Instruct-heretic.i1-IQ3_S.gguf) | i1-IQ3_S | 0.3 | beats Q3_K* |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-0.5B-Instruct-heretic-i1-GGUF/resolve/main/Falcon-H1-0.5B-Instruct-heretic.i1-IQ3_M.gguf) | i1-IQ3_M | 0.3 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-0.5B-Instruct-heretic-i1-GGUF/resolve/main/Falcon-H1-0.5B-Instruct-heretic.i1-Q3_K_M.gguf) | i1-Q3_K_M | 0.4 | IQ3_S probably better |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-0.5B-Instruct-heretic-i1-GGUF/resolve/main/Falcon-H1-0.5B-Instruct-heretic.i1-Q3_K_L.gguf) | i1-Q3_K_L | 0.4 | IQ3_M probably better |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-0.5B-Instruct-heretic-i1-GGUF/resolve/main/Falcon-H1-0.5B-Instruct-heretic.i1-IQ4_XS.gguf) | i1-IQ4_XS | 0.4 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-0.5B-Instruct-heretic-i1-GGUF/resolve/main/Falcon-H1-0.5B-Instruct-heretic.i1-Q4_0.gguf) | i1-Q4_0 | 0.4 | fast, low quality |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-0.5B-Instruct-heretic-i1-GGUF/resolve/main/Falcon-H1-0.5B-Instruct-heretic.i1-IQ4_NL.gguf) | i1-IQ4_NL | 0.4 | prefer IQ4_XS |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-0.5B-Instruct-heretic-i1-GGUF/resolve/main/Falcon-H1-0.5B-Instruct-heretic.i1-Q4_K_S.gguf) | i1-Q4_K_S | 0.4 | optimal size/speed/quality |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-0.5B-Instruct-heretic-i1-GGUF/resolve/main/Falcon-H1-0.5B-Instruct-heretic.i1-Q4_K_M.gguf) | i1-Q4_K_M | 0.4 | fast, recommended |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-0.5B-Instruct-heretic-i1-GGUF/resolve/main/Falcon-H1-0.5B-Instruct-heretic.i1-Q4_1.gguf) | i1-Q4_1 | 0.4 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-0.5B-Instruct-heretic-i1-GGUF/resolve/main/Falcon-H1-0.5B-Instruct-heretic.i1-Q5_K_S.gguf) | i1-Q5_K_S | 0.5 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-0.5B-Instruct-heretic-i1-GGUF/resolve/main/Falcon-H1-0.5B-Instruct-heretic.i1-Q5_K_M.gguf) | i1-Q5_K_M | 0.5 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/Falcon-H1-0.5B-Instruct-heretic-i1-GGUF/resolve/main/Falcon-H1-0.5B-Instruct-heretic.i1-Q6_K.gguf) | i1-Q6_K | 0.5 | practically like static Q6_K |
|
||||
|
||||
Here is a handy graph by ikawrakow comparing some lower-quality quant
|
||||
types (lower is better):
|
||||
|
||||

|
||||
|
||||
And here are Artefact2's thoughts on the matter:
|
||||
https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9
|
||||
|
||||
## FAQ / Model Request
|
||||
|
||||
See https://huggingface.co/mradermacher/model_requests for some answers to
|
||||
questions you might have and/or if you want some other model quantized.
|
||||
|
||||
## Thanks
|
||||
|
||||
I thank my company, [nethype GmbH](https://www.nethype.de/), for letting
|
||||
me use its servers and providing upgrades to my workstation to enable
|
||||
this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to.
|
||||
|
||||
<!-- end -->
|
||||
Reference in New Issue
Block a user