初始化项目,由ModelHub XC社区提供模型
Model: mradermacher/LFM2-24B-A2B-CinderWatch-i1-GGUF Source: Original Platform
This commit is contained in:
59
.gitattributes
vendored
Normal file
59
.gitattributes
vendored
Normal file
@@ -0,0 +1,59 @@
|
|||||||
|
*.7z filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.arrow filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.bin filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.bz2 filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.ckpt filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.ftz filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.gz filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.h5 filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.joblib filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.lfs.* filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.mlmodel filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.model filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.msgpack filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.npy filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.npz filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.onnx filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.ot filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.parquet filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.pb filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.pickle filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.pkl filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.pt filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.pth filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.rar filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.safetensors filter=lfs diff=lfs merge=lfs -text
|
||||||
|
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.tar.* filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.tar filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.tflite filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.tgz filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.wasm filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.xz filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.zip filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.zst filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
||||||
|
LFM2-24B-A2B-CinderWatch.imatrix.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
LFM2-24B-A2B-CinderWatch.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
LFM2-24B-A2B-CinderWatch.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
LFM2-24B-A2B-CinderWatch.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
LFM2-24B-A2B-CinderWatch.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
LFM2-24B-A2B-CinderWatch.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
LFM2-24B-A2B-CinderWatch.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
LFM2-24B-A2B-CinderWatch.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
LFM2-24B-A2B-CinderWatch.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
LFM2-24B-A2B-CinderWatch.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
LFM2-24B-A2B-CinderWatch.i1-Q2_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
LFM2-24B-A2B-CinderWatch.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
LFM2-24B-A2B-CinderWatch.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
LFM2-24B-A2B-CinderWatch.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
LFM2-24B-A2B-CinderWatch.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
LFM2-24B-A2B-CinderWatch.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
LFM2-24B-A2B-CinderWatch.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
LFM2-24B-A2B-CinderWatch.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
LFM2-24B-A2B-CinderWatch.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
LFM2-24B-A2B-CinderWatch.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
LFM2-24B-A2B-CinderWatch.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
LFM2-24B-A2B-CinderWatch.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
LFM2-24B-A2B-CinderWatch.i1-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
LFM2-24B-A2B-CinderWatch.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
3
LFM2-24B-A2B-CinderWatch.i1-IQ1_M.gguf
Normal file
3
LFM2-24B-A2B-CinderWatch.i1-IQ1_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:094c8fe5ed62ff60f3d891c1e3495768c260b3e2bffd2ff864d3037c26b00415
|
||||||
|
size 5377872224
|
||||||
3
LFM2-24B-A2B-CinderWatch.i1-IQ1_S.gguf
Normal file
3
LFM2-24B-A2B-CinderWatch.i1-IQ1_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:b6390cd2f977ab6ee1842365b5ec8d76b7d392f9bfeef6f53a32c66063ae9de1
|
||||||
|
size 4838822240
|
||||||
3
LFM2-24B-A2B-CinderWatch.i1-IQ2_M.gguf
Normal file
3
LFM2-24B-A2B-CinderWatch.i1-IQ2_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:d727e681ed2529e491a5f66052b517e9b1a7e2c4b531369691ea8229e9b21931
|
||||||
|
size 7787204960
|
||||||
3
LFM2-24B-A2B-CinderWatch.i1-IQ2_S.gguf
Normal file
3
LFM2-24B-A2B-CinderWatch.i1-IQ2_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:8f1387b3823c71785981558ea7c13e1a425ff235bfb5b90bbeb72690a267801c
|
||||||
|
size 7068471648
|
||||||
3
LFM2-24B-A2B-CinderWatch.i1-IQ2_XS.gguf
Normal file
3
LFM2-24B-A2B-CinderWatch.i1-IQ2_XS.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:0025750d565182c324cdbbd1962b020af0ba7aef3dd2c22b4d481e1a455cbd09
|
||||||
|
size 6996332896
|
||||||
3
LFM2-24B-A2B-CinderWatch.i1-IQ2_XXS.gguf
Normal file
3
LFM2-24B-A2B-CinderWatch.i1-IQ2_XXS.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:75dc43d7d871125278f9257e0fcc472de4473c5f020ed781cb6a3b9bcef2c67c
|
||||||
|
size 6276288864
|
||||||
3
LFM2-24B-A2B-CinderWatch.i1-IQ3_M.gguf
Normal file
3
LFM2-24B-A2B-CinderWatch.i1-IQ3_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:dd414e9dc2d47bcf72200abf1d7a45bc2b4520752d75edbd3aa278d6cd842ddf
|
||||||
|
size 10412790112
|
||||||
3
LFM2-24B-A2B-CinderWatch.i1-IQ3_S.gguf
Normal file
3
LFM2-24B-A2B-CinderWatch.i1-IQ3_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:1a716e8d1912193fa1143717708c10264cf926f8f1c61224de11e1f2266ed5de
|
||||||
|
size 10319204704
|
||||||
3
LFM2-24B-A2B-CinderWatch.i1-IQ3_XS.gguf
Normal file
3
LFM2-24B-A2B-CinderWatch.i1-IQ3_XS.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:fe541fc886739d609e3a1a1501fd0c5d35ac12dd0ef82a91aadd5b0e8aac96c7
|
||||||
|
size 9750516064
|
||||||
3
LFM2-24B-A2B-CinderWatch.i1-IQ3_XXS.gguf
Normal file
3
LFM2-24B-A2B-CinderWatch.i1-IQ3_XXS.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:f84c6839f9893871e02d06136c38c832f0c5700fa98ab4e00640264864cacc0c
|
||||||
|
size 9188938080
|
||||||
3
LFM2-24B-A2B-CinderWatch.i1-IQ4_XS.gguf
Normal file
3
LFM2-24B-A2B-CinderWatch.i1-IQ4_XS.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:6ba62edd4c9799fcccdf338cfe12c6a42b99f7bdf7339f3d27fe80a28d211504
|
||||||
|
size 12726653280
|
||||||
3
LFM2-24B-A2B-CinderWatch.i1-Q2_K.gguf
Normal file
3
LFM2-24B-A2B-CinderWatch.i1-Q2_K.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:16ffddafc54dbfa2120a3920e20c3e5c25a0966b6a2e22ae0ad923846f7399f4
|
||||||
|
size 8698974560
|
||||||
3
LFM2-24B-A2B-CinderWatch.i1-Q2_K_S.gguf
Normal file
3
LFM2-24B-A2B-CinderWatch.i1-Q2_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:008f811b73c7d25ba7851041719d25fb0fdcb7b37433063b7f661cda1885b71b
|
||||||
|
size 8064618848
|
||||||
3
LFM2-24B-A2B-CinderWatch.i1-Q3_K_L.gguf
Normal file
3
LFM2-24B-A2B-CinderWatch.i1-Q3_K_L.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:fc4b7f7cf360a161e30365bb4f648e43c2013989e565e58bf1705163c37b9dda
|
||||||
|
size 12317528416
|
||||||
3
LFM2-24B-A2B-CinderWatch.i1-Q3_K_M.gguf
Normal file
3
LFM2-24B-A2B-CinderWatch.i1-Q3_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:010aa3bed217c94dc069a7903980f75c6573d047dd08a91faa2c7431a2b787d0
|
||||||
|
size 11354935648
|
||||||
3
LFM2-24B-A2B-CinderWatch.i1-Q3_K_S.gguf
Normal file
3
LFM2-24B-A2B-CinderWatch.i1-Q3_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:589f643eb9bf62dfa946729cf5aeac79d66f8d1e2da1484e918fc2d77e2c445a
|
||||||
|
size 10319204704
|
||||||
3
LFM2-24B-A2B-CinderWatch.i1-Q4_0.gguf
Normal file
3
LFM2-24B-A2B-CinderWatch.i1-Q4_0.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:5f299aea99a649b6723ba243d6ea22d1d6126adaaf5fe2312346f5a222ee96f4
|
||||||
|
size 13508170080
|
||||||
3
LFM2-24B-A2B-CinderWatch.i1-Q4_1.gguf
Normal file
3
LFM2-24B-A2B-CinderWatch.i1-Q4_1.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:07fb16e6ee93c33071ebf51a6d19d366ce53ade9170608a43cdbd5323eed4b20
|
||||||
|
size 14948913504
|
||||||
3
LFM2-24B-A2B-CinderWatch.i1-Q4_K_M.gguf
Normal file
3
LFM2-24B-A2B-CinderWatch.i1-Q4_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:e4bda1319ccc9cfa8357324b49d8ada065efc128241916e763feb2fec8e52e9c
|
||||||
|
size 14415475040
|
||||||
3
LFM2-24B-A2B-CinderWatch.i1-Q4_K_S.gguf
Normal file
3
LFM2-24B-A2B-CinderWatch.i1-Q4_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:5987a6c7cfac108f54263c0ac92ecdec5d1d256e9f5da9a36d0fd369b753da86
|
||||||
|
size 13549457760
|
||||||
3
LFM2-24B-A2B-CinderWatch.i1-Q5_K_M.gguf
Normal file
3
LFM2-24B-A2B-CinderWatch.i1-Q5_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:f74005dd32cbe22cf99ff01425741d75f9d1f307a57332ca31ad2a03e913c202
|
||||||
|
size 16918819168
|
||||||
3
LFM2-24B-A2B-CinderWatch.i1-Q5_K_S.gguf
Normal file
3
LFM2-24B-A2B-CinderWatch.i1-Q5_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:bdbfa5399a9a3cdd07f0574f6e03564b19207f9605bda6f3300632d2e1eecaa3
|
||||||
|
size 16430420320
|
||||||
3
LFM2-24B-A2B-CinderWatch.i1-Q6_K.gguf
Normal file
3
LFM2-24B-A2B-CinderWatch.i1-Q6_K.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:f933f6400578f7c9c98d4f942bbb98f93afe58c6b2469a1fe733a563d6d03feb
|
||||||
|
size 19578622304
|
||||||
3
LFM2-24B-A2B-CinderWatch.imatrix.gguf
Normal file
3
LFM2-24B-A2B-CinderWatch.imatrix.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:ddce64eef30f42ac140d8cd91b2592a07ddb988c13b59f2ff5a2529dcbe4a401
|
||||||
|
size 56115392
|
||||||
83
README.md
Normal file
83
README.md
Normal file
@@ -0,0 +1,83 @@
|
|||||||
|
---
|
||||||
|
base_model: blascotobasco/LFM2-24B-A2B-CinderWatch
|
||||||
|
language:
|
||||||
|
- en
|
||||||
|
library_name: transformers
|
||||||
|
mradermacher:
|
||||||
|
readme_rev: 1
|
||||||
|
quantized_by: mradermacher
|
||||||
|
---
|
||||||
|
## About
|
||||||
|
|
||||||
|
<!-- ### quantize_version: 2 -->
|
||||||
|
<!-- ### output_tensor_quantised: 1 -->
|
||||||
|
<!-- ### convert_type: hf -->
|
||||||
|
<!-- ### vocab_type: -->
|
||||||
|
<!-- ### tags: nicoboss -->
|
||||||
|
<!-- ### quants: Q2_K IQ3_M Q4_K_S IQ3_XXS Q3_K_M small-IQ4_NL Q4_K_M IQ2_M Q6_K IQ4_XS Q2_K_S IQ1_M Q3_K_S IQ2_XXS Q3_K_L IQ2_XS Q5_K_S IQ2_S IQ1_S Q5_K_M Q4_0 IQ3_XS Q4_1 IQ3_S -->
|
||||||
|
<!-- ### quants_skip: -->
|
||||||
|
<!-- ### skip_mmproj: -->
|
||||||
|
weighted/imatrix quants of https://huggingface.co/blascotobasco/LFM2-24B-A2B-CinderWatch
|
||||||
|
|
||||||
|
<!-- provided-files -->
|
||||||
|
|
||||||
|
***For a convenient overview and download list, visit our [model page for this model](https://hf.tst.eu/model#LFM2-24B-A2B-CinderWatch-i1-GGUF).***
|
||||||
|
|
||||||
|
static quants are available at https://huggingface.co/mradermacher/LFM2-24B-A2B-CinderWatch-GGUF
|
||||||
|
## Usage
|
||||||
|
|
||||||
|
If you are unsure how to use GGUF files, refer to one of [TheBloke's
|
||||||
|
READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for
|
||||||
|
more details, including on how to concatenate multi-part files.
|
||||||
|
|
||||||
|
## Provided Quants
|
||||||
|
|
||||||
|
(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)
|
||||||
|
|
||||||
|
| Link | Type | Size/GB | Notes |
|
||||||
|
|:-----|:-----|--------:|:------|
|
||||||
|
| [GGUF](https://huggingface.co/mradermacher/LFM2-24B-A2B-CinderWatch-i1-GGUF/resolve/main/LFM2-24B-A2B-CinderWatch.imatrix.gguf) | imatrix | 0.2 | imatrix file (for creating your own quants) |
|
||||||
|
| [GGUF](https://huggingface.co/mradermacher/LFM2-24B-A2B-CinderWatch-i1-GGUF/resolve/main/LFM2-24B-A2B-CinderWatch.i1-IQ1_S.gguf) | i1-IQ1_S | 4.9 | for the desperate |
|
||||||
|
| [GGUF](https://huggingface.co/mradermacher/LFM2-24B-A2B-CinderWatch-i1-GGUF/resolve/main/LFM2-24B-A2B-CinderWatch.i1-IQ1_M.gguf) | i1-IQ1_M | 5.5 | mostly desperate |
|
||||||
|
| [GGUF](https://huggingface.co/mradermacher/LFM2-24B-A2B-CinderWatch-i1-GGUF/resolve/main/LFM2-24B-A2B-CinderWatch.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 6.4 | |
|
||||||
|
| [GGUF](https://huggingface.co/mradermacher/LFM2-24B-A2B-CinderWatch-i1-GGUF/resolve/main/LFM2-24B-A2B-CinderWatch.i1-IQ2_XS.gguf) | i1-IQ2_XS | 7.1 | |
|
||||||
|
| [GGUF](https://huggingface.co/mradermacher/LFM2-24B-A2B-CinderWatch-i1-GGUF/resolve/main/LFM2-24B-A2B-CinderWatch.i1-IQ2_S.gguf) | i1-IQ2_S | 7.2 | |
|
||||||
|
| [GGUF](https://huggingface.co/mradermacher/LFM2-24B-A2B-CinderWatch-i1-GGUF/resolve/main/LFM2-24B-A2B-CinderWatch.i1-IQ2_M.gguf) | i1-IQ2_M | 7.9 | |
|
||||||
|
| [GGUF](https://huggingface.co/mradermacher/LFM2-24B-A2B-CinderWatch-i1-GGUF/resolve/main/LFM2-24B-A2B-CinderWatch.i1-Q2_K_S.gguf) | i1-Q2_K_S | 8.2 | very low quality |
|
||||||
|
| [GGUF](https://huggingface.co/mradermacher/LFM2-24B-A2B-CinderWatch-i1-GGUF/resolve/main/LFM2-24B-A2B-CinderWatch.i1-Q2_K.gguf) | i1-Q2_K | 8.8 | IQ3_XXS probably better |
|
||||||
|
| [GGUF](https://huggingface.co/mradermacher/LFM2-24B-A2B-CinderWatch-i1-GGUF/resolve/main/LFM2-24B-A2B-CinderWatch.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 9.3 | lower quality |
|
||||||
|
| [GGUF](https://huggingface.co/mradermacher/LFM2-24B-A2B-CinderWatch-i1-GGUF/resolve/main/LFM2-24B-A2B-CinderWatch.i1-IQ3_XS.gguf) | i1-IQ3_XS | 9.9 | |
|
||||||
|
| [GGUF](https://huggingface.co/mradermacher/LFM2-24B-A2B-CinderWatch-i1-GGUF/resolve/main/LFM2-24B-A2B-CinderWatch.i1-IQ3_S.gguf) | i1-IQ3_S | 10.4 | beats Q3_K* |
|
||||||
|
| [GGUF](https://huggingface.co/mradermacher/LFM2-24B-A2B-CinderWatch-i1-GGUF/resolve/main/LFM2-24B-A2B-CinderWatch.i1-Q3_K_S.gguf) | i1-Q3_K_S | 10.4 | IQ3_XS probably better |
|
||||||
|
| [GGUF](https://huggingface.co/mradermacher/LFM2-24B-A2B-CinderWatch-i1-GGUF/resolve/main/LFM2-24B-A2B-CinderWatch.i1-IQ3_M.gguf) | i1-IQ3_M | 10.5 | |
|
||||||
|
| [GGUF](https://huggingface.co/mradermacher/LFM2-24B-A2B-CinderWatch-i1-GGUF/resolve/main/LFM2-24B-A2B-CinderWatch.i1-Q3_K_M.gguf) | i1-Q3_K_M | 11.5 | IQ3_S probably better |
|
||||||
|
| [GGUF](https://huggingface.co/mradermacher/LFM2-24B-A2B-CinderWatch-i1-GGUF/resolve/main/LFM2-24B-A2B-CinderWatch.i1-Q3_K_L.gguf) | i1-Q3_K_L | 12.4 | IQ3_M probably better |
|
||||||
|
| [GGUF](https://huggingface.co/mradermacher/LFM2-24B-A2B-CinderWatch-i1-GGUF/resolve/main/LFM2-24B-A2B-CinderWatch.i1-IQ4_XS.gguf) | i1-IQ4_XS | 12.8 | |
|
||||||
|
| [GGUF](https://huggingface.co/mradermacher/LFM2-24B-A2B-CinderWatch-i1-GGUF/resolve/main/LFM2-24B-A2B-CinderWatch.i1-Q4_0.gguf) | i1-Q4_0 | 13.6 | fast, low quality |
|
||||||
|
| [GGUF](https://huggingface.co/mradermacher/LFM2-24B-A2B-CinderWatch-i1-GGUF/resolve/main/LFM2-24B-A2B-CinderWatch.i1-Q4_K_S.gguf) | i1-Q4_K_S | 13.6 | optimal size/speed/quality |
|
||||||
|
| [GGUF](https://huggingface.co/mradermacher/LFM2-24B-A2B-CinderWatch-i1-GGUF/resolve/main/LFM2-24B-A2B-CinderWatch.i1-Q4_K_M.gguf) | i1-Q4_K_M | 14.5 | fast, recommended |
|
||||||
|
| [GGUF](https://huggingface.co/mradermacher/LFM2-24B-A2B-CinderWatch-i1-GGUF/resolve/main/LFM2-24B-A2B-CinderWatch.i1-Q4_1.gguf) | i1-Q4_1 | 15.0 | |
|
||||||
|
| [GGUF](https://huggingface.co/mradermacher/LFM2-24B-A2B-CinderWatch-i1-GGUF/resolve/main/LFM2-24B-A2B-CinderWatch.i1-Q5_K_S.gguf) | i1-Q5_K_S | 16.5 | |
|
||||||
|
| [GGUF](https://huggingface.co/mradermacher/LFM2-24B-A2B-CinderWatch-i1-GGUF/resolve/main/LFM2-24B-A2B-CinderWatch.i1-Q5_K_M.gguf) | i1-Q5_K_M | 17.0 | |
|
||||||
|
| [GGUF](https://huggingface.co/mradermacher/LFM2-24B-A2B-CinderWatch-i1-GGUF/resolve/main/LFM2-24B-A2B-CinderWatch.i1-Q6_K.gguf) | i1-Q6_K | 19.7 | practically like static Q6_K |
|
||||||
|
|
||||||
|
Here is a handy graph by ikawrakow comparing some lower-quality quant
|
||||||
|
types (lower is better):
|
||||||
|
|
||||||
|

|
||||||
|
|
||||||
|
And here are Artefact2's thoughts on the matter:
|
||||||
|
https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9
|
||||||
|
|
||||||
|
## FAQ / Model Request
|
||||||
|
|
||||||
|
See https://huggingface.co/mradermacher/model_requests for some answers to
|
||||||
|
questions you might have and/or if you want some other model quantized.
|
||||||
|
|
||||||
|
## Thanks
|
||||||
|
|
||||||
|
I thank my company, [nethype GmbH](https://www.nethype.de/), for letting
|
||||||
|
me use its servers and providing upgrades to my workstation to enable
|
||||||
|
this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to.
|
||||||
|
|
||||||
|
<!-- end -->
|
||||||
Reference in New Issue
Block a user