commit 18fa4381fbca300a480b4023377126e06714c065 Author: ModelHub XC Date: Thu May 7 16:04:00 2026 +0800 初始化项目,由ModelHub XC社区提供模型 Model: mradermacher/DeciLM-7B-instruct-i1-GGUF Source: Original Platform diff --git a/.gitattributes b/.gitattributes new file mode 100644 index 0000000..a5df0fc --- /dev/null +++ b/.gitattributes @@ -0,0 +1,60 @@ +*.7z filter=lfs diff=lfs merge=lfs -text +*.arrow filter=lfs diff=lfs merge=lfs -text +*.bin filter=lfs diff=lfs merge=lfs -text +*.bz2 filter=lfs diff=lfs merge=lfs -text +*.ckpt filter=lfs diff=lfs merge=lfs -text +*.ftz filter=lfs diff=lfs merge=lfs -text +*.gz filter=lfs diff=lfs merge=lfs -text +*.h5 filter=lfs diff=lfs merge=lfs -text +*.joblib filter=lfs diff=lfs merge=lfs -text +*.lfs.* filter=lfs diff=lfs merge=lfs -text +*.mlmodel filter=lfs diff=lfs merge=lfs -text +*.model filter=lfs diff=lfs merge=lfs -text +*.msgpack filter=lfs diff=lfs merge=lfs -text +*.npy filter=lfs diff=lfs merge=lfs -text +*.npz filter=lfs diff=lfs merge=lfs -text +*.onnx filter=lfs diff=lfs merge=lfs -text +*.ot filter=lfs diff=lfs merge=lfs -text +*.parquet filter=lfs diff=lfs merge=lfs -text +*.pb filter=lfs diff=lfs merge=lfs -text +*.pickle filter=lfs diff=lfs merge=lfs -text +*.pkl filter=lfs diff=lfs merge=lfs -text +*.pt filter=lfs diff=lfs merge=lfs -text +*.pth filter=lfs diff=lfs merge=lfs -text +*.rar filter=lfs diff=lfs merge=lfs -text +*.safetensors filter=lfs diff=lfs merge=lfs -text +saved_model/**/* filter=lfs diff=lfs merge=lfs -text +*.tar.* filter=lfs diff=lfs merge=lfs -text +*.tar filter=lfs diff=lfs merge=lfs -text +*.tflite filter=lfs diff=lfs merge=lfs -text +*.tgz filter=lfs diff=lfs merge=lfs -text +*.wasm filter=lfs diff=lfs merge=lfs -text +*.xz filter=lfs diff=lfs merge=lfs -text +*.zip filter=lfs diff=lfs merge=lfs -text +*.zst filter=lfs diff=lfs merge=lfs -text +*tfevents* filter=lfs diff=lfs merge=lfs -text +imatrix.dat filter=lfs diff=lfs merge=lfs -text +DeciLM-7B-instruct.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text +DeciLM-7B-instruct.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text +DeciLM-7B-instruct.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text +DeciLM-7B-instruct.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text +DeciLM-7B-instruct.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text +DeciLM-7B-instruct.i1-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text +DeciLM-7B-instruct.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text +DeciLM-7B-instruct.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text +DeciLM-7B-instruct.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text +DeciLM-7B-instruct.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text +DeciLM-7B-instruct.i1-Q2_K_S.gguf filter=lfs diff=lfs merge=lfs -text +DeciLM-7B-instruct.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text +DeciLM-7B-instruct.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text +DeciLM-7B-instruct.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text +DeciLM-7B-instruct.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text +DeciLM-7B-instruct.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text +DeciLM-7B-instruct.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text +DeciLM-7B-instruct.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text +DeciLM-7B-instruct.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text +DeciLM-7B-instruct.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text +DeciLM-7B-instruct.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text +DeciLM-7B-instruct.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text +DeciLM-7B-instruct.i1-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text +DeciLM-7B-instruct.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text diff --git a/DeciLM-7B-instruct.i1-IQ1_M.gguf b/DeciLM-7B-instruct.i1-IQ1_M.gguf new file mode 100644 index 0000000..f0d8a84 --- /dev/null +++ b/DeciLM-7B-instruct.i1-IQ1_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:7870e934968c92cd243facf9c63fadd06b6879a27938be6e3510fe8201010d4c +size 1677032800 diff --git a/DeciLM-7B-instruct.i1-IQ1_S.gguf b/DeciLM-7B-instruct.i1-IQ1_S.gguf new file mode 100644 index 0000000..5ad4d9c --- /dev/null +++ b/DeciLM-7B-instruct.i1-IQ1_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:fbf8300e70e94501cdfe980fa8054e01984b8b4310b13d4445f5d4213acf732b +size 1537011040 diff --git a/DeciLM-7B-instruct.i1-IQ2_M.gguf b/DeciLM-7B-instruct.i1-IQ2_M.gguf new file mode 100644 index 0000000..0354235 --- /dev/null +++ b/DeciLM-7B-instruct.i1-IQ2_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:dc0d7e1fffc2b78c29a14ef884fe0446f1fce2a330d1d597ac39e6f6f9b2365b +size 2413235552 diff --git a/DeciLM-7B-instruct.i1-IQ2_S.gguf b/DeciLM-7B-instruct.i1-IQ2_S.gguf new file mode 100644 index 0000000..40bd7fd --- /dev/null +++ b/DeciLM-7B-instruct.i1-IQ2_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:25e0010bafe903ecd508c03291827e23fdbd4cef28191a875efa516dd0c6f15c +size 2226539872 diff --git a/DeciLM-7B-instruct.i1-IQ2_XS.gguf b/DeciLM-7B-instruct.i1-IQ2_XS.gguf new file mode 100644 index 0000000..0f8e144 --- /dev/null +++ b/DeciLM-7B-instruct.i1-IQ2_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8b858d7e1082bb058b2335e1b53a13ef83e53596efe8a5398e8c98f82025d0ed +size 2113875296 diff --git a/DeciLM-7B-instruct.i1-IQ2_XXS.gguf b/DeciLM-7B-instruct.i1-IQ2_XXS.gguf new file mode 100644 index 0000000..11bf608 --- /dev/null +++ b/DeciLM-7B-instruct.i1-IQ2_XXS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:cab5f9bb6574b339c21dbca7e3b682faf2f06121b3b8678611ed34e0236f6fcf +size 1910402400 diff --git a/DeciLM-7B-instruct.i1-IQ3_M.gguf b/DeciLM-7B-instruct.i1-IQ3_M.gguf new file mode 100644 index 0000000..2ce0a66 --- /dev/null +++ b/DeciLM-7B-instruct.i1-IQ3_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:9d47dd8c774463f4fda01d0e91cfb49bff36267c9d7a556881c970eb30c48300 +size 3186576736 diff --git a/DeciLM-7B-instruct.i1-IQ3_S.gguf b/DeciLM-7B-instruct.i1-IQ3_S.gguf new file mode 100644 index 0000000..495bdf0 --- /dev/null +++ b/DeciLM-7B-instruct.i1-IQ3_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:1719175ffc60956a4a7e7ca21b09f468ae07ad13a1b1a5cc28cfcbd98ed00b9a +size 3084078432 diff --git a/DeciLM-7B-instruct.i1-IQ3_XS.gguf b/DeciLM-7B-instruct.i1-IQ3_XS.gguf new file mode 100644 index 0000000..4eddb78 --- /dev/null +++ b/DeciLM-7B-instruct.i1-IQ3_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:d1164d4881e17103617ffdb7b18cb37b6b776b81b759390ddfa21257ad6936c0 +size 2925145440 diff --git a/DeciLM-7B-instruct.i1-IQ3_XXS.gguf b/DeciLM-7B-instruct.i1-IQ3_XXS.gguf new file mode 100644 index 0000000..3ec307b --- /dev/null +++ b/DeciLM-7B-instruct.i1-IQ3_XXS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:95180b1e9ba62561d0df789bb9add574715ea9d3b937f84b7cb0e6aa4a658495 +size 2739866976 diff --git a/DeciLM-7B-instruct.i1-IQ4_NL.gguf b/DeciLM-7B-instruct.i1-IQ4_NL.gguf new file mode 100644 index 0000000..e5b2d09 --- /dev/null +++ b/DeciLM-7B-instruct.i1-IQ4_NL.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8a7ccc080e6a9ddb6666d33e66d0f2ae5c0f8da0c5c3c05161918380b754f65d +size 4001832288 diff --git a/DeciLM-7B-instruct.i1-IQ4_XS.gguf b/DeciLM-7B-instruct.i1-IQ4_XS.gguf new file mode 100644 index 0000000..17be6f5 --- /dev/null +++ b/DeciLM-7B-instruct.i1-IQ4_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:a0b058e4b5b474fd783fe50408eb227b07ec1efb55e96775a9f9157b06f1a4b2 +size 3786923360 diff --git a/DeciLM-7B-instruct.i1-Q2_K.gguf b/DeciLM-7B-instruct.i1-Q2_K.gguf new file mode 100644 index 0000000..d31116f --- /dev/null +++ b/DeciLM-7B-instruct.i1-Q2_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:a26a5f9fd6838406b05e89f8a8ccb95c7c4572f1408370da51b0438b2622acdb +size 2630991200 diff --git a/DeciLM-7B-instruct.i1-Q2_K_S.gguf b/DeciLM-7B-instruct.i1-Q2_K_S.gguf new file mode 100644 index 0000000..2483d8e --- /dev/null +++ b/DeciLM-7B-instruct.i1-Q2_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:bdc3886ef0bb4df9a9d2126099d419821724683a433f3b7476d1b2745969f1b9 +size 2440674656 diff --git a/DeciLM-7B-instruct.i1-Q3_K_L.gguf b/DeciLM-7B-instruct.i1-Q3_K_L.gguf new file mode 100644 index 0000000..149e215 --- /dev/null +++ b/DeciLM-7B-instruct.i1-Q3_K_L.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:f19441cfaecee8372b0034b1df4e2fe71dbf691f28a2a2c62941c24e0fc43a23 +size 3711323488 diff --git a/DeciLM-7B-instruct.i1-Q3_K_M.gguf b/DeciLM-7B-instruct.i1-Q3_K_M.gguf new file mode 100644 index 0000000..d48f009 --- /dev/null +++ b/DeciLM-7B-instruct.i1-Q3_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:5bd54d5ad8ac5f8a00a60546af1f0695cfb7dac5b58b7aee59e9d5188ac4d1d9 +size 3420147040 diff --git a/DeciLM-7B-instruct.i1-Q3_K_S.gguf b/DeciLM-7B-instruct.i1-Q3_K_S.gguf new file mode 100644 index 0000000..a5b374f --- /dev/null +++ b/DeciLM-7B-instruct.i1-Q3_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:9a79c60d11ae1ebf9fa16463911f4b505b6853be13d164dc2574f79fa96b96c6 +size 3079413088 diff --git a/DeciLM-7B-instruct.i1-Q4_0.gguf b/DeciLM-7B-instruct.i1-Q4_0.gguf new file mode 100644 index 0000000..c914b44 --- /dev/null +++ b/DeciLM-7B-instruct.i1-Q4_0.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:7e57504e11cdefd28b21a2e3a666cca43f0bc1327964e615a38660aa1740dc95 +size 4012121440 diff --git a/DeciLM-7B-instruct.i1-Q4_1.gguf b/DeciLM-7B-instruct.i1-Q4_1.gguf new file mode 100644 index 0000000..cde4328 --- /dev/null +++ b/DeciLM-7B-instruct.i1-Q4_1.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:cc70da7309a9fae3142f31a5bcb0891af772738d7a161723d57c5a810ff4f331 +size 4429454688 diff --git a/DeciLM-7B-instruct.i1-Q4_K_M.gguf b/DeciLM-7B-instruct.i1-Q4_K_M.gguf new file mode 100644 index 0000000..a351104 --- /dev/null +++ b/DeciLM-7B-instruct.i1-Q4_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8ccd7cfdc582d8d7a5cbd5131a3a9e3bbd9679b3df381e161e4cd5fbad3e119a +size 4244528480 diff --git a/DeciLM-7B-instruct.i1-Q4_K_S.gguf b/DeciLM-7B-instruct.i1-Q4_K_S.gguf new file mode 100644 index 0000000..0b116fb --- /dev/null +++ b/DeciLM-7B-instruct.i1-Q4_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:689bb65cd577bd5196b9da43b8496f97d582bd3d152cc32783f40e4c6ea0c4f9 +size 4027850080 diff --git a/DeciLM-7B-instruct.i1-Q5_K_M.gguf b/DeciLM-7B-instruct.i1-Q5_K_M.gguf new file mode 100644 index 0000000..ada6c59 --- /dev/null +++ b/DeciLM-7B-instruct.i1-Q5_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:1999268edbe734b22790ddfeb0b25c0f3454ea156a0fb96d8c2303615b016111 +size 4988755296 diff --git a/DeciLM-7B-instruct.i1-Q5_K_S.gguf b/DeciLM-7B-instruct.i1-Q5_K_S.gguf new file mode 100644 index 0000000..17663a4 --- /dev/null +++ b/DeciLM-7B-instruct.i1-Q5_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:77795b22a019388b35fb0532b0812a085d7ddb473b1d8bb1c8a4ada8cd218ffa +size 4861468000 diff --git a/DeciLM-7B-instruct.i1-Q6_K.gguf b/DeciLM-7B-instruct.i1-Q6_K.gguf new file mode 100644 index 0000000..619a0a7 --- /dev/null +++ b/DeciLM-7B-instruct.i1-Q6_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:54d1f2bfb862729440683e971efa955b9cf2273ba0648620f556f76081030885 +size 5779496288 diff --git a/README.md b/README.md new file mode 100644 index 0000000..f489b5b --- /dev/null +++ b/README.md @@ -0,0 +1,78 @@ +--- +base_model: Deci/DeciLM-7B-instruct +datasets: +- Open-Orca/SlimOrca +language: +- en +library_name: transformers +license: apache-2.0 +quantized_by: mradermacher +--- +## About + + + + + + +weighted/imatrix quants of https://huggingface.co/Deci/DeciLM-7B-instruct + + +static quants are available at https://huggingface.co/mradermacher/DeciLM-7B-instruct-GGUF +## Usage + +If you are unsure how to use GGUF files, refer to one of [TheBloke's +READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for +more details, including on how to concatenate multi-part files. + +## Provided Quants + +(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants) + +| Link | Type | Size/GB | Notes | +|:-----|:-----|--------:|:------| +| [GGUF](https://huggingface.co/mradermacher/DeciLM-7B-instruct-i1-GGUF/resolve/main/DeciLM-7B-instruct.i1-IQ1_S.gguf) | i1-IQ1_S | 1.6 | for the desperate | +| [GGUF](https://huggingface.co/mradermacher/DeciLM-7B-instruct-i1-GGUF/resolve/main/DeciLM-7B-instruct.i1-IQ1_M.gguf) | i1-IQ1_M | 1.8 | mostly desperate | +| [GGUF](https://huggingface.co/mradermacher/DeciLM-7B-instruct-i1-GGUF/resolve/main/DeciLM-7B-instruct.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 2.0 | | +| [GGUF](https://huggingface.co/mradermacher/DeciLM-7B-instruct-i1-GGUF/resolve/main/DeciLM-7B-instruct.i1-IQ2_XS.gguf) | i1-IQ2_XS | 2.2 | | +| [GGUF](https://huggingface.co/mradermacher/DeciLM-7B-instruct-i1-GGUF/resolve/main/DeciLM-7B-instruct.i1-IQ2_S.gguf) | i1-IQ2_S | 2.3 | | +| [GGUF](https://huggingface.co/mradermacher/DeciLM-7B-instruct-i1-GGUF/resolve/main/DeciLM-7B-instruct.i1-IQ2_M.gguf) | i1-IQ2_M | 2.5 | | +| [GGUF](https://huggingface.co/mradermacher/DeciLM-7B-instruct-i1-GGUF/resolve/main/DeciLM-7B-instruct.i1-Q2_K_S.gguf) | i1-Q2_K_S | 2.5 | very low quality | +| [GGUF](https://huggingface.co/mradermacher/DeciLM-7B-instruct-i1-GGUF/resolve/main/DeciLM-7B-instruct.i1-Q2_K.gguf) | i1-Q2_K | 2.7 | IQ3_XXS probably better | +| [GGUF](https://huggingface.co/mradermacher/DeciLM-7B-instruct-i1-GGUF/resolve/main/DeciLM-7B-instruct.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 2.8 | lower quality | +| [GGUF](https://huggingface.co/mradermacher/DeciLM-7B-instruct-i1-GGUF/resolve/main/DeciLM-7B-instruct.i1-IQ3_XS.gguf) | i1-IQ3_XS | 3.0 | | +| [GGUF](https://huggingface.co/mradermacher/DeciLM-7B-instruct-i1-GGUF/resolve/main/DeciLM-7B-instruct.i1-Q3_K_S.gguf) | i1-Q3_K_S | 3.2 | IQ3_XS probably better | +| [GGUF](https://huggingface.co/mradermacher/DeciLM-7B-instruct-i1-GGUF/resolve/main/DeciLM-7B-instruct.i1-IQ3_S.gguf) | i1-IQ3_S | 3.2 | beats Q3_K* | +| [GGUF](https://huggingface.co/mradermacher/DeciLM-7B-instruct-i1-GGUF/resolve/main/DeciLM-7B-instruct.i1-IQ3_M.gguf) | i1-IQ3_M | 3.3 | | +| [GGUF](https://huggingface.co/mradermacher/DeciLM-7B-instruct-i1-GGUF/resolve/main/DeciLM-7B-instruct.i1-Q3_K_M.gguf) | i1-Q3_K_M | 3.5 | IQ3_S probably better | +| [GGUF](https://huggingface.co/mradermacher/DeciLM-7B-instruct-i1-GGUF/resolve/main/DeciLM-7B-instruct.i1-Q3_K_L.gguf) | i1-Q3_K_L | 3.8 | IQ3_M probably better | +| [GGUF](https://huggingface.co/mradermacher/DeciLM-7B-instruct-i1-GGUF/resolve/main/DeciLM-7B-instruct.i1-IQ4_XS.gguf) | i1-IQ4_XS | 3.9 | | +| [GGUF](https://huggingface.co/mradermacher/DeciLM-7B-instruct-i1-GGUF/resolve/main/DeciLM-7B-instruct.i1-IQ4_NL.gguf) | i1-IQ4_NL | 4.1 | prefer IQ4_XS | +| [GGUF](https://huggingface.co/mradermacher/DeciLM-7B-instruct-i1-GGUF/resolve/main/DeciLM-7B-instruct.i1-Q4_0.gguf) | i1-Q4_0 | 4.1 | fast, low quality | +| [GGUF](https://huggingface.co/mradermacher/DeciLM-7B-instruct-i1-GGUF/resolve/main/DeciLM-7B-instruct.i1-Q4_K_S.gguf) | i1-Q4_K_S | 4.1 | optimal size/speed/quality | +| [GGUF](https://huggingface.co/mradermacher/DeciLM-7B-instruct-i1-GGUF/resolve/main/DeciLM-7B-instruct.i1-Q4_K_M.gguf) | i1-Q4_K_M | 4.3 | fast, recommended | +| [GGUF](https://huggingface.co/mradermacher/DeciLM-7B-instruct-i1-GGUF/resolve/main/DeciLM-7B-instruct.i1-Q4_1.gguf) | i1-Q4_1 | 4.5 | | +| [GGUF](https://huggingface.co/mradermacher/DeciLM-7B-instruct-i1-GGUF/resolve/main/DeciLM-7B-instruct.i1-Q5_K_S.gguf) | i1-Q5_K_S | 5.0 | | +| [GGUF](https://huggingface.co/mradermacher/DeciLM-7B-instruct-i1-GGUF/resolve/main/DeciLM-7B-instruct.i1-Q5_K_M.gguf) | i1-Q5_K_M | 5.1 | | +| [GGUF](https://huggingface.co/mradermacher/DeciLM-7B-instruct-i1-GGUF/resolve/main/DeciLM-7B-instruct.i1-Q6_K.gguf) | i1-Q6_K | 5.9 | practically like static Q6_K | + +Here is a handy graph by ikawrakow comparing some lower-quality quant +types (lower is better): + +![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png) + +And here are Artefact2's thoughts on the matter: +https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9 + +## FAQ / Model Request + +See https://huggingface.co/mradermacher/model_requests for some answers to +questions you might have and/or if you want some other model quantized. + +## Thanks + +I thank my company, [nethype GmbH](https://www.nethype.de/), for letting +me use its servers and providing upgrades to my workstation to enable +this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to. + + diff --git a/imatrix.dat b/imatrix.dat new file mode 100644 index 0000000..22d0a67 --- /dev/null +++ b/imatrix.dat @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:cb74659ce56a2c4d6fe69f1bc80bc30f2f3abf74973342a7da8af69cf6821e20 +size 4988157