From 7016e8a9cca37b635c0414551bc1c6108f37efdc Mon Sep 17 00:00:00 2001 From: ModelHub XC Date: Wed, 29 Apr 2026 16:35:05 +0800 Subject: [PATCH] =?UTF-8?q?=E5=88=9D=E5=A7=8B=E5=8C=96=E9=A1=B9=E7=9B=AE?= =?UTF-8?q?=EF=BC=8C=E7=94=B1ModelHub=20XC=E7=A4=BE=E5=8C=BA=E6=8F=90?= =?UTF-8?q?=E4=BE=9B=E6=A8=A1=E5=9E=8B?= MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Model: mradermacher/Magellanic-Opus-14B-Exp-i1-GGUF Source: Original Platform --- .gitattributes | 60 ++++++++++++++++++ Magellanic-Opus-14B-Exp.i1-IQ1_M.gguf | 3 + Magellanic-Opus-14B-Exp.i1-IQ1_S.gguf | 3 + Magellanic-Opus-14B-Exp.i1-IQ2_M.gguf | 3 + Magellanic-Opus-14B-Exp.i1-IQ2_S.gguf | 3 + Magellanic-Opus-14B-Exp.i1-IQ2_XS.gguf | 3 + Magellanic-Opus-14B-Exp.i1-IQ2_XXS.gguf | 3 + Magellanic-Opus-14B-Exp.i1-IQ3_M.gguf | 3 + Magellanic-Opus-14B-Exp.i1-IQ3_S.gguf | 3 + Magellanic-Opus-14B-Exp.i1-IQ3_XS.gguf | 3 + Magellanic-Opus-14B-Exp.i1-IQ3_XXS.gguf | 3 + Magellanic-Opus-14B-Exp.i1-IQ4_NL.gguf | 3 + Magellanic-Opus-14B-Exp.i1-IQ4_XS.gguf | 3 + Magellanic-Opus-14B-Exp.i1-Q2_K.gguf | 3 + Magellanic-Opus-14B-Exp.i1-Q2_K_S.gguf | 3 + Magellanic-Opus-14B-Exp.i1-Q3_K_L.gguf | 3 + Magellanic-Opus-14B-Exp.i1-Q3_K_M.gguf | 3 + Magellanic-Opus-14B-Exp.i1-Q3_K_S.gguf | 3 + Magellanic-Opus-14B-Exp.i1-Q4_0.gguf | 3 + Magellanic-Opus-14B-Exp.i1-Q4_1.gguf | 3 + Magellanic-Opus-14B-Exp.i1-Q4_K_M.gguf | 3 + Magellanic-Opus-14B-Exp.i1-Q4_K_S.gguf | 3 + Magellanic-Opus-14B-Exp.i1-Q5_K_M.gguf | 3 + Magellanic-Opus-14B-Exp.i1-Q5_K_S.gguf | 3 + Magellanic-Opus-14B-Exp.i1-Q6_K.gguf | 3 + README.md | 83 +++++++++++++++++++++++++ imatrix.dat | 3 + 27 files changed, 218 insertions(+) create mode 100644 .gitattributes create mode 100644 Magellanic-Opus-14B-Exp.i1-IQ1_M.gguf create mode 100644 Magellanic-Opus-14B-Exp.i1-IQ1_S.gguf create mode 100644 Magellanic-Opus-14B-Exp.i1-IQ2_M.gguf create mode 100644 Magellanic-Opus-14B-Exp.i1-IQ2_S.gguf create mode 100644 Magellanic-Opus-14B-Exp.i1-IQ2_XS.gguf create mode 100644 Magellanic-Opus-14B-Exp.i1-IQ2_XXS.gguf create mode 100644 Magellanic-Opus-14B-Exp.i1-IQ3_M.gguf create mode 100644 Magellanic-Opus-14B-Exp.i1-IQ3_S.gguf create mode 100644 Magellanic-Opus-14B-Exp.i1-IQ3_XS.gguf create mode 100644 Magellanic-Opus-14B-Exp.i1-IQ3_XXS.gguf create mode 100644 Magellanic-Opus-14B-Exp.i1-IQ4_NL.gguf create mode 100644 Magellanic-Opus-14B-Exp.i1-IQ4_XS.gguf create mode 100644 Magellanic-Opus-14B-Exp.i1-Q2_K.gguf create mode 100644 Magellanic-Opus-14B-Exp.i1-Q2_K_S.gguf create mode 100644 Magellanic-Opus-14B-Exp.i1-Q3_K_L.gguf create mode 100644 Magellanic-Opus-14B-Exp.i1-Q3_K_M.gguf create mode 100644 Magellanic-Opus-14B-Exp.i1-Q3_K_S.gguf create mode 100644 Magellanic-Opus-14B-Exp.i1-Q4_0.gguf create mode 100644 Magellanic-Opus-14B-Exp.i1-Q4_1.gguf create mode 100644 Magellanic-Opus-14B-Exp.i1-Q4_K_M.gguf create mode 100644 Magellanic-Opus-14B-Exp.i1-Q4_K_S.gguf create mode 100644 Magellanic-Opus-14B-Exp.i1-Q5_K_M.gguf create mode 100644 Magellanic-Opus-14B-Exp.i1-Q5_K_S.gguf create mode 100644 Magellanic-Opus-14B-Exp.i1-Q6_K.gguf create mode 100644 README.md create mode 100644 imatrix.dat diff --git a/.gitattributes b/.gitattributes new file mode 100644 index 0000000..1a5e0ec --- /dev/null +++ b/.gitattributes @@ -0,0 +1,60 @@ +*.7z filter=lfs diff=lfs merge=lfs -text +*.arrow filter=lfs diff=lfs merge=lfs -text +*.bin filter=lfs diff=lfs merge=lfs -text +*.bz2 filter=lfs diff=lfs merge=lfs -text +*.ckpt filter=lfs diff=lfs merge=lfs -text +*.ftz filter=lfs diff=lfs merge=lfs -text +*.gz filter=lfs diff=lfs merge=lfs -text +*.h5 filter=lfs diff=lfs merge=lfs -text +*.joblib filter=lfs diff=lfs merge=lfs -text +*.lfs.* filter=lfs diff=lfs merge=lfs -text +*.mlmodel filter=lfs diff=lfs merge=lfs -text +*.model filter=lfs diff=lfs merge=lfs -text +*.msgpack filter=lfs diff=lfs merge=lfs -text +*.npy filter=lfs diff=lfs merge=lfs -text +*.npz filter=lfs diff=lfs merge=lfs -text +*.onnx filter=lfs diff=lfs merge=lfs -text +*.ot filter=lfs diff=lfs merge=lfs -text +*.parquet filter=lfs diff=lfs merge=lfs -text +*.pb filter=lfs diff=lfs merge=lfs -text +*.pickle filter=lfs diff=lfs merge=lfs -text +*.pkl filter=lfs diff=lfs merge=lfs -text +*.pt filter=lfs diff=lfs merge=lfs -text +*.pth filter=lfs diff=lfs merge=lfs -text +*.rar filter=lfs diff=lfs merge=lfs -text +*.safetensors filter=lfs diff=lfs merge=lfs -text +saved_model/**/* filter=lfs diff=lfs merge=lfs -text +*.tar.* filter=lfs diff=lfs merge=lfs -text +*.tar filter=lfs diff=lfs merge=lfs -text +*.tflite filter=lfs diff=lfs merge=lfs -text +*.tgz filter=lfs diff=lfs merge=lfs -text +*.wasm filter=lfs diff=lfs merge=lfs -text +*.xz filter=lfs diff=lfs merge=lfs -text +*.zip filter=lfs diff=lfs merge=lfs -text +*.zst filter=lfs diff=lfs merge=lfs -text +*tfevents* filter=lfs diff=lfs merge=lfs -text +imatrix.dat filter=lfs diff=lfs merge=lfs -text +Magellanic-Opus-14B-Exp.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text +Magellanic-Opus-14B-Exp.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text +Magellanic-Opus-14B-Exp.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Magellanic-Opus-14B-Exp.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text +Magellanic-Opus-14B-Exp.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text +Magellanic-Opus-14B-Exp.i1-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text +Magellanic-Opus-14B-Exp.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text +Magellanic-Opus-14B-Exp.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text +Magellanic-Opus-14B-Exp.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text +Magellanic-Opus-14B-Exp.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text +Magellanic-Opus-14B-Exp.i1-Q2_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Magellanic-Opus-14B-Exp.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text +Magellanic-Opus-14B-Exp.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Magellanic-Opus-14B-Exp.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text +Magellanic-Opus-14B-Exp.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text +Magellanic-Opus-14B-Exp.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text +Magellanic-Opus-14B-Exp.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Magellanic-Opus-14B-Exp.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text +Magellanic-Opus-14B-Exp.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text +Magellanic-Opus-14B-Exp.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text +Magellanic-Opus-14B-Exp.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text +Magellanic-Opus-14B-Exp.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text +Magellanic-Opus-14B-Exp.i1-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text +Magellanic-Opus-14B-Exp.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text diff --git a/Magellanic-Opus-14B-Exp.i1-IQ1_M.gguf b/Magellanic-Opus-14B-Exp.i1-IQ1_M.gguf new file mode 100644 index 0000000..2201eef --- /dev/null +++ b/Magellanic-Opus-14B-Exp.i1-IQ1_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:ce4fd29c5e09bcfcd2c8f0f805ea51bf5ac48686b7eb70dcca69e11c337a60ab +size 3870225856 diff --git a/Magellanic-Opus-14B-Exp.i1-IQ1_S.gguf b/Magellanic-Opus-14B-Exp.i1-IQ1_S.gguf new file mode 100644 index 0000000..4129dec --- /dev/null +++ b/Magellanic-Opus-14B-Exp.i1-IQ1_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:69a3ec56375869b99c98e15253b6a1ffeb969a7df8f228d157e847ca8604ca51 +size 3605910976 diff --git a/Magellanic-Opus-14B-Exp.i1-IQ2_M.gguf b/Magellanic-Opus-14B-Exp.i1-IQ2_M.gguf new file mode 100644 index 0000000..7f4b041 --- /dev/null +++ b/Magellanic-Opus-14B-Exp.i1-IQ2_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:5192ffdb5fd5e265b3316f7cda425f0b0324fa12294a57058bc1f7e6d715dbd6 +size 5353855808 diff --git a/Magellanic-Opus-14B-Exp.i1-IQ2_S.gguf b/Magellanic-Opus-14B-Exp.i1-IQ2_S.gguf new file mode 100644 index 0000000..12d1157 --- /dev/null +++ b/Magellanic-Opus-14B-Exp.i1-IQ2_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:1363014ceca24bde6b965c3125fdd232e0cc50beda80f3567cb757cddc9a4dbc +size 5001435968 diff --git a/Magellanic-Opus-14B-Exp.i1-IQ2_XS.gguf b/Magellanic-Opus-14B-Exp.i1-IQ2_XS.gguf new file mode 100644 index 0000000..57d8caa --- /dev/null +++ b/Magellanic-Opus-14B-Exp.i1-IQ2_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:a8804597ca466e6f6f246dd05a32fecc15d8a6f14f94fb54832896e820e94278 +size 4702492096 diff --git a/Magellanic-Opus-14B-Exp.i1-IQ2_XXS.gguf b/Magellanic-Opus-14B-Exp.i1-IQ2_XXS.gguf new file mode 100644 index 0000000..63e9039 --- /dev/null +++ b/Magellanic-Opus-14B-Exp.i1-IQ2_XXS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:5988cb49b3dff7274866aa70b8f5cb7d14c595bdf02ef48b8206d5a1faf153e7 +size 4310750656 diff --git a/Magellanic-Opus-14B-Exp.i1-IQ3_M.gguf b/Magellanic-Opus-14B-Exp.i1-IQ3_M.gguf new file mode 100644 index 0000000..312bcc1 --- /dev/null +++ b/Magellanic-Opus-14B-Exp.i1-IQ3_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:d0e590f75e3e03bc54720b73828e11a1ebfe763c3dc26ca70f0c4b1893b00a77 +size 6913976192 diff --git a/Magellanic-Opus-14B-Exp.i1-IQ3_S.gguf b/Magellanic-Opus-14B-Exp.i1-IQ3_S.gguf new file mode 100644 index 0000000..17bd1fc --- /dev/null +++ b/Magellanic-Opus-14B-Exp.i1-IQ3_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:b6833444f27be725c9c85ddf8e97e03b34d625ec7897f5ff96dbb7a07d03a22a +size 6690457472 diff --git a/Magellanic-Opus-14B-Exp.i1-IQ3_XS.gguf b/Magellanic-Opus-14B-Exp.i1-IQ3_XS.gguf new file mode 100644 index 0000000..c885f69 --- /dev/null +++ b/Magellanic-Opus-14B-Exp.i1-IQ3_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:3be925815bad709dbe80588d4375cf10241ade6a7f6dc4ff6562cdfe273e2680 +size 6380799872 diff --git a/Magellanic-Opus-14B-Exp.i1-IQ3_XXS.gguf b/Magellanic-Opus-14B-Exp.i1-IQ3_XXS.gguf new file mode 100644 index 0000000..6f50731 --- /dev/null +++ b/Magellanic-Opus-14B-Exp.i1-IQ3_XXS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:df29d52d3fdb8bf830b18bd5c4bbbb6fe350d04694028fd2b204cd71c5076b22 +size 5944417088 diff --git a/Magellanic-Opus-14B-Exp.i1-IQ4_NL.gguf b/Magellanic-Opus-14B-Exp.i1-IQ4_NL.gguf new file mode 100644 index 0000000..00f6fc8 --- /dev/null +++ b/Magellanic-Opus-14B-Exp.i1-IQ4_NL.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:195e635318e478df62249e341f86c73651a8b97b0f5ae4b68b9cf096a87f880e +size 8546349984 diff --git a/Magellanic-Opus-14B-Exp.i1-IQ4_XS.gguf b/Magellanic-Opus-14B-Exp.i1-IQ4_XS.gguf new file mode 100644 index 0000000..a6016bc --- /dev/null +++ b/Magellanic-Opus-14B-Exp.i1-IQ4_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:bb413c852b39454d6eb6f2e67354162cc1079b80ee2805c8578c3ed66dca3cb3 +size 8117071104 diff --git a/Magellanic-Opus-14B-Exp.i1-Q2_K.gguf b/Magellanic-Opus-14B-Exp.i1-Q2_K.gguf new file mode 100644 index 0000000..d84ee67 --- /dev/null +++ b/Magellanic-Opus-14B-Exp.i1-Q2_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:fc58f5c783318bb05ebf75bba1eb96918c3435a26eef18bd7bf98280d1482a79 +size 5768143360 diff --git a/Magellanic-Opus-14B-Exp.i1-Q2_K_S.gguf b/Magellanic-Opus-14B-Exp.i1-Q2_K_S.gguf new file mode 100644 index 0000000..34062d4 --- /dev/null +++ b/Magellanic-Opus-14B-Exp.i1-Q2_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:dfeeee5f3bc486a243d4c76548fadabeea9855b3804097917cbb2ae98b744b2d +size 5394833920 diff --git a/Magellanic-Opus-14B-Exp.i1-Q3_K_L.gguf b/Magellanic-Opus-14B-Exp.i1-Q3_K_L.gguf new file mode 100644 index 0000000..8736c94 --- /dev/null +++ b/Magellanic-Opus-14B-Exp.i1-Q3_K_L.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c0786f6f62cf497a97612d5508dfae467eadb2b1448c109acfbd6b5dc804a2f9 +size 7922206592 diff --git a/Magellanic-Opus-14B-Exp.i1-Q3_K_M.gguf b/Magellanic-Opus-14B-Exp.i1-Q3_K_M.gguf new file mode 100644 index 0000000..1d3d015 --- /dev/null +++ b/Magellanic-Opus-14B-Exp.i1-Q3_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:169d4df2cac69868c6125db9da4830d2b0adbd2efd1b6fa713c73e8f7757044c +size 7336642432 diff --git a/Magellanic-Opus-14B-Exp.i1-Q3_K_S.gguf b/Magellanic-Opus-14B-Exp.i1-Q3_K_S.gguf new file mode 100644 index 0000000..b96b1da --- /dev/null +++ b/Magellanic-Opus-14B-Exp.i1-Q3_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:b9826ed36f5116c328b8567265cac3d267746655697feba3ec8e712624b6a376 +size 6657034112 diff --git a/Magellanic-Opus-14B-Exp.i1-Q4_0.gguf b/Magellanic-Opus-14B-Exp.i1-Q4_0.gguf new file mode 100644 index 0000000..04a667e --- /dev/null +++ b/Magellanic-Opus-14B-Exp.i1-Q4_0.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8efa2b1995f3fce7288d1b244f1f39b032ece62500041aad310b08bea4df7bdb +size 8541434784 diff --git a/Magellanic-Opus-14B-Exp.i1-Q4_1.gguf b/Magellanic-Opus-14B-Exp.i1-Q4_1.gguf new file mode 100644 index 0000000..fbab329 --- /dev/null +++ b/Magellanic-Opus-14B-Exp.i1-Q4_1.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:afe659fa17bab6f7c937999b4413ebff0ef653d4c286914a3a78450866d30b97 +size 9389179104 diff --git a/Magellanic-Opus-14B-Exp.i1-Q4_K_M.gguf b/Magellanic-Opus-14B-Exp.i1-Q4_K_M.gguf new file mode 100644 index 0000000..eafba30 --- /dev/null +++ b/Magellanic-Opus-14B-Exp.i1-Q4_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:a2a0470f6f66ee7f1168e18de38f74556e4f3ad85d1dc69483a033514f1e5861 +size 8985277344 diff --git a/Magellanic-Opus-14B-Exp.i1-Q4_K_S.gguf b/Magellanic-Opus-14B-Exp.i1-Q4_K_S.gguf new file mode 100644 index 0000000..d357d9d --- /dev/null +++ b/Magellanic-Opus-14B-Exp.i1-Q4_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:6a80cd0e9c7b4fa5c24ce62ecbcddefe83ec6fdc04b0b68c5bcedac0fac8429d +size 8570598304 diff --git a/Magellanic-Opus-14B-Exp.i1-Q5_K_M.gguf b/Magellanic-Opus-14B-Exp.i1-Q5_K_M.gguf new file mode 100644 index 0000000..88c509f --- /dev/null +++ b/Magellanic-Opus-14B-Exp.i1-Q5_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:d6cade97693b9ae662e411bd61baa1583e39d64aff449f335392cf27c767179b +size 10505784864 diff --git a/Magellanic-Opus-14B-Exp.i1-Q5_K_S.gguf b/Magellanic-Opus-14B-Exp.i1-Q5_K_S.gguf new file mode 100644 index 0000000..42b9aba --- /dev/null +++ b/Magellanic-Opus-14B-Exp.i1-Q5_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:275f2130a5f79e44d6f978643c4a47a8e570fc7ffa8a76e26c170020fea2d876 +size 10263465504 diff --git a/Magellanic-Opus-14B-Exp.i1-Q6_K.gguf b/Magellanic-Opus-14B-Exp.i1-Q6_K.gguf new file mode 100644 index 0000000..ff5ae27 --- /dev/null +++ b/Magellanic-Opus-14B-Exp.i1-Q6_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:ef0ec9d35e7afa9653056d3da688e80098441f68a96a1168adc42166d4fae1bd +size 12121324128 diff --git a/README.md b/README.md new file mode 100644 index 0000000..c85fa59 --- /dev/null +++ b/README.md @@ -0,0 +1,83 @@ +--- +base_model: prithivMLmods/Magellanic-Opus-14B-Exp +language: +- en +- zh +library_name: transformers +license: apache-2.0 +quantized_by: mradermacher +tags: +- text-generation-inference +- math +- trl +- reasoning +- QwQ +--- +## About + + + + + + +weighted/imatrix quants of https://huggingface.co/prithivMLmods/Magellanic-Opus-14B-Exp + + +static quants are available at https://huggingface.co/mradermacher/Magellanic-Opus-14B-Exp-GGUF +## Usage + +If you are unsure how to use GGUF files, refer to one of [TheBloke's +READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for +more details, including on how to concatenate multi-part files. + +## Provided Quants + +(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants) + +| Link | Type | Size/GB | Notes | +|:-----|:-----|--------:|:------| +| [GGUF](https://huggingface.co/mradermacher/Magellanic-Opus-14B-Exp-i1-GGUF/resolve/main/Magellanic-Opus-14B-Exp.i1-IQ1_S.gguf) | i1-IQ1_S | 3.7 | for the desperate | +| [GGUF](https://huggingface.co/mradermacher/Magellanic-Opus-14B-Exp-i1-GGUF/resolve/main/Magellanic-Opus-14B-Exp.i1-IQ1_M.gguf) | i1-IQ1_M | 4.0 | mostly desperate | +| [GGUF](https://huggingface.co/mradermacher/Magellanic-Opus-14B-Exp-i1-GGUF/resolve/main/Magellanic-Opus-14B-Exp.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 4.4 | | +| [GGUF](https://huggingface.co/mradermacher/Magellanic-Opus-14B-Exp-i1-GGUF/resolve/main/Magellanic-Opus-14B-Exp.i1-IQ2_XS.gguf) | i1-IQ2_XS | 4.8 | | +| [GGUF](https://huggingface.co/mradermacher/Magellanic-Opus-14B-Exp-i1-GGUF/resolve/main/Magellanic-Opus-14B-Exp.i1-IQ2_S.gguf) | i1-IQ2_S | 5.1 | | +| [GGUF](https://huggingface.co/mradermacher/Magellanic-Opus-14B-Exp-i1-GGUF/resolve/main/Magellanic-Opus-14B-Exp.i1-IQ2_M.gguf) | i1-IQ2_M | 5.5 | | +| [GGUF](https://huggingface.co/mradermacher/Magellanic-Opus-14B-Exp-i1-GGUF/resolve/main/Magellanic-Opus-14B-Exp.i1-Q2_K_S.gguf) | i1-Q2_K_S | 5.5 | very low quality | +| [GGUF](https://huggingface.co/mradermacher/Magellanic-Opus-14B-Exp-i1-GGUF/resolve/main/Magellanic-Opus-14B-Exp.i1-Q2_K.gguf) | i1-Q2_K | 5.9 | IQ3_XXS probably better | +| [GGUF](https://huggingface.co/mradermacher/Magellanic-Opus-14B-Exp-i1-GGUF/resolve/main/Magellanic-Opus-14B-Exp.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 6.0 | lower quality | +| [GGUF](https://huggingface.co/mradermacher/Magellanic-Opus-14B-Exp-i1-GGUF/resolve/main/Magellanic-Opus-14B-Exp.i1-IQ3_XS.gguf) | i1-IQ3_XS | 6.5 | | +| [GGUF](https://huggingface.co/mradermacher/Magellanic-Opus-14B-Exp-i1-GGUF/resolve/main/Magellanic-Opus-14B-Exp.i1-Q3_K_S.gguf) | i1-Q3_K_S | 6.8 | IQ3_XS probably better | +| [GGUF](https://huggingface.co/mradermacher/Magellanic-Opus-14B-Exp-i1-GGUF/resolve/main/Magellanic-Opus-14B-Exp.i1-IQ3_S.gguf) | i1-IQ3_S | 6.8 | beats Q3_K* | +| [GGUF](https://huggingface.co/mradermacher/Magellanic-Opus-14B-Exp-i1-GGUF/resolve/main/Magellanic-Opus-14B-Exp.i1-IQ3_M.gguf) | i1-IQ3_M | 7.0 | | +| [GGUF](https://huggingface.co/mradermacher/Magellanic-Opus-14B-Exp-i1-GGUF/resolve/main/Magellanic-Opus-14B-Exp.i1-Q3_K_M.gguf) | i1-Q3_K_M | 7.4 | IQ3_S probably better | +| [GGUF](https://huggingface.co/mradermacher/Magellanic-Opus-14B-Exp-i1-GGUF/resolve/main/Magellanic-Opus-14B-Exp.i1-Q3_K_L.gguf) | i1-Q3_K_L | 8.0 | IQ3_M probably better | +| [GGUF](https://huggingface.co/mradermacher/Magellanic-Opus-14B-Exp-i1-GGUF/resolve/main/Magellanic-Opus-14B-Exp.i1-IQ4_XS.gguf) | i1-IQ4_XS | 8.2 | | +| [GGUF](https://huggingface.co/mradermacher/Magellanic-Opus-14B-Exp-i1-GGUF/resolve/main/Magellanic-Opus-14B-Exp.i1-Q4_0.gguf) | i1-Q4_0 | 8.6 | fast, low quality | +| [GGUF](https://huggingface.co/mradermacher/Magellanic-Opus-14B-Exp-i1-GGUF/resolve/main/Magellanic-Opus-14B-Exp.i1-IQ4_NL.gguf) | i1-IQ4_NL | 8.6 | prefer IQ4_XS | +| [GGUF](https://huggingface.co/mradermacher/Magellanic-Opus-14B-Exp-i1-GGUF/resolve/main/Magellanic-Opus-14B-Exp.i1-Q4_K_S.gguf) | i1-Q4_K_S | 8.7 | optimal size/speed/quality | +| [GGUF](https://huggingface.co/mradermacher/Magellanic-Opus-14B-Exp-i1-GGUF/resolve/main/Magellanic-Opus-14B-Exp.i1-Q4_K_M.gguf) | i1-Q4_K_M | 9.1 | fast, recommended | +| [GGUF](https://huggingface.co/mradermacher/Magellanic-Opus-14B-Exp-i1-GGUF/resolve/main/Magellanic-Opus-14B-Exp.i1-Q4_1.gguf) | i1-Q4_1 | 9.5 | | +| [GGUF](https://huggingface.co/mradermacher/Magellanic-Opus-14B-Exp-i1-GGUF/resolve/main/Magellanic-Opus-14B-Exp.i1-Q5_K_S.gguf) | i1-Q5_K_S | 10.4 | | +| [GGUF](https://huggingface.co/mradermacher/Magellanic-Opus-14B-Exp-i1-GGUF/resolve/main/Magellanic-Opus-14B-Exp.i1-Q5_K_M.gguf) | i1-Q5_K_M | 10.6 | | +| [GGUF](https://huggingface.co/mradermacher/Magellanic-Opus-14B-Exp-i1-GGUF/resolve/main/Magellanic-Opus-14B-Exp.i1-Q6_K.gguf) | i1-Q6_K | 12.2 | practically like static Q6_K | + +Here is a handy graph by ikawrakow comparing some lower-quality quant +types (lower is better): + +![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png) + +And here are Artefact2's thoughts on the matter: +https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9 + +## FAQ / Model Request + +See https://huggingface.co/mradermacher/model_requests for some answers to +questions you might have and/or if you want some other model quantized. + +## Thanks + +I thank my company, [nethype GmbH](https://www.nethype.de/), for letting +me use its servers and providing upgrades to my workstation to enable +this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to. + + diff --git a/imatrix.dat b/imatrix.dat new file mode 100644 index 0000000..e23a8e5 --- /dev/null +++ b/imatrix.dat @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:39e26666939e8a55b53838beadb0e467115a1b290e1b4f1139a0f57a0a7ce948 +size 8563597