commit df85a1ab1f2cc58cdd3082ecac76a18ca0a12bda Author: ModelHub XC Date: Mon May 11 23:32:35 2026 +0800 Initialize project; model provided by the ModelHub XC community Model: mradermacher/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic-i1-GGUF Source: Original Platform diff --git a/.gitattributes b/.gitattributes new file mode 100644 index 0000000..6922f3b --- /dev/null +++ b/.gitattributes @@ -0,0 +1,60 @@ +*.7z filter=lfs diff=lfs merge=lfs -text +*.arrow filter=lfs diff=lfs merge=lfs -text +*.bin filter=lfs diff=lfs merge=lfs -text +*.bz2 filter=lfs diff=lfs merge=lfs -text +*.ckpt filter=lfs diff=lfs merge=lfs -text +*.ftz filter=lfs diff=lfs merge=lfs -text +*.gz filter=lfs diff=lfs merge=lfs -text +*.h5 filter=lfs diff=lfs merge=lfs -text +*.joblib filter=lfs diff=lfs merge=lfs -text +*.lfs.* filter=lfs diff=lfs merge=lfs -text +*.mlmodel filter=lfs diff=lfs merge=lfs -text +*.model filter=lfs diff=lfs merge=lfs -text +*.msgpack filter=lfs diff=lfs merge=lfs -text +*.npy filter=lfs diff=lfs merge=lfs -text +*.npz filter=lfs diff=lfs merge=lfs -text +*.onnx filter=lfs diff=lfs merge=lfs -text +*.ot filter=lfs diff=lfs merge=lfs -text +*.parquet filter=lfs diff=lfs merge=lfs -text +*.pb filter=lfs diff=lfs merge=lfs -text +*.pickle filter=lfs diff=lfs merge=lfs -text +*.pkl filter=lfs diff=lfs merge=lfs -text +*.pt filter=lfs diff=lfs merge=lfs -text +*.pth filter=lfs diff=lfs merge=lfs -text +*.rar filter=lfs diff=lfs merge=lfs -text +*.safetensors filter=lfs diff=lfs merge=lfs -text +saved_model/**/* filter=lfs diff=lfs merge=lfs -text +*.tar.* filter=lfs diff=lfs merge=lfs -text +*.tar filter=lfs diff=lfs merge=lfs -text +*.tflite filter=lfs diff=lfs merge=lfs -text +*.tgz filter=lfs diff=lfs merge=lfs -text +*.wasm filter=lfs diff=lfs merge=lfs -text +*.xz filter=lfs diff=lfs merge=lfs -text +*.zip filter=lfs diff=lfs merge=lfs -text +*.zst filter=lfs diff=lfs merge=lfs -text +*tfevents* filter=lfs diff=lfs merge=lfs -text +Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.imatrix.gguf 
filter=lfs diff=lfs merge=lfs -text +Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q2_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text 
+Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text diff --git a/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ1_M.gguf b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ1_M.gguf new file mode 100644 index 0000000..4e6a030 --- /dev/null +++ b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ1_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:b7257aa239be7262053f16bca8d3dd3e892a170c69f09f4b4e9f393447ddf793 +size 2161973120 diff --git a/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ1_S.gguf b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ1_S.gguf new file mode 100644 index 0000000..652f38f --- /dev/null +++ b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ1_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:2d213ae6538ae8c17b2a44a1257a75e5e5a04173eac01365739d734c593758d7 +size 2019628928 diff --git a/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ2_M.gguf b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ2_M.gguf new file mode 100644 index 0000000..4b2b14f --- /dev/null +++ b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ2_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:2d4a59cbcc67636ec78019cd4cc98d7ebf3b05618985ce9d1bd0749764b9991d +size 2948282240 diff --git a/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ2_S.gguf b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ2_S.gguf new file mode 100644 index 0000000..ac0e91b --- /dev/null 
+++ b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ2_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:34a48c960968b429bd60c9bf7767542ba264a028ffc10d1d33f2d1408287e37b +size 2758489984 diff --git a/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ2_XS.gguf b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ2_XS.gguf new file mode 100644 index 0000000..7804cf2 --- /dev/null +++ b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ2_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:de99a15f11d8e497ca34ef368eb6bd2ec2d504fcdfb9fdf7546b00580961eee0 +size 2605782912 diff --git a/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ2_XXS.gguf b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ2_XXS.gguf new file mode 100644 index 0000000..3d2c458 --- /dev/null +++ b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ2_XXS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:1eecc42343387e64eb803e65528808877b55998c83b583ddd8afd4f28e9d3cd5 +size 2399213440 diff --git a/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ3_M.gguf b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ3_M.gguf new file mode 100644 index 0000000..84643a2 --- /dev/null +++ b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ3_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:d6e398bf891b7151907af0f0cb258fb7b14c538c17ab126148ca6e2e76f2113e +size 3784824704 diff --git a/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ3_S.gguf b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ3_S.gguf new file mode 100644 index 0000000..eac0b1b --- /dev/null +++ b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ3_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:f41682c6d19881bc228be7abf1ddc42b8ff0087be51332f612ad885565c93205 +size 3682326400 diff --git 
a/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ3_XS.gguf b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ3_XS.gguf new file mode 100644 index 0000000..fdd2567 --- /dev/null +++ b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ3_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:eb1627d93e4cc816abffbaf62346cb8bf1ca4ee7605003e81d1d5ec8dbb884d1 +size 3518748544 diff --git a/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ3_XXS.gguf b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ3_XXS.gguf new file mode 100644 index 0000000..64b9426 --- /dev/null +++ b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ3_XXS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:9121e8a72623c12befafe8c82801a052de3d0ec17afd2adabf237356e282d49f +size 3274913664 diff --git a/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ4_NL.gguf b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ4_NL.gguf new file mode 100644 index 0000000..3a4c7e6 --- /dev/null +++ b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ4_NL.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:ae845aee49312dba125e51d647f8340f5e4e63dccfee8517737890f442eed7c4 +size 4677990272 diff --git a/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ4_XS.gguf b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ4_XS.gguf new file mode 100644 index 0000000..67e787a --- /dev/null +++ b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ4_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:04834366e5daa119fdaa6b3cf2cebd9e275167b8fc93140640df80f69df2dddc +size 4447664000 diff --git a/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q2_K.gguf b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q2_K.gguf new file mode 100644 index 0000000..e283218 --- /dev/null +++ b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q2_K.gguf @@ -0,0 +1,3 
@@ +version https://git-lfs.github.com/spec/v1 +oid sha256:412b4289be3e6c4efa4bab73fd8e72b51e5afa72b78120db9fc985e67c963745 +size 3179132800 diff --git a/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q2_K_S.gguf b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q2_K_S.gguf new file mode 100644 index 0000000..89da975 --- /dev/null +++ b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q2_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8d6213bf847d68cb8dc60985fc2b24892bcf117522824b88ede19b96bebddc9b +size 2988816256 diff --git a/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q3_K_L.gguf b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q3_K_L.gguf new file mode 100644 index 0000000..3415c1b --- /dev/null +++ b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q3_K_L.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:816515cfe2f3d82a1f168423e3a35c0d31664b6d3d8d90a4c5e0b60ccb8feefc +size 4321957760 diff --git a/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q3_K_M.gguf b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q3_K_M.gguf new file mode 100644 index 0000000..c02e1db --- /dev/null +++ b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q3_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:2028431958c65b32dd61d3362a30f033eabfa5c1af3fbb6ef6cb00dafdc93e3a +size 4018919296 diff --git a/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q3_K_S.gguf b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q3_K_S.gguf new file mode 100644 index 0000000..d4973fb --- /dev/null +++ b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q3_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:b9effb4e78f3afe0f16ab96aa06af679cf35c7ed1cc8bdbde6f6f5b79ce1d500 +size 3664500608 diff --git a/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q4_0.gguf 
b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q4_0.gguf new file mode 100644 index 0000000..cd5dfd4 --- /dev/null +++ b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q4_0.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:1ce4cbd8b7acd9cac98d6346d04eb66161ac9454cd080f81d9fe962a5575e3dd +size 4675893120 diff --git a/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q4_1.gguf b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q4_1.gguf new file mode 100644 index 0000000..765efee --- /dev/null +++ b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q4_1.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:f49d96de0e364ae3c4057ec4b40a239247704fd85bec6c35828afe078410ab11 +size 5130254208 diff --git a/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q4_K_M.gguf b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q4_K_M.gguf new file mode 100644 index 0000000..55b4dd4 --- /dev/null +++ b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q4_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:d66cd3d67904d98ac5d7e3a5d3add9890beaefee22cff8643b78342efecca84e +size 4920735616 diff --git a/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q4_K_S.gguf b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q4_K_S.gguf new file mode 100644 index 0000000..c007e60 --- /dev/null +++ b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q4_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:f97eac0c5ab67dcab56fe622a0c2ead72ebdd3e16773540c8856a477b5525bd3 +size 4692670336 diff --git a/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q5_K_M.gguf b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q5_K_M.gguf new file mode 100644 index 0000000..a99e561 --- /dev/null +++ b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q5_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid 
sha256:25771fd2757fc39c9d7f026e129fa45b6a69d37cf720d8701bbe31b0a1293bbd +size 5732988800 diff --git a/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q5_K_S.gguf b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q5_K_S.gguf new file mode 100644 index 0000000..1b948cc --- /dev/null +++ b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q5_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:7cf0401363d9fbb8b56645447cbb0d66ee5d1ca830468b2f5a36f2382fcc239a +size 5599295360 diff --git a/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q6_K.gguf b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q6_K.gguf new file mode 100644 index 0000000..7f8d7c2 --- /dev/null +++ b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q6_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:7faf80f36f38f051488338d2d9f939a1598f3ddc536e48325033474bb10c8c4e +size 6596007808 diff --git a/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.imatrix.gguf b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.imatrix.gguf new file mode 100644 index 0000000..931e375 --- /dev/null +++ b/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.imatrix.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:b0b570524c3363645ac9238edbb282a9c25cba72c6a01f7d87eb1401db962940 +size 5015200 diff --git a/README.md b/README.md new file mode 100644 index 0000000..1dcdcd2 --- /dev/null +++ b/README.md @@ -0,0 +1,92 @@ +--- +base_model: ChiKoi7/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic +language: +- en +library_name: transformers +license: llama3.1 +mradermacher: + readme_rev: 1 +quantized_by: mradermacher +tags: +- Llama3.1 +- SuperNova +- Uncensored +- Heretic +- mergekit +- merge +--- +## About + + + + + + + + + +weighted/imatrix quants of https://huggingface.co/ChiKoi7/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic + + + +***For a convenient overview and download list, visit our [model page for this 
model](https://hf.tst.eu/model#Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic-i1-GGUF).*** + +static quants are available at https://huggingface.co/mradermacher/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic-GGUF +## Usage + +If you are unsure how to use GGUF files, refer to one of [TheBloke's +READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for +more details, including on how to concatenate multi-part files. + +## Provided Quants + +(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants) + +| Link | Type | Size/GB | Notes | +|:-----|:-----|--------:|:------| +| [GGUF](https://huggingface.co/mradermacher/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic-i1-GGUF/resolve/main/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.imatrix.gguf) | imatrix | 0.1 | imatrix file (for creating your own quants) | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic-i1-GGUF/resolve/main/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ1_S.gguf) | i1-IQ1_S | 2.1 | for the desperate | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic-i1-GGUF/resolve/main/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ1_M.gguf) | i1-IQ1_M | 2.3 | mostly desperate | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic-i1-GGUF/resolve/main/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 2.5 | | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic-i1-GGUF/resolve/main/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ2_XS.gguf) | i1-IQ2_XS | 2.7 | | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic-i1-GGUF/resolve/main/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ2_S.gguf) | i1-IQ2_S | 2.9 | | +| 
[GGUF](https://huggingface.co/mradermacher/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic-i1-GGUF/resolve/main/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ2_M.gguf) | i1-IQ2_M | 3.0 | | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic-i1-GGUF/resolve/main/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q2_K_S.gguf) | i1-Q2_K_S | 3.1 | very low quality | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic-i1-GGUF/resolve/main/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q2_K.gguf) | i1-Q2_K | 3.3 | IQ3_XXS probably better | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic-i1-GGUF/resolve/main/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 3.4 | lower quality | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic-i1-GGUF/resolve/main/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ3_XS.gguf) | i1-IQ3_XS | 3.6 | | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic-i1-GGUF/resolve/main/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q3_K_S.gguf) | i1-Q3_K_S | 3.8 | IQ3_XS probably better | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic-i1-GGUF/resolve/main/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ3_S.gguf) | i1-IQ3_S | 3.8 | beats Q3_K* | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic-i1-GGUF/resolve/main/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ3_M.gguf) | i1-IQ3_M | 3.9 | | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic-i1-GGUF/resolve/main/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q3_K_M.gguf) | i1-Q3_K_M | 4.1 | IQ3_S probably better | +| 
[GGUF](https://huggingface.co/mradermacher/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic-i1-GGUF/resolve/main/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q3_K_L.gguf) | i1-Q3_K_L | 4.4 | IQ3_M probably better | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic-i1-GGUF/resolve/main/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ4_XS.gguf) | i1-IQ4_XS | 4.5 | | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic-i1-GGUF/resolve/main/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q4_0.gguf) | i1-Q4_0 | 4.8 | fast, low quality | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic-i1-GGUF/resolve/main/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-IQ4_NL.gguf) | i1-IQ4_NL | 4.8 | prefer IQ4_XS | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic-i1-GGUF/resolve/main/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q4_K_S.gguf) | i1-Q4_K_S | 4.8 | optimal size/speed/quality | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic-i1-GGUF/resolve/main/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q4_K_M.gguf) | i1-Q4_K_M | 5.0 | fast, recommended | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic-i1-GGUF/resolve/main/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q4_1.gguf) | i1-Q4_1 | 5.2 | | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic-i1-GGUF/resolve/main/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q5_K_S.gguf) | i1-Q5_K_S | 5.7 | | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic-i1-GGUF/resolve/main/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q5_K_M.gguf) | i1-Q5_K_M | 5.8 | | +| 
[GGUF](https://huggingface.co/mradermacher/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic-i1-GGUF/resolve/main/Llama-3.1-SuperNova-8B-Lite_TIES_with_Base-Heretic.i1-Q6_K.gguf) | i1-Q6_K | 6.7 | practically like static Q6_K | + +Here is a handy graph by ikawrakow comparing some lower-quality quant +types (lower is better): + +![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png) + +And here are Artefact2's thoughts on the matter: +https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9 + +## FAQ / Model Request + +See https://huggingface.co/mradermacher/model_requests for some answers to +questions you might have and/or if you want some other model quantized. + +## Thanks + +I thank my company, [nethype GmbH](https://www.nethype.de/), for letting +me use its servers and providing upgrades to my workstation to enable +this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to. + +