commit b58bb96e6f1be9aacb1a1bda625f24a649e1f470 Author: ModelHub XC Date: Mon May 18 22:12:50 2026 +0800 初始化项目,由ModelHub XC社区提供模型 Model: mradermacher/MFANN-llama3.1-abliterated-SLERP-v3.1-i1-GGUF Source: Original Platform diff --git a/.gitattributes b/.gitattributes new file mode 100644 index 0000000..b03a039 --- /dev/null +++ b/.gitattributes @@ -0,0 +1,60 @@ +*.7z filter=lfs diff=lfs merge=lfs -text +*.arrow filter=lfs diff=lfs merge=lfs -text +*.bin filter=lfs diff=lfs merge=lfs -text +*.bz2 filter=lfs diff=lfs merge=lfs -text +*.ckpt filter=lfs diff=lfs merge=lfs -text +*.ftz filter=lfs diff=lfs merge=lfs -text +*.gz filter=lfs diff=lfs merge=lfs -text +*.h5 filter=lfs diff=lfs merge=lfs -text +*.joblib filter=lfs diff=lfs merge=lfs -text +*.lfs.* filter=lfs diff=lfs merge=lfs -text +*.mlmodel filter=lfs diff=lfs merge=lfs -text +*.model filter=lfs diff=lfs merge=lfs -text +*.msgpack filter=lfs diff=lfs merge=lfs -text +*.npy filter=lfs diff=lfs merge=lfs -text +*.npz filter=lfs diff=lfs merge=lfs -text +*.onnx filter=lfs diff=lfs merge=lfs -text +*.ot filter=lfs diff=lfs merge=lfs -text +*.parquet filter=lfs diff=lfs merge=lfs -text +*.pb filter=lfs diff=lfs merge=lfs -text +*.pickle filter=lfs diff=lfs merge=lfs -text +*.pkl filter=lfs diff=lfs merge=lfs -text +*.pt filter=lfs diff=lfs merge=lfs -text +*.pth filter=lfs diff=lfs merge=lfs -text +*.rar filter=lfs diff=lfs merge=lfs -text +*.safetensors filter=lfs diff=lfs merge=lfs -text +saved_model/**/* filter=lfs diff=lfs merge=lfs -text +*.tar.* filter=lfs diff=lfs merge=lfs -text +*.tar filter=lfs diff=lfs merge=lfs -text +*.tflite filter=lfs diff=lfs merge=lfs -text +*.tgz filter=lfs diff=lfs merge=lfs -text +*.wasm filter=lfs diff=lfs merge=lfs -text +*.xz filter=lfs diff=lfs merge=lfs -text +*.zip filter=lfs diff=lfs merge=lfs -text +*.zst filter=lfs diff=lfs merge=lfs -text +*tfevents* filter=lfs diff=lfs merge=lfs -text +imatrix.dat filter=lfs diff=lfs merge=lfs -text +MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text +MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text +MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text +MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text +MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text +MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text +MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text +MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text +MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text +MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text +MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text +MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text +MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text +MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text +MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text +MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text +MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q4_0_4_4.gguf filter=lfs diff=lfs merge=lfs -text +MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text +MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text +MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text +MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text +MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q4_0_4_8.gguf filter=lfs diff=lfs merge=lfs -text +MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q4_0_8_8.gguf filter=lfs diff=lfs merge=lfs -text +MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text diff --git a/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ1_M.gguf b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ1_M.gguf new file mode 100644 index 0000000..e21c760 --- /dev/null +++ b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ1_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:1482af9e3a524aeca369410360b508d05dc58494c8eb2ad7da4ac04c9e4a8441 +size 2161973760 diff --git a/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ1_S.gguf b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ1_S.gguf new file mode 100644 index 0000000..f623213 --- /dev/null +++ b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ1_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:336c478725bd565aca4a65c330021c36fc2d5c07bfa05a9f3b614342b609927b +size 2019629568 diff --git a/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ2_M.gguf b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ2_M.gguf new file mode 100644 index 0000000..2a99aed --- /dev/null +++ b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ2_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:29c4895df059aa0462ff0eb07c91a5dd18ce7ee1709f8f1f6ec2eed734bc1fa5 +size 2948282880 diff --git a/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ2_S.gguf b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ2_S.gguf new file mode 100644 index 0000000..75c8c3c --- /dev/null +++ b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ2_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:50062fb053ca5e879ce8314a4a64eef343210dda5d378d2b6be8fae93ab0ed00 +size 2758490624 diff --git a/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ2_XS.gguf b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ2_XS.gguf new file mode 100644 index 0000000..917da73 --- /dev/null +++ b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ2_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:071de372c03109720f15d3a12eb868727e9b0edc027de08c30f6ab49baf5c418 +size 2605783552 diff --git a/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ2_XXS.gguf b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ2_XXS.gguf new file mode 100644 index 0000000..1bdc002 --- /dev/null +++ b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ2_XXS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8e841b09b85f095f9d506c0baa3a700dd5f32a714a3c1fb43b84e37852b20dd6 +size 2399214080 diff --git a/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ3_M.gguf b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ3_M.gguf new file mode 100644 index 0000000..041d59f --- /dev/null +++ b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ3_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:371b8883672b2c3a2f309afc6db8ef8c4fd47c89c5c37b6354ceb71f43f9b238 +size 3784825344 diff --git a/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ3_S.gguf b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ3_S.gguf new file mode 100644 index 0000000..6e22cf8 --- /dev/null +++ b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ3_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:7674ba69eee807e7b6895f107e2cafa2007e960556ba67063ace57c67b4b9973 +size 3682327040 diff --git a/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ3_XS.gguf b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ3_XS.gguf new file mode 100644 index 0000000..2289bcf --- /dev/null +++ b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ3_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:f067671334ef636af71b00f6c1ca7ccc6415064fdbd7aa78b23d445d2d0de088 +size 3518749184 diff --git a/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ3_XXS.gguf b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ3_XXS.gguf new file mode 100644 index 0000000..a50da20 --- /dev/null +++ b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ3_XXS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:913edad5550709310e60f802c781bbd3ca199fe718874800438b52b6c159cb90 +size 3274914304 diff --git a/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ4_XS.gguf b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ4_XS.gguf new file mode 100644 index 0000000..53b0df2 --- /dev/null +++ b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ4_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:821be77dee20b1c1015322ea81c05d6b8aca1bfb1f39b24c06864339233b41b3 +size 4447664640 diff --git a/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q2_K.gguf b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q2_K.gguf new file mode 100644 index 0000000..76e332c --- /dev/null +++ b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q2_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:e6294d5a2bb6711b8be9789b78b9d0af9bd82db5dda53ad35d44e389862c410c +size 3179133440 diff --git a/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q3_K_L.gguf b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q3_K_L.gguf new file mode 100644 index 0000000..774dd2b --- /dev/null +++ b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q3_K_L.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:630731a0cb8694daa4c39ad6e04856f7cc319816722bf3ee8fbd3a8ce4f9a5cd +size 4321958400 diff --git a/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q3_K_M.gguf b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q3_K_M.gguf new file mode 100644 index 0000000..74cac01 --- /dev/null +++ b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q3_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:9a8720bf9ede2861d9dd8eb122ddc81c89b7b7806d3c906d73718d3ee27e5902 +size 4018919936 diff --git a/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q3_K_S.gguf b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q3_K_S.gguf new file mode 100644 index 0000000..a99c90e --- /dev/null +++ b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q3_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:1578b65161b1e337a8d4f61ab3c7e84028995d9286639fa0fddaaecdfcf59b68 +size 3664501248 diff --git a/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q4_0.gguf b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q4_0.gguf new file mode 100644 index 0000000..c5bb576 --- /dev/null +++ b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q4_0.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:b8510a62eef7b85be826bc20c5d7f04504e536711b5e0f218e5556074f0d8315 +size 4675893760 diff --git a/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q4_0_4_4.gguf b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q4_0_4_4.gguf new file mode 100644 index 0000000..34732f2 --- /dev/null +++ b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q4_0_4_4.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:71111af8440ff793be04fa850159c59c87190508341feb3e6f5b9309e7a9b77e +size 4661213696 diff --git a/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q4_0_4_8.gguf b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q4_0_4_8.gguf new file mode 100644 index 0000000..470a36c --- /dev/null +++ b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q4_0_4_8.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:ce2481ad3b27500f837e12b99a90e4834e86cc1ebf43163e80f40886a9738874 +size 4661213696 diff --git a/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q4_0_8_8.gguf b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q4_0_8_8.gguf new file mode 100644 index 0000000..87d2b87 --- /dev/null +++ b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q4_0_8_8.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:6a3ef17a12d94640d39999a7e813aa22b8caa699ac510552057f2fbcb330deaf +size 4661213696 diff --git a/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q4_K_M.gguf b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q4_K_M.gguf new file mode 100644 index 0000000..72feed7 --- /dev/null +++ b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q4_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:5a4f4ec0c4d92afb21c6c608e7d29ca608fc3a6eab88bd1918ba67b214ed96ef +size 4920736256 diff --git a/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q4_K_S.gguf b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q4_K_S.gguf new file mode 100644 index 0000000..f0be451 --- /dev/null +++ b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q4_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:e4321f695c77eb5990e85a51da4fba33a5bea9a6bb06db46e5ca656dbb3d02f6 +size 4692670976 diff --git a/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q5_K_M.gguf b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q5_K_M.gguf new file mode 100644 index 0000000..14c9f1e --- /dev/null +++ b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q5_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:0aa4708d8df8b83ce87f795b8f7895efe391d79427ec995788d5e4595ea6eb10 +size 5732989440 diff --git a/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q5_K_S.gguf b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q5_K_S.gguf new file mode 100644 index 0000000..1330d63 --- /dev/null +++ b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q5_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:0c75c66c248f274697486c5daf40ceb33e3970dae20848184204d93c774f2776 +size 5599296000 diff --git a/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q6_K.gguf b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q6_K.gguf new file mode 100644 index 0000000..075794b --- /dev/null +++ b/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q6_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:cc43a433df6d2524d65c22f974f2ad3deb702f79d9915638c6beab7b3ff75583 +size 6596008448 diff --git a/README.md b/README.md new file mode 100644 index 0000000..d2ca4e6 --- /dev/null +++ b/README.md @@ -0,0 +1,84 @@ +--- +base_model: netcat420/MFANN-llama3.1-abliterated-SLERP-v3.1 +datasets: +- netcat420/MFANN +language: +- en +library_name: transformers +license: llama3.1 +quantized_by: mradermacher +tags: +- merge +- mergekit +- lazymergekit +- netcat420/MFANN-llama3.1-abliterated-v2 +- netcat420/MFANN-llama3.1-abliterated-SLERP-v3 +--- +## About + + + + + + +weighted/imatrix quants of https://huggingface.co/netcat420/MFANN-llama3.1-abliterated-SLERP-v3.1 + + +static quants are available at https://huggingface.co/mradermacher/MFANN-llama3.1-abliterated-SLERP-v3.1-GGUF +## Usage + +If you are unsure how to use GGUF files, refer to one of [TheBloke's +READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for +more details, including on how to concatenate multi-part files. + +## Provided Quants + +(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants) + +| Link | Type | Size/GB | Notes | +|:-----|:-----|--------:|:------| +| [GGUF](https://huggingface.co/mradermacher/MFANN-llama3.1-abliterated-SLERP-v3.1-i1-GGUF/resolve/main/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ1_S.gguf) | i1-IQ1_S | 2.1 | for the desperate | +| [GGUF](https://huggingface.co/mradermacher/MFANN-llama3.1-abliterated-SLERP-v3.1-i1-GGUF/resolve/main/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ1_M.gguf) | i1-IQ1_M | 2.3 | mostly desperate | +| [GGUF](https://huggingface.co/mradermacher/MFANN-llama3.1-abliterated-SLERP-v3.1-i1-GGUF/resolve/main/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 2.5 | | +| [GGUF](https://huggingface.co/mradermacher/MFANN-llama3.1-abliterated-SLERP-v3.1-i1-GGUF/resolve/main/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ2_XS.gguf) | i1-IQ2_XS | 2.7 | | +| [GGUF](https://huggingface.co/mradermacher/MFANN-llama3.1-abliterated-SLERP-v3.1-i1-GGUF/resolve/main/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ2_S.gguf) | i1-IQ2_S | 2.9 | | +| [GGUF](https://huggingface.co/mradermacher/MFANN-llama3.1-abliterated-SLERP-v3.1-i1-GGUF/resolve/main/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ2_M.gguf) | i1-IQ2_M | 3.0 | | +| [GGUF](https://huggingface.co/mradermacher/MFANN-llama3.1-abliterated-SLERP-v3.1-i1-GGUF/resolve/main/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q2_K.gguf) | i1-Q2_K | 3.3 | IQ3_XXS probably better | +| [GGUF](https://huggingface.co/mradermacher/MFANN-llama3.1-abliterated-SLERP-v3.1-i1-GGUF/resolve/main/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 3.4 | lower quality | +| [GGUF](https://huggingface.co/mradermacher/MFANN-llama3.1-abliterated-SLERP-v3.1-i1-GGUF/resolve/main/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ3_XS.gguf) | i1-IQ3_XS | 3.6 | | +| [GGUF](https://huggingface.co/mradermacher/MFANN-llama3.1-abliterated-SLERP-v3.1-i1-GGUF/resolve/main/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q3_K_S.gguf) | i1-Q3_K_S | 3.8 | IQ3_XS probably better | +| [GGUF](https://huggingface.co/mradermacher/MFANN-llama3.1-abliterated-SLERP-v3.1-i1-GGUF/resolve/main/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ3_S.gguf) | i1-IQ3_S | 3.8 | beats Q3_K* | +| [GGUF](https://huggingface.co/mradermacher/MFANN-llama3.1-abliterated-SLERP-v3.1-i1-GGUF/resolve/main/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ3_M.gguf) | i1-IQ3_M | 3.9 | | +| [GGUF](https://huggingface.co/mradermacher/MFANN-llama3.1-abliterated-SLERP-v3.1-i1-GGUF/resolve/main/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q3_K_M.gguf) | i1-Q3_K_M | 4.1 | IQ3_S probably better | +| [GGUF](https://huggingface.co/mradermacher/MFANN-llama3.1-abliterated-SLERP-v3.1-i1-GGUF/resolve/main/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q3_K_L.gguf) | i1-Q3_K_L | 4.4 | IQ3_M probably better | +| [GGUF](https://huggingface.co/mradermacher/MFANN-llama3.1-abliterated-SLERP-v3.1-i1-GGUF/resolve/main/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-IQ4_XS.gguf) | i1-IQ4_XS | 4.5 | | +| [GGUF](https://huggingface.co/mradermacher/MFANN-llama3.1-abliterated-SLERP-v3.1-i1-GGUF/resolve/main/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q4_0_4_4.gguf) | i1-Q4_0_4_4 | 4.8 | fast on arm, low quality | +| [GGUF](https://huggingface.co/mradermacher/MFANN-llama3.1-abliterated-SLERP-v3.1-i1-GGUF/resolve/main/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q4_0_4_8.gguf) | i1-Q4_0_4_8 | 4.8 | fast on arm+i8mm, low quality | +| [GGUF](https://huggingface.co/mradermacher/MFANN-llama3.1-abliterated-SLERP-v3.1-i1-GGUF/resolve/main/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q4_0_8_8.gguf) | i1-Q4_0_8_8 | 4.8 | fast on arm+sve, low quality | +| [GGUF](https://huggingface.co/mradermacher/MFANN-llama3.1-abliterated-SLERP-v3.1-i1-GGUF/resolve/main/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q4_0.gguf) | i1-Q4_0 | 4.8 | fast, low quality | +| [GGUF](https://huggingface.co/mradermacher/MFANN-llama3.1-abliterated-SLERP-v3.1-i1-GGUF/resolve/main/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q4_K_S.gguf) | i1-Q4_K_S | 4.8 | optimal size/speed/quality | +| [GGUF](https://huggingface.co/mradermacher/MFANN-llama3.1-abliterated-SLERP-v3.1-i1-GGUF/resolve/main/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q4_K_M.gguf) | i1-Q4_K_M | 5.0 | fast, recommended | +| [GGUF](https://huggingface.co/mradermacher/MFANN-llama3.1-abliterated-SLERP-v3.1-i1-GGUF/resolve/main/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q5_K_S.gguf) | i1-Q5_K_S | 5.7 | | +| [GGUF](https://huggingface.co/mradermacher/MFANN-llama3.1-abliterated-SLERP-v3.1-i1-GGUF/resolve/main/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q5_K_M.gguf) | i1-Q5_K_M | 5.8 | | +| [GGUF](https://huggingface.co/mradermacher/MFANN-llama3.1-abliterated-SLERP-v3.1-i1-GGUF/resolve/main/MFANN-llama3.1-abliterated-SLERP-v3.1.i1-Q6_K.gguf) | i1-Q6_K | 6.7 | practically like static Q6_K | + +Here is a handy graph by ikawrakow comparing some lower-quality quant +types (lower is better): + +![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png) + +And here are Artefact2's thoughts on the matter: +https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9 + +## FAQ / Model Request + +See https://huggingface.co/mradermacher/model_requests for some answers to +questions you might have and/or if you want some other model quantized. + +## Thanks + +I thank my company, [nethype GmbH](https://www.nethype.de/), for letting +me use its servers and providing upgrades to my workstation to enable +this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to. + + diff --git a/imatrix.dat b/imatrix.dat new file mode 100644 index 0000000..199c3d8 --- /dev/null +++ b/imatrix.dat @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:d7ba1933d5627416d86e6048b1b6070923bd787868baf9307dbe5b21641d6a33 +size 4988157