commit 22cc22c4abcd9223feda1589570776438c59c8f6
Author: ModelHub XC
Date:   Wed May 13 20:52:21 2026 +0800

    Initialize project; model provided by the ModelHub XC community

    Model: mradermacher/Geneva-12B-GCv2-100k-i1-GGUF
    Source: Original Platform

diff --git a/.gitattributes b/.gitattributes
new file mode 100644
index 0000000..2050495
--- /dev/null
+++ b/.gitattributes
@@ -0,0 +1,60 @@
+*.7z filter=lfs diff=lfs merge=lfs -text
+*.arrow filter=lfs diff=lfs merge=lfs -text
+*.bin filter=lfs diff=lfs merge=lfs -text
+*.bz2 filter=lfs diff=lfs merge=lfs -text
+*.ckpt filter=lfs diff=lfs merge=lfs -text
+*.ftz filter=lfs diff=lfs merge=lfs -text
+*.gz filter=lfs diff=lfs merge=lfs -text
+*.h5 filter=lfs diff=lfs merge=lfs -text
+*.joblib filter=lfs diff=lfs merge=lfs -text
+*.lfs.* filter=lfs diff=lfs merge=lfs -text
+*.mlmodel filter=lfs diff=lfs merge=lfs -text
+*.model filter=lfs diff=lfs merge=lfs -text
+*.msgpack filter=lfs diff=lfs merge=lfs -text
+*.npy filter=lfs diff=lfs merge=lfs -text
+*.npz filter=lfs diff=lfs merge=lfs -text
+*.onnx filter=lfs diff=lfs merge=lfs -text
+*.ot filter=lfs diff=lfs merge=lfs -text
+*.parquet filter=lfs diff=lfs merge=lfs -text
+*.pb filter=lfs diff=lfs merge=lfs -text
+*.pickle filter=lfs diff=lfs merge=lfs -text
+*.pkl filter=lfs diff=lfs merge=lfs -text
+*.pt filter=lfs diff=lfs merge=lfs -text
+*.pth filter=lfs diff=lfs merge=lfs -text
+*.rar filter=lfs diff=lfs merge=lfs -text
+*.safetensors filter=lfs diff=lfs merge=lfs -text
+saved_model/**/* filter=lfs diff=lfs merge=lfs -text
+*.tar.* filter=lfs diff=lfs merge=lfs -text
+*.tar filter=lfs diff=lfs merge=lfs -text
+*.tflite filter=lfs diff=lfs merge=lfs -text
+*.tgz filter=lfs diff=lfs merge=lfs -text
+*.wasm filter=lfs diff=lfs merge=lfs -text
+*.xz filter=lfs diff=lfs merge=lfs -text
+*.zip filter=lfs diff=lfs merge=lfs -text
+*.zst filter=lfs diff=lfs merge=lfs -text
+*tfevents* filter=lfs diff=lfs merge=lfs -text
+imatrix.dat filter=lfs diff=lfs merge=lfs -text
+Geneva-12B-GCv2-100k.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
+Geneva-12B-GCv2-100k.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text
+Geneva-12B-GCv2-100k.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+Geneva-12B-GCv2-100k.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text
+Geneva-12B-GCv2-100k.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+Geneva-12B-GCv2-100k.i1-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text
+Geneva-12B-GCv2-100k.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+Geneva-12B-GCv2-100k.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text
+Geneva-12B-GCv2-100k.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
+Geneva-12B-GCv2-100k.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
+Geneva-12B-GCv2-100k.i1-Q2_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+Geneva-12B-GCv2-100k.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text
+Geneva-12B-GCv2-100k.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+Geneva-12B-GCv2-100k.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text
+Geneva-12B-GCv2-100k.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
+Geneva-12B-GCv2-100k.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text
+Geneva-12B-GCv2-100k.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+Geneva-12B-GCv2-100k.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text
+Geneva-12B-GCv2-100k.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text
+Geneva-12B-GCv2-100k.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+Geneva-12B-GCv2-100k.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
+Geneva-12B-GCv2-100k.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text
+Geneva-12B-GCv2-100k.i1-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text
+Geneva-12B-GCv2-100k.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text
diff --git a/Geneva-12B-GCv2-100k.i1-IQ1_M.gguf b/Geneva-12B-GCv2-100k.i1-IQ1_M.gguf
new file mode 100644
index 0000000..0c8d467
--- /dev/null
+++ b/Geneva-12B-GCv2-100k.i1-IQ1_M.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:fceb67427c34875da9518f76053fb45c20dbe844feba6493fa903eb7f2088b14
+size 3221629280
diff --git a/Geneva-12B-GCv2-100k.i1-IQ1_S.gguf b/Geneva-12B-GCv2-100k.i1-IQ1_S.gguf
new file mode 100644
index 0000000..d2599fa
--- /dev/null
+++ b/Geneva-12B-GCv2-100k.i1-IQ1_S.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:93b59b7710a5ab365ececd0ee996ede4d1faeca14a18c251c5d322bff3a2625b
+size 2999216480
diff --git a/Geneva-12B-GCv2-100k.i1-IQ2_M.gguf b/Geneva-12B-GCv2-100k.i1-IQ2_M.gguf
new file mode 100644
index 0000000..f35efee
--- /dev/null
+++ b/Geneva-12B-GCv2-100k.i1-IQ2_M.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:cc9d713a0caa3f0d597ee3f1955b81cafeb0ccd4d14a2a862245528376e208c6
+size 4435028320
diff --git a/Geneva-12B-GCv2-100k.i1-IQ2_S.gguf b/Geneva-12B-GCv2-100k.i1-IQ2_S.gguf
new file mode 100644
index 0000000..3478656
--- /dev/null
+++ b/Geneva-12B-GCv2-100k.i1-IQ2_S.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:20048c2f2ba784e5b37ba7a98c56db045dd69b94aef0702a26c507a7919ba982
+size 4138477920
diff --git a/Geneva-12B-GCv2-100k.i1-IQ2_XS.gguf b/Geneva-12B-GCv2-100k.i1-IQ2_XS.gguf
new file mode 100644
index 0000000..cdb851a
--- /dev/null
+++ b/Geneva-12B-GCv2-100k.i1-IQ2_XS.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:df5df7da5647dfc2eaea33097d9cf23b015128b1ed9ffa112746b0b0c134cad6
+size 3915082080
diff --git a/Geneva-12B-GCv2-100k.i1-IQ2_XXS.gguf b/Geneva-12B-GCv2-100k.i1-IQ2_XXS.gguf
new file mode 100644
index 0000000..444d618
--- /dev/null
+++ b/Geneva-12B-GCv2-100k.i1-IQ2_XXS.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:6fea9bdd934341ab9cbaa518054ff9a06e9ce1911aadec1fa5558cdb904563f2
+size 3592317280
diff --git a/Geneva-12B-GCv2-100k.i1-IQ3_M.gguf b/Geneva-12B-GCv2-100k.i1-IQ3_M.gguf
new file mode 100644
index 0000000..e25cb16
--- /dev/null
+++ b/Geneva-12B-GCv2-100k.i1-IQ3_M.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:cdcb408646110203273d8686fa589bc50166a0f04335ae2acf52ce26c2131b7d
+size 5722237280
diff --git a/Geneva-12B-GCv2-100k.i1-IQ3_S.gguf b/Geneva-12B-GCv2-100k.i1-IQ3_S.gguf
new file mode 100644
index 0000000..443587e
--- /dev/null
+++ b/Geneva-12B-GCv2-100k.i1-IQ3_S.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:dee18b5f9b839dbec5dd40d1fe3008efd33595fc0ab9fcc874cd9288ee5c5acd
+size 5562083680
diff --git a/Geneva-12B-GCv2-100k.i1-IQ3_XS.gguf b/Geneva-12B-GCv2-100k.i1-IQ3_XS.gguf
new file mode 100644
index 0000000..40a5082
--- /dev/null
+++ b/Geneva-12B-GCv2-100k.i1-IQ3_XS.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:669cdc26771e917176c27ac5c6617eb8983411ed0ee8a05ac67535731bfc6ed4
+size 5306493280
diff --git a/Geneva-12B-GCv2-100k.i1-IQ3_XXS.gguf b/Geneva-12B-GCv2-100k.i1-IQ3_XXS.gguf
new file mode 100644
index 0000000..9951837
--- /dev/null
+++ b/Geneva-12B-GCv2-100k.i1-IQ3_XXS.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:8b0bae2f44d8e8cb032bf05b13cc665f029115f360c255494a3ecfa8ee38071a
+size 4945389920
diff --git a/Geneva-12B-GCv2-100k.i1-IQ4_NL.gguf b/Geneva-12B-GCv2-100k.i1-IQ4_NL.gguf
new file mode 100644
index 0000000..f0f2d04
--- /dev/null
+++ b/Geneva-12B-GCv2-100k.i1-IQ4_NL.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:8c425e8a4baeece0177a12235cf0553677addcba6a778aceffe210ceff9f2aca
+size 7097919840
diff --git a/Geneva-12B-GCv2-100k.i1-IQ4_XS.gguf b/Geneva-12B-GCv2-100k.i1-IQ4_XS.gguf
new file mode 100644
index 0000000..8a48ae9
--- /dev/null
+++ b/Geneva-12B-GCv2-100k.i1-IQ4_XS.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:801c44d4aeed3abfce29d8fc40ffd27431f1006db4c8d708559c5818b8dd7578
+size 6742714720
diff --git a/Geneva-12B-GCv2-100k.i1-Q2_K.gguf b/Geneva-12B-GCv2-100k.i1-Q2_K.gguf
new file mode 100644
index 0000000..38d581c
--- /dev/null
+++ b/Geneva-12B-GCv2-100k.i1-Q2_K.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:7dad42e4b952d82b04e95d68ed0e162f67a78baa5d16bd85cd45aa885fb8086d
+size 4791052640
diff --git a/Geneva-12B-GCv2-100k.i1-Q2_K_S.gguf b/Geneva-12B-GCv2-100k.i1-Q2_K_S.gguf
new file mode 100644
index 0000000..eb3bd77
--- /dev/null
+++ b/Geneva-12B-GCv2-100k.i1-Q2_K_S.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:6b4ddc88521e6f2f16657e53acc8dafe60ce319bdf01d09ac9ea8f2132e27429
+size 4493683040
diff --git a/Geneva-12B-GCv2-100k.i1-Q3_K_L.gguf b/Geneva-12B-GCv2-100k.i1-Q3_K_L.gguf
new file mode 100644
index 0000000..844cf1f
--- /dev/null
+++ b/Geneva-12B-GCv2-100k.i1-Q3_K_L.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:f4a214236a1658aa93afa4afb69855936feb6545d5bb1a8c2945cb8fe5261503
+size 6561507680
diff --git a/Geneva-12B-GCv2-100k.i1-Q3_K_M.gguf b/Geneva-12B-GCv2-100k.i1-Q3_K_M.gguf
new file mode 100644
index 0000000..5368c6d
--- /dev/null
+++ b/Geneva-12B-GCv2-100k.i1-Q3_K_M.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:f488fab05030265df513c6ec266efed05dc2c652c4d5bcfe36721fa0c35ee6a4
+size 6083094880
diff --git a/Geneva-12B-GCv2-100k.i1-Q3_K_S.gguf b/Geneva-12B-GCv2-100k.i1-Q3_K_S.gguf
new file mode 100644
index 0000000..2de2875
--- /dev/null
+++ b/Geneva-12B-GCv2-100k.i1-Q3_K_S.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:6cb605d36dea37120dc515e744615485046c0902fc1a99c75ec6a0a3ec9a0c36
+size 5534230880
diff --git a/Geneva-12B-GCv2-100k.i1-Q4_0.gguf b/Geneva-12B-GCv2-100k.i1-Q4_0.gguf
new file mode 100644
index 0000000..00f1974
--- /dev/null
+++ b/Geneva-12B-GCv2-100k.i1-Q4_0.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:86c7a3e727940a9e6b65d7ad8575effb3f0ab706b8d82de7cc4ad023726c4d4c
+size 7094643040
diff --git a/Geneva-12B-GCv2-100k.i1-Q4_1.gguf b/Geneva-12B-GCv2-100k.i1-Q4_1.gguf
new file mode 100644
index 0000000..86dc7b8
--- /dev/null
+++ b/Geneva-12B-GCv2-100k.i1-Q4_1.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:616bd4ddf83e0ddca4c64c4324a514bb213ee737ceb63bd325d2087c6052875f
+size 7795222880
diff --git a/Geneva-12B-GCv2-100k.i1-Q4_K_M.gguf b/Geneva-12B-GCv2-100k.i1-Q4_K_M.gguf
new file mode 100644
index 0000000..429325d
--- /dev/null
+++ b/Geneva-12B-GCv2-100k.i1-Q4_K_M.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:b29284025dbdd5faa9674bad5278d2a5cde325537a49d75a7c7053ad2604e0e6
+size 7477209440
diff --git a/Geneva-12B-GCv2-100k.i1-Q4_K_S.gguf b/Geneva-12B-GCv2-100k.i1-Q4_K_S.gguf
new file mode 100644
index 0000000..21ff840
--- /dev/null
+++ b/Geneva-12B-GCv2-100k.i1-Q4_K_S.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:f3ef87f091a1bfeb8fbeef25bb07dfb837b84d70963179227a4768459bd8e904
+size 7120202080
diff --git a/Geneva-12B-GCv2-100k.i1-Q5_K_M.gguf b/Geneva-12B-GCv2-100k.i1-Q5_K_M.gguf
new file mode 100644
index 0000000..ed66480
--- /dev/null
+++ b/Geneva-12B-GCv2-100k.i1-Q5_K_M.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:5a91806fa098f8b5ef5dffba0f0585d4c3b0aaa835818ddeebeec538f2f5e493
+size 8727636320
diff --git a/Geneva-12B-GCv2-100k.i1-Q5_K_S.gguf b/Geneva-12B-GCv2-100k.i1-Q5_K_S.gguf
new file mode 100644
index 0000000..2faaef1
--- /dev/null
+++ b/Geneva-12B-GCv2-100k.i1-Q5_K_S.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:bd061672858906e2927edd725ee75f1e9b9dc97ac9b5725dd61ca8b077614aea
+size 8518740320
diff --git a/Geneva-12B-GCv2-100k.i1-Q6_K.gguf b/Geneva-12B-GCv2-100k.i1-Q6_K.gguf
new file mode 100644
index 0000000..c710150
--- /dev/null
+++ b/Geneva-12B-GCv2-100k.i1-Q6_K.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:d1dc04e92b26a759313dc672a3e36e74f29bc8ef216fed9088e24b1eecdfc44c
+size 10056214880
diff --git a/README.md b/README.md
new file mode 100644
index 0000000..72ccd9c
--- /dev/null
+++ b/README.md
@@ -0,0 +1,96 @@
+---
+base_model: rubenroy/Geneva-12B-GCv2-100k
+datasets:
+- rubenroy/GammaCorpus-v2-100k
+language:
+- en
+- fr
+- de
+- es
+- it
+- pt
+- ru
+- zh
+- ja
+library_name: transformers
+license: apache-2.0
+quantized_by: mradermacher
+tags:
+- text-generation-inference
+- transformers
+- unsloth
+- trl
+- gammacorpus
+- geneva
+- chat
+- mistral
+- conversational
+---
+## About
+
+
+
+
+
+
+weighted/imatrix quants of https://huggingface.co/rubenroy/Geneva-12B-GCv2-100k
+
+
+static quants are available at https://huggingface.co/mradermacher/Geneva-12B-GCv2-100k-GGUF
+## Usage
+
+If you are unsure how to use GGUF files, refer to one of [TheBloke's
+READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for
+more details, including on how to concatenate multi-part files.
+
+## Provided Quants
+
+(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)
+
+| Link | Type | Size/GB | Notes |
+|:-----|:-----|--------:|:------|
+| [GGUF](https://huggingface.co/mradermacher/Geneva-12B-GCv2-100k-i1-GGUF/resolve/main/Geneva-12B-GCv2-100k.i1-IQ1_S.gguf) | i1-IQ1_S | 3.1 | for the desperate |
+| [GGUF](https://huggingface.co/mradermacher/Geneva-12B-GCv2-100k-i1-GGUF/resolve/main/Geneva-12B-GCv2-100k.i1-IQ1_M.gguf) | i1-IQ1_M | 3.3 | mostly desperate |
+| [GGUF](https://huggingface.co/mradermacher/Geneva-12B-GCv2-100k-i1-GGUF/resolve/main/Geneva-12B-GCv2-100k.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 3.7 |  |
+| [GGUF](https://huggingface.co/mradermacher/Geneva-12B-GCv2-100k-i1-GGUF/resolve/main/Geneva-12B-GCv2-100k.i1-IQ2_XS.gguf) | i1-IQ2_XS | 4.0 |  |
+| [GGUF](https://huggingface.co/mradermacher/Geneva-12B-GCv2-100k-i1-GGUF/resolve/main/Geneva-12B-GCv2-100k.i1-IQ2_S.gguf) | i1-IQ2_S | 4.2 |  |
+| [GGUF](https://huggingface.co/mradermacher/Geneva-12B-GCv2-100k-i1-GGUF/resolve/main/Geneva-12B-GCv2-100k.i1-IQ2_M.gguf) | i1-IQ2_M | 4.5 |  |
+| [GGUF](https://huggingface.co/mradermacher/Geneva-12B-GCv2-100k-i1-GGUF/resolve/main/Geneva-12B-GCv2-100k.i1-Q2_K_S.gguf) | i1-Q2_K_S | 4.6 | very low quality |
+| [GGUF](https://huggingface.co/mradermacher/Geneva-12B-GCv2-100k-i1-GGUF/resolve/main/Geneva-12B-GCv2-100k.i1-Q2_K.gguf) | i1-Q2_K | 4.9 | IQ3_XXS probably better |
+| [GGUF](https://huggingface.co/mradermacher/Geneva-12B-GCv2-100k-i1-GGUF/resolve/main/Geneva-12B-GCv2-100k.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 5.0 | lower quality |
+| [GGUF](https://huggingface.co/mradermacher/Geneva-12B-GCv2-100k-i1-GGUF/resolve/main/Geneva-12B-GCv2-100k.i1-IQ3_XS.gguf) | i1-IQ3_XS | 5.4 |  |
+| [GGUF](https://huggingface.co/mradermacher/Geneva-12B-GCv2-100k-i1-GGUF/resolve/main/Geneva-12B-GCv2-100k.i1-Q3_K_S.gguf) | i1-Q3_K_S | 5.6 | IQ3_XS probably better |
+| [GGUF](https://huggingface.co/mradermacher/Geneva-12B-GCv2-100k-i1-GGUF/resolve/main/Geneva-12B-GCv2-100k.i1-IQ3_S.gguf) | i1-IQ3_S | 5.7 | beats Q3_K* |
+| [GGUF](https://huggingface.co/mradermacher/Geneva-12B-GCv2-100k-i1-GGUF/resolve/main/Geneva-12B-GCv2-100k.i1-IQ3_M.gguf) | i1-IQ3_M | 5.8 |  |
+| [GGUF](https://huggingface.co/mradermacher/Geneva-12B-GCv2-100k-i1-GGUF/resolve/main/Geneva-12B-GCv2-100k.i1-Q3_K_M.gguf) | i1-Q3_K_M | 6.2 | IQ3_S probably better |
+| [GGUF](https://huggingface.co/mradermacher/Geneva-12B-GCv2-100k-i1-GGUF/resolve/main/Geneva-12B-GCv2-100k.i1-Q3_K_L.gguf) | i1-Q3_K_L | 6.7 | IQ3_M probably better |
+| [GGUF](https://huggingface.co/mradermacher/Geneva-12B-GCv2-100k-i1-GGUF/resolve/main/Geneva-12B-GCv2-100k.i1-IQ4_XS.gguf) | i1-IQ4_XS | 6.8 |  |
+| [GGUF](https://huggingface.co/mradermacher/Geneva-12B-GCv2-100k-i1-GGUF/resolve/main/Geneva-12B-GCv2-100k.i1-Q4_0.gguf) | i1-Q4_0 | 7.2 | fast, low quality |
+| [GGUF](https://huggingface.co/mradermacher/Geneva-12B-GCv2-100k-i1-GGUF/resolve/main/Geneva-12B-GCv2-100k.i1-IQ4_NL.gguf) | i1-IQ4_NL | 7.2 | prefer IQ4_XS |
+| [GGUF](https://huggingface.co/mradermacher/Geneva-12B-GCv2-100k-i1-GGUF/resolve/main/Geneva-12B-GCv2-100k.i1-Q4_K_S.gguf) | i1-Q4_K_S | 7.2 | optimal size/speed/quality |
+| [GGUF](https://huggingface.co/mradermacher/Geneva-12B-GCv2-100k-i1-GGUF/resolve/main/Geneva-12B-GCv2-100k.i1-Q4_K_M.gguf) | i1-Q4_K_M | 7.6 | fast, recommended |
+| [GGUF](https://huggingface.co/mradermacher/Geneva-12B-GCv2-100k-i1-GGUF/resolve/main/Geneva-12B-GCv2-100k.i1-Q4_1.gguf) | i1-Q4_1 | 7.9 |  |
+| [GGUF](https://huggingface.co/mradermacher/Geneva-12B-GCv2-100k-i1-GGUF/resolve/main/Geneva-12B-GCv2-100k.i1-Q5_K_S.gguf) | i1-Q5_K_S | 8.6 |  |
+| [GGUF](https://huggingface.co/mradermacher/Geneva-12B-GCv2-100k-i1-GGUF/resolve/main/Geneva-12B-GCv2-100k.i1-Q5_K_M.gguf) | i1-Q5_K_M | 8.8 |  |
+| [GGUF](https://huggingface.co/mradermacher/Geneva-12B-GCv2-100k-i1-GGUF/resolve/main/Geneva-12B-GCv2-100k.i1-Q6_K.gguf) | i1-Q6_K | 10.2 | practically like static Q6_K |
+
+Here is a handy graph by ikawrakow comparing some lower-quality quant
+types (lower is better):
+
+![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png)
+
+And here are Artefact2's thoughts on the matter:
+https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9
+
+## FAQ / Model Request
+
+See https://huggingface.co/mradermacher/model_requests for some answers to
+questions you might have and/or if you want some other model quantized.
+
+## Thanks
+
+I thank my company, [nethype GmbH](https://www.nethype.de/), for letting
+me use its servers and providing upgrades to my workstation to enable
+this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to.
+
+
diff --git a/imatrix.dat b/imatrix.dat
new file mode 100644
index 0000000..d81d2d2
--- /dev/null
+++ b/imatrix.dat
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:526ed9ca705188687d4b3b021b12a3ea8a14aa9efd62c0fb67bb9ba3c198e107
+size 7054405
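
Each `.gguf` entry and `imatrix.dat` in this commit is a Git LFS pointer rather than the file itself: a `version` line, an `oid sha256:` line, and a `size` line in bytes. The following is a minimal Python sketch, assuming the pointer text and the downloaded file are both available locally, of how a download could be checked against its pointer. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and are not part of Git LFS or any Hugging Face tool.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS v1 pointer into a dict."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form "<algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file has the size and SHA-256 recorded in the pointer."""
    path = Path(path)
    # Cheap check first: a size mismatch avoids hashing a multi-gigabyte file.
    if pointer["algo"] != "sha256" or path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]
```

For example, passing the three pointer lines recorded for `imatrix.dat` to `parse_lfs_pointer` and then calling `matches_pointer("imatrix.dat", pointer)` would report whether a local copy matches this commit. The file is read in chunks so that the larger quants do not have to fit in memory.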