commit 5ea375498cf6d3d4e24a2bb460beee7713e87fdf Author: ModelHub XC Date: Wed May 13 05:44:32 2026 +0800 初始化项目,由ModelHub XC社区提供模型 Model: mradermacher/Herodotos-i1-GGUF Source: Original Platform diff --git a/.gitattributes b/.gitattributes new file mode 100644 index 0000000..11262ca --- /dev/null +++ b/.gitattributes @@ -0,0 +1,60 @@ +*.7z filter=lfs diff=lfs merge=lfs -text +*.arrow filter=lfs diff=lfs merge=lfs -text +*.bin filter=lfs diff=lfs merge=lfs -text +*.bz2 filter=lfs diff=lfs merge=lfs -text +*.ckpt filter=lfs diff=lfs merge=lfs -text +*.ftz filter=lfs diff=lfs merge=lfs -text +*.gz filter=lfs diff=lfs merge=lfs -text +*.h5 filter=lfs diff=lfs merge=lfs -text +*.joblib filter=lfs diff=lfs merge=lfs -text +*.lfs.* filter=lfs diff=lfs merge=lfs -text +*.mlmodel filter=lfs diff=lfs merge=lfs -text +*.model filter=lfs diff=lfs merge=lfs -text +*.msgpack filter=lfs diff=lfs merge=lfs -text +*.npy filter=lfs diff=lfs merge=lfs -text +*.npz filter=lfs diff=lfs merge=lfs -text +*.onnx filter=lfs diff=lfs merge=lfs -text +*.ot filter=lfs diff=lfs merge=lfs -text +*.parquet filter=lfs diff=lfs merge=lfs -text +*.pb filter=lfs diff=lfs merge=lfs -text +*.pickle filter=lfs diff=lfs merge=lfs -text +*.pkl filter=lfs diff=lfs merge=lfs -text +*.pt filter=lfs diff=lfs merge=lfs -text +*.pth filter=lfs diff=lfs merge=lfs -text +*.rar filter=lfs diff=lfs merge=lfs -text +*.safetensors filter=lfs diff=lfs merge=lfs -text +saved_model/**/* filter=lfs diff=lfs merge=lfs -text +*.tar.* filter=lfs diff=lfs merge=lfs -text +*.tar filter=lfs diff=lfs merge=lfs -text +*.tflite filter=lfs diff=lfs merge=lfs -text +*.tgz filter=lfs diff=lfs merge=lfs -text +*.wasm filter=lfs diff=lfs merge=lfs -text +*.xz filter=lfs diff=lfs merge=lfs -text +*.zip filter=lfs diff=lfs merge=lfs -text +*.zst filter=lfs diff=lfs merge=lfs -text +*tfevents* filter=lfs diff=lfs merge=lfs -text +imatrix.dat filter=lfs diff=lfs merge=lfs -text +Herodotos.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text +Herodotos.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text +Herodotos.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Herodotos.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text +Herodotos.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text +Herodotos.i1-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text +Herodotos.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text +Herodotos.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text +Herodotos.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text +Herodotos.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text +Herodotos.i1-Q2_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Herodotos.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text +Herodotos.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Herodotos.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text +Herodotos.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text +Herodotos.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text +Herodotos.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Herodotos.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text +Herodotos.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text +Herodotos.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text +Herodotos.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text +Herodotos.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text +Herodotos.i1-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text +Herodotos.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text diff --git a/Herodotos.i1-IQ1_M.gguf b/Herodotos.i1-IQ1_M.gguf new file mode 100644 index 0000000..42b823d --- /dev/null +++ b/Herodotos.i1-IQ1_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:460620f14a2937df8b5130d3827f31cb8ab690f3f8f683685fb4d1c90a803f6e +size 3872309824 diff --git a/Herodotos.i1-IQ1_S.gguf b/Herodotos.i1-IQ1_S.gguf new file mode 100644 index 0000000..1f666d7 --- /dev/null +++ b/Herodotos.i1-IQ1_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:0853ce724420a857dd883d490d769442e52083610ea37aa10f4a2c9cafe19414 +size 3607994944 diff --git a/Herodotos.i1-IQ2_M.gguf b/Herodotos.i1-IQ2_M.gguf new file mode 100644 index 0000000..5a35266 --- /dev/null +++ b/Herodotos.i1-IQ2_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:4ac9c200389710228d6442fc76ea8b36c7dcff04a73ccfa059093085230874de +size 5356147264 diff --git a/Herodotos.i1-IQ2_S.gguf b/Herodotos.i1-IQ2_S.gguf new file mode 100644 index 0000000..822d752 --- /dev/null +++ b/Herodotos.i1-IQ2_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:9e85122202f05a0923fffe37d8d4e003ea6893f13c6054b873e600e43c8a5aef +size 5003727424 diff --git a/Herodotos.i1-IQ2_XS.gguf b/Herodotos.i1-IQ2_XS.gguf new file mode 100644 index 0000000..572b120 --- /dev/null +++ b/Herodotos.i1-IQ2_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:a416376a477ec9ab93f4ee14e9b6140c9bf2f491902a9c9c7205d674eacf9f9d +size 4704576064 diff --git a/Herodotos.i1-IQ2_XXS.gguf b/Herodotos.i1-IQ2_XXS.gguf new file mode 100644 index 0000000..72a375c --- /dev/null +++ b/Herodotos.i1-IQ2_XXS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:5a4c33e605d2635414ae99d659eba28ee570ef404cab9af3e6ff0d1adb03d2eb +size 4312834624 diff --git a/Herodotos.i1-IQ3_M.gguf b/Herodotos.i1-IQ3_M.gguf new file mode 100644 index 0000000..148b8e1 --- /dev/null +++ b/Herodotos.i1-IQ3_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:e795712620fe7f0b87a94a61d4b52c42363e2ddb9d9ff7100bf95b80a92d7c34 +size 6916538944 diff --git a/Herodotos.i1-IQ3_S.gguf b/Herodotos.i1-IQ3_S.gguf new file mode 100644 index 0000000..18c5e04 --- /dev/null +++ b/Herodotos.i1-IQ3_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:0d422a2b29ae984b85de20deaad4f35ffa8de24c73d45ffe14c7426fdb5c3a97 +size 6693020224 diff --git a/Herodotos.i1-IQ3_XS.gguf b/Herodotos.i1-IQ3_XS.gguf new file mode 100644 index 0000000..3e5548b --- /dev/null +++ b/Herodotos.i1-IQ3_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:85f92e02a97dba6fc1a2f428104bd061f258d8c4ba69d009d93a31ee7e7479c5 +size 6383362624 diff --git a/Herodotos.i1-IQ3_XXS.gguf b/Herodotos.i1-IQ3_XXS.gguf new file mode 100644 index 0000000..5eb981c --- /dev/null +++ b/Herodotos.i1-IQ3_XXS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:52fb66ce9538e4f9de95ef1c4b13bc3f28fa289b17165f74dab82c1f0290b412 +size 5946708544 diff --git a/Herodotos.i1-IQ4_NL.gguf b/Herodotos.i1-IQ4_NL.gguf new file mode 100644 index 0000000..2725e7e --- /dev/null +++ b/Herodotos.i1-IQ4_NL.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:467cb2fce2cec17d9c296ef4672e058c19421acc291527df48c98cd033b7dfd6 +size 8549184064 diff --git a/Herodotos.i1-IQ4_XS.gguf b/Herodotos.i1-IQ4_XS.gguf new file mode 100644 index 0000000..00ede03 --- /dev/null +++ b/Herodotos.i1-IQ4_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:b01bb961fbb381457a378466edca343fe23a255d79e4b6502f98b795d73e6eff +size 8119841344 diff --git a/Herodotos.i1-Q2_K.gguf b/Herodotos.i1-Q2_K.gguf new file mode 100644 index 0000000..c50e446 --- /dev/null +++ b/Herodotos.i1-Q2_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:7b0ffe26db8e0ce5e1eadaf353e1dcbb9dea6b5f5d66e803afa206172671f073 +size 5770498624 diff --git a/Herodotos.i1-Q2_K_S.gguf b/Herodotos.i1-Q2_K_S.gguf new file mode 100644 index 0000000..b9fd2bb --- /dev/null +++ b/Herodotos.i1-Q2_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:5cbfb3bccc9e6fa69ddb4a5070ed223dd62d1f08993668621901ad9fd105e158 +size 5397189184 diff --git a/Herodotos.i1-Q3_K_L.gguf b/Herodotos.i1-Q3_K_L.gguf new file mode 100644 index 0000000..e7f089d --- /dev/null +++ b/Herodotos.i1-Q3_K_L.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:5a0289878bdc361098df1c772ea5f6486759d8ef023d122cddedbde24ba27092 +size 7924769344 diff --git a/Herodotos.i1-Q3_K_M.gguf b/Herodotos.i1-Q3_K_M.gguf new file mode 100644 index 0000000..4cfb1cc --- /dev/null +++ b/Herodotos.i1-Q3_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8e7337bdeead07e61032f87c15c0f02f255b9783aa6e40b73933bed3bb21939f +size 7339205184 diff --git a/Herodotos.i1-Q3_K_S.gguf b/Herodotos.i1-Q3_K_S.gguf new file mode 100644 index 0000000..c451d7b --- /dev/null +++ b/Herodotos.i1-Q3_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:33f0c739fc5e6a62526a75eabc3e99b0004fd227867d3b0a1817c39532476309 +size 6659596864 diff --git a/Herodotos.i1-Q4_0.gguf b/Herodotos.i1-Q4_0.gguf new file mode 100644 index 0000000..2ca96ac --- /dev/null +++ b/Herodotos.i1-Q4_0.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:b69d672e586def09a82ed4c5cc05b18f2bb40d13a58f2ff852a570d605b98846 +size 8544268864 diff --git a/Herodotos.i1-Q4_1.gguf b/Herodotos.i1-Q4_1.gguf new file mode 100644 index 0000000..438fd46 --- /dev/null +++ b/Herodotos.i1-Q4_1.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:5af05c70a7a28613ad89297760d1ec3b91ccb1019e09c7f6cae0916dc82b089d +size 9392140864 diff --git a/Herodotos.i1-Q4_K_M.gguf b/Herodotos.i1-Q4_K_M.gguf new file mode 100644 index 0000000..45db376 --- /dev/null +++ b/Herodotos.i1-Q4_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:e289fa35b716800cfb583e93cc66a6ce544062dfd6b1ca400db8b99d85736dd5 +size 8988111424 diff --git a/Herodotos.i1-Q4_K_S.gguf b/Herodotos.i1-Q4_K_S.gguf new file mode 100644 index 0000000..fae8233 --- /dev/null +++ b/Herodotos.i1-Q4_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8b3fa43cedeb57cf71d0d5e2b4d3570735fd9ffa3334b098b44504395c8074c9 +size 8573432384 diff --git a/Herodotos.i1-Q5_K_M.gguf b/Herodotos.i1-Q5_K_M.gguf new file mode 100644 index 0000000..a95d05f --- /dev/null +++ b/Herodotos.i1-Q5_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c83c46f43f3b893572a5caefd2bdab05450b3e5ccfddcd3b4ee9f8ef6f4f9876 +size 10508874304 diff --git a/Herodotos.i1-Q5_K_S.gguf b/Herodotos.i1-Q5_K_S.gguf new file mode 100644 index 0000000..e6db376 --- /dev/null +++ b/Herodotos.i1-Q5_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:d3860ba0f78ef920b9639f02316f5b013cea3478ba71b7efcf7e250946821847 +size 10266554944 diff --git a/Herodotos.i1-Q6_K.gguf b/Herodotos.i1-Q6_K.gguf new file mode 100644 index 0000000..afc3216 --- /dev/null +++ b/Herodotos.i1-Q6_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:df23bc56bfce738fa08f8a42822ccd5fab72fbbb1234d706671a23b02e637069 +size 12124684864 diff --git a/README.md b/README.md new file mode 100644 index 0000000..44d5e67 --- /dev/null +++ b/README.md @@ -0,0 +1,79 @@ +--- +base_model: Triangle104/Herodotos-14B +language: +- en +library_name: transformers +license: apache-2.0 +quantized_by: mradermacher +tags: +- mergekit +- merge +--- +## About + + + + + + +weighted/imatrix quants of https://huggingface.co/Triangle104/Herodotos-14B + + +static quants are available at https://huggingface.co/mradermacher/Herodotos-GGUF +## Usage + +If you are unsure how to use GGUF files, refer to one of [TheBloke's +READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for +more details, including on how to concatenate multi-part files. + +## Provided Quants + +(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants) + +| Link | Type | Size/GB | Notes | +|:-----|:-----|--------:|:------| +| [GGUF](https://huggingface.co/mradermacher/Herodotos-i1-GGUF/resolve/main/Herodotos.i1-IQ1_S.gguf) | i1-IQ1_S | 3.7 | for the desperate | +| [GGUF](https://huggingface.co/mradermacher/Herodotos-i1-GGUF/resolve/main/Herodotos.i1-IQ1_M.gguf) | i1-IQ1_M | 4.0 | mostly desperate | +| [GGUF](https://huggingface.co/mradermacher/Herodotos-i1-GGUF/resolve/main/Herodotos.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 4.4 | | +| [GGUF](https://huggingface.co/mradermacher/Herodotos-i1-GGUF/resolve/main/Herodotos.i1-IQ2_XS.gguf) | i1-IQ2_XS | 4.8 | | +| [GGUF](https://huggingface.co/mradermacher/Herodotos-i1-GGUF/resolve/main/Herodotos.i1-IQ2_S.gguf) | i1-IQ2_S | 5.1 | | +| [GGUF](https://huggingface.co/mradermacher/Herodotos-i1-GGUF/resolve/main/Herodotos.i1-IQ2_M.gguf) | i1-IQ2_M | 5.5 | | +| [GGUF](https://huggingface.co/mradermacher/Herodotos-i1-GGUF/resolve/main/Herodotos.i1-Q2_K_S.gguf) | i1-Q2_K_S | 5.5 | very low quality | +| [GGUF](https://huggingface.co/mradermacher/Herodotos-i1-GGUF/resolve/main/Herodotos.i1-Q2_K.gguf) | i1-Q2_K | 5.9 | IQ3_XXS probably better | +| [GGUF](https://huggingface.co/mradermacher/Herodotos-i1-GGUF/resolve/main/Herodotos.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 6.0 | lower quality | +| [GGUF](https://huggingface.co/mradermacher/Herodotos-i1-GGUF/resolve/main/Herodotos.i1-IQ3_XS.gguf) | i1-IQ3_XS | 6.5 | | +| [GGUF](https://huggingface.co/mradermacher/Herodotos-i1-GGUF/resolve/main/Herodotos.i1-Q3_K_S.gguf) | i1-Q3_K_S | 6.8 | IQ3_XS probably better | +| [GGUF](https://huggingface.co/mradermacher/Herodotos-i1-GGUF/resolve/main/Herodotos.i1-IQ3_S.gguf) | i1-IQ3_S | 6.8 | beats Q3_K* | +| [GGUF](https://huggingface.co/mradermacher/Herodotos-i1-GGUF/resolve/main/Herodotos.i1-IQ3_M.gguf) | i1-IQ3_M | 7.0 | | +| [GGUF](https://huggingface.co/mradermacher/Herodotos-i1-GGUF/resolve/main/Herodotos.i1-Q3_K_M.gguf) | i1-Q3_K_M | 7.4 | IQ3_S probably better | +| [GGUF](https://huggingface.co/mradermacher/Herodotos-i1-GGUF/resolve/main/Herodotos.i1-Q3_K_L.gguf) | i1-Q3_K_L | 8.0 | IQ3_M probably better | +| [GGUF](https://huggingface.co/mradermacher/Herodotos-i1-GGUF/resolve/main/Herodotos.i1-IQ4_XS.gguf) | i1-IQ4_XS | 8.2 | | +| [GGUF](https://huggingface.co/mradermacher/Herodotos-i1-GGUF/resolve/main/Herodotos.i1-Q4_0.gguf) | i1-Q4_0 | 8.6 | fast, low quality | +| [GGUF](https://huggingface.co/mradermacher/Herodotos-i1-GGUF/resolve/main/Herodotos.i1-IQ4_NL.gguf) | i1-IQ4_NL | 8.6 | prefer IQ4_XS | +| [GGUF](https://huggingface.co/mradermacher/Herodotos-i1-GGUF/resolve/main/Herodotos.i1-Q4_K_S.gguf) | i1-Q4_K_S | 8.7 | optimal size/speed/quality | +| [GGUF](https://huggingface.co/mradermacher/Herodotos-i1-GGUF/resolve/main/Herodotos.i1-Q4_K_M.gguf) | i1-Q4_K_M | 9.1 | fast, recommended | +| [GGUF](https://huggingface.co/mradermacher/Herodotos-i1-GGUF/resolve/main/Herodotos.i1-Q4_1.gguf) | i1-Q4_1 | 9.5 | | +| [GGUF](https://huggingface.co/mradermacher/Herodotos-i1-GGUF/resolve/main/Herodotos.i1-Q5_K_S.gguf) | i1-Q5_K_S | 10.4 | | +| [GGUF](https://huggingface.co/mradermacher/Herodotos-i1-GGUF/resolve/main/Herodotos.i1-Q5_K_M.gguf) | i1-Q5_K_M | 10.6 | | +| [GGUF](https://huggingface.co/mradermacher/Herodotos-i1-GGUF/resolve/main/Herodotos.i1-Q6_K.gguf) | i1-Q6_K | 12.2 | practically like static Q6_K | + +Here is a handy graph by ikawrakow comparing some lower-quality quant +types (lower is better): + +![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png) + +And here are Artefact2's thoughts on the matter: +https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9 + +## FAQ / Model Request + +See https://huggingface.co/mradermacher/model_requests for some answers to +questions you might have and/or if you want some other model quantized. + +## Thanks + +I thank my company, [nethype GmbH](https://www.nethype.de/), for letting +me use its servers and providing upgrades to my workstation to enable +this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to. + + diff --git a/imatrix.dat b/imatrix.dat new file mode 100644 index 0000000..47ff8c4 --- /dev/null +++ b/imatrix.dat @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:a7d1bb4c8dc3112b83d564a702388d3f7fa51190324614ff05d5307a730af8bd +size 8563597