commit 12bd673e8f6e7667e61c1c3fc72fa1507c483233 Author: ModelHub XC Date: Sat Apr 25 12:13:06 2026 +0800 初始化项目,由ModelHub XC社区提供模型 Model: mradermacher/AceMath-1.5B-Instruct-i1-GGUF Source: Original Platform diff --git a/.gitattributes b/.gitattributes new file mode 100644 index 0000000..a91206c --- /dev/null +++ b/.gitattributes @@ -0,0 +1,60 @@ +*.7z filter=lfs diff=lfs merge=lfs -text +*.arrow filter=lfs diff=lfs merge=lfs -text +*.bin filter=lfs diff=lfs merge=lfs -text +*.bz2 filter=lfs diff=lfs merge=lfs -text +*.ckpt filter=lfs diff=lfs merge=lfs -text +*.ftz filter=lfs diff=lfs merge=lfs -text +*.gz filter=lfs diff=lfs merge=lfs -text +*.h5 filter=lfs diff=lfs merge=lfs -text +*.joblib filter=lfs diff=lfs merge=lfs -text +*.lfs.* filter=lfs diff=lfs merge=lfs -text +*.mlmodel filter=lfs diff=lfs merge=lfs -text +*.model filter=lfs diff=lfs merge=lfs -text +*.msgpack filter=lfs diff=lfs merge=lfs -text +*.npy filter=lfs diff=lfs merge=lfs -text +*.npz filter=lfs diff=lfs merge=lfs -text +*.onnx filter=lfs diff=lfs merge=lfs -text +*.ot filter=lfs diff=lfs merge=lfs -text +*.parquet filter=lfs diff=lfs merge=lfs -text +*.pb filter=lfs diff=lfs merge=lfs -text +*.pickle filter=lfs diff=lfs merge=lfs -text +*.pkl filter=lfs diff=lfs merge=lfs -text +*.pt filter=lfs diff=lfs merge=lfs -text +*.pth filter=lfs diff=lfs merge=lfs -text +*.rar filter=lfs diff=lfs merge=lfs -text +*.safetensors filter=lfs diff=lfs merge=lfs -text +saved_model/**/* filter=lfs diff=lfs merge=lfs -text +*.tar.* filter=lfs diff=lfs merge=lfs -text +*.tar filter=lfs diff=lfs merge=lfs -text +*.tflite filter=lfs diff=lfs merge=lfs -text +*.tgz filter=lfs diff=lfs merge=lfs -text +*.wasm filter=lfs diff=lfs merge=lfs -text +*.xz filter=lfs diff=lfs merge=lfs -text +*.zip filter=lfs diff=lfs merge=lfs -text +*.zst filter=lfs diff=lfs merge=lfs -text +*tfevents* filter=lfs diff=lfs merge=lfs -text +imatrix.dat filter=lfs diff=lfs merge=lfs -text +AceMath-1.5B-Instruct.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text +AceMath-1.5B-Instruct.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text +AceMath-1.5B-Instruct.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text +AceMath-1.5B-Instruct.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text +AceMath-1.5B-Instruct.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text +AceMath-1.5B-Instruct.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text +AceMath-1.5B-Instruct.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text +AceMath-1.5B-Instruct.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text +AceMath-1.5B-Instruct.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text +AceMath-1.5B-Instruct.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text +AceMath-1.5B-Instruct.i1-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text +AceMath-1.5B-Instruct.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text +AceMath-1.5B-Instruct.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text +AceMath-1.5B-Instruct.i1-Q2_K_S.gguf filter=lfs diff=lfs merge=lfs -text +AceMath-1.5B-Instruct.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text +AceMath-1.5B-Instruct.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text +AceMath-1.5B-Instruct.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text +AceMath-1.5B-Instruct.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text +AceMath-1.5B-Instruct.i1-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text +AceMath-1.5B-Instruct.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text +AceMath-1.5B-Instruct.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text +AceMath-1.5B-Instruct.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text +AceMath-1.5B-Instruct.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text +AceMath-1.5B-Instruct.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text diff --git a/AceMath-1.5B-Instruct.i1-IQ1_M.gguf b/AceMath-1.5B-Instruct.i1-IQ1_M.gguf new file mode 100644 index 0000000..ef60000 --- /dev/null +++ b/AceMath-1.5B-Instruct.i1-IQ1_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:a898e67c5ab09c01c726c42efcc8646ee4dacc4fb03e9f56f5e744be7eceed73 +size 541035424 diff --git a/AceMath-1.5B-Instruct.i1-IQ1_S.gguf b/AceMath-1.5B-Instruct.i1-IQ1_S.gguf new file mode 100644 index 0000000..c052111 --- /dev/null +++ b/AceMath-1.5B-Instruct.i1-IQ1_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:0e14a835d52035ba21dba2d9802141dc59a799f51bce3f44868e7cd8a14e0585 +size 513101728 diff --git a/AceMath-1.5B-Instruct.i1-IQ2_M.gguf b/AceMath-1.5B-Instruct.i1-IQ2_M.gguf new file mode 100644 index 0000000..e13bc24 --- /dev/null +++ b/AceMath-1.5B-Instruct.i1-IQ2_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:043e367563f4d4c9c4b86b8ea7cd64cbc66161345d83dad2c94346c2c50ca712 +size 701330848 diff --git a/AceMath-1.5B-Instruct.i1-IQ2_S.gguf b/AceMath-1.5B-Instruct.i1-IQ2_S.gguf new file mode 100644 index 0000000..45767cf --- /dev/null +++ b/AceMath-1.5B-Instruct.i1-IQ2_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:f5a6d5848144867bda2cd6c3e8363e865960b0e24a2f50fa1339af0289f1fa2c +size 664085920 diff --git a/AceMath-1.5B-Instruct.i1-IQ2_XS.gguf b/AceMath-1.5B-Instruct.i1-IQ2_XS.gguf new file mode 100644 index 0000000..10ff1c4 --- /dev/null +++ b/AceMath-1.5B-Instruct.i1-IQ2_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:70683834f55d57f86b66df0dfcd2b3b425a8633cc1e5d9ff644121452cb4deaa +size 626900896 diff --git a/AceMath-1.5B-Instruct.i1-IQ2_XXS.gguf b/AceMath-1.5B-Instruct.i1-IQ2_XXS.gguf new file mode 100644 index 0000000..269e35e --- /dev/null +++ b/AceMath-1.5B-Instruct.i1-IQ2_XXS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:0b524a828e7fd1dfb09997e78e802f7270afe776d7a1261e51a38c03e15de8b4 +size 587591584 diff --git a/AceMath-1.5B-Instruct.i1-IQ3_M.gguf b/AceMath-1.5B-Instruct.i1-IQ3_M.gguf new file mode 100644 index 0000000..d280c0b --- /dev/null +++ b/AceMath-1.5B-Instruct.i1-IQ3_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:86bc34c37f7901f236093522972398420376ff9408afba9d6d21850564e8990c +size 876940192 diff --git a/AceMath-1.5B-Instruct.i1-IQ3_S.gguf b/AceMath-1.5B-Instruct.i1-IQ3_S.gguf new file mode 100644 index 0000000..b764da8 --- /dev/null +++ b/AceMath-1.5B-Instruct.i1-IQ3_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:0fffb9e6b5f36079a6c708f84e481b64b4cf48dc344c8bea7872502862ca66e2 +size 862683040 diff --git a/AceMath-1.5B-Instruct.i1-IQ3_XS.gguf b/AceMath-1.5B-Instruct.i1-IQ3_XS.gguf new file mode 100644 index 0000000..076c4cf --- /dev/null +++ b/AceMath-1.5B-Instruct.i1-IQ3_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:0d7052c2526c74138c72474ac8eac7bc42768e4b437385bb8baf9d58c5c2a0b0 +size 831975328 diff --git a/AceMath-1.5B-Instruct.i1-IQ3_XXS.gguf b/AceMath-1.5B-Instruct.i1-IQ3_XXS.gguf new file mode 100644 index 0000000..bdfba56 --- /dev/null +++ b/AceMath-1.5B-Instruct.i1-IQ3_XXS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8de1c2c576c28a77a2b172c31a62131a5f504fe3ec47257b5be44f4f782c03d3 +size 769068448 diff --git a/AceMath-1.5B-Instruct.i1-IQ4_NL.gguf b/AceMath-1.5B-Instruct.i1-IQ4_NL.gguf new file mode 100644 index 0000000..a840ec3 --- /dev/null +++ b/AceMath-1.5B-Instruct.i1-IQ4_NL.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:55a3ae824b4c85ab17848d8dc2ed7b2578b5caf017bf1b442cface1dc242f65e +size 1067602336 diff --git a/AceMath-1.5B-Instruct.i1-IQ4_XS.gguf b/AceMath-1.5B-Instruct.i1-IQ4_XS.gguf new file mode 100644 index 0000000..68f5437 --- /dev/null +++ b/AceMath-1.5B-Instruct.i1-IQ4_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:857f8966d7be104a0f46ca943d3fdaa2d5d543b94616f8aef1ae3bbe503c05b7 +size 1019709856 diff --git a/AceMath-1.5B-Instruct.i1-Q2_K.gguf b/AceMath-1.5B-Instruct.i1-Q2_K.gguf new file mode 100644 index 0000000..75a0960 --- /dev/null +++ b/AceMath-1.5B-Instruct.i1-Q2_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:97cf6c31726a5e7f81cd94401bb49f0b924adf1f36a7d3b08dad4dfa993b711b +size 752879008 diff --git a/AceMath-1.5B-Instruct.i1-Q2_K_S.gguf b/AceMath-1.5B-Instruct.i1-Q2_K_S.gguf new file mode 100644 index 0000000..62e40ea --- /dev/null +++ b/AceMath-1.5B-Instruct.i1-Q2_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:7e96c212205e5e9923373869f3ffae46bfc5989c4c978d4debf12cea48419289 +size 716709280 diff --git a/AceMath-1.5B-Instruct.i1-Q3_K_L.gguf b/AceMath-1.5B-Instruct.i1-Q3_K_L.gguf new file mode 100644 index 0000000..eb5acd4 --- /dev/null +++ b/AceMath-1.5B-Instruct.i1-Q3_K_L.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:1b2c0b9192379c21fc8537dea9983aedae788f46774f88916d567e0e23a0bdad +size 980438944 diff --git a/AceMath-1.5B-Instruct.i1-Q3_K_M.gguf b/AceMath-1.5B-Instruct.i1-Q3_K_M.gguf new file mode 100644 index 0000000..37bcb28 --- /dev/null +++ b/AceMath-1.5B-Instruct.i1-Q3_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:ef7b15c69d2fedc4fa30d80a483775436f7cf0f47427ea1a75f8bfbd0d4684bc +size 924454816 diff --git a/AceMath-1.5B-Instruct.i1-Q3_K_S.gguf b/AceMath-1.5B-Instruct.i1-Q3_K_S.gguf new file mode 100644 index 0000000..d7030e1 --- /dev/null +++ b/AceMath-1.5B-Instruct.i1-Q3_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:e4b44b46fdb59f4303bcfd4b838dacc451c9a6306af76ff223a0b2fece5a92d8 +size 861220768 diff --git a/AceMath-1.5B-Instruct.i1-Q4_0.gguf b/AceMath-1.5B-Instruct.i1-Q4_0.gguf new file mode 100644 index 0000000..53e5f64 --- /dev/null +++ b/AceMath-1.5B-Instruct.i1-Q4_0.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:bc6a68e4e6d1c5533d8fd6ed61cf9b42d798be597f776f5016faf869e289a8a4 +size 1068806560 diff --git a/AceMath-1.5B-Instruct.i1-Q4_1.gguf b/AceMath-1.5B-Instruct.i1-Q4_1.gguf new file mode 100644 index 0000000..a2776c9 --- /dev/null +++ b/AceMath-1.5B-Instruct.i1-Q4_1.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:fff53e70cca40eb52b79d44d4179311abbcc5e1b0dfe4f356730422ed7a75c7c +size 1162699168 diff --git a/AceMath-1.5B-Instruct.i1-Q4_K_M.gguf b/AceMath-1.5B-Instruct.i1-Q4_K_M.gguf new file mode 100644 index 0000000..f9a9481 --- /dev/null +++ b/AceMath-1.5B-Instruct.i1-Q4_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:bb748730812d974fa6503ff2a7122620e5429eb2c9bf126ae94baaaeba79fe69 +size 1117319584 diff --git a/AceMath-1.5B-Instruct.i1-Q4_K_S.gguf b/AceMath-1.5B-Instruct.i1-Q4_K_S.gguf new file mode 100644 index 0000000..68f2623 --- /dev/null +++ b/AceMath-1.5B-Instruct.i1-Q4_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:e0cdbfce9bd0f1c9017405c0e925da113daec66717f90d5b738fecb105d1bc62 +size 1071583648 diff --git a/AceMath-1.5B-Instruct.i1-Q5_K_M.gguf b/AceMath-1.5B-Instruct.i1-Q5_K_M.gguf new file mode 100644 index 0000000..8e551df --- /dev/null +++ b/AceMath-1.5B-Instruct.i1-Q5_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8b669642f8bb9b713133067f8285f2c8a98f82de69ebf44c31a7d6ba651464b2 +size 1285493152 diff --git a/AceMath-1.5B-Instruct.i1-Q5_K_S.gguf b/AceMath-1.5B-Instruct.i1-Q5_K_S.gguf new file mode 100644 index 0000000..f28baa1 --- /dev/null +++ b/AceMath-1.5B-Instruct.i1-Q5_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:a66c88bd6d6ddad709f0c4bc854e8c6f5ef795c6f0ffb445a7d3ed6550374b5c +size 1259172256 diff --git a/AceMath-1.5B-Instruct.i1-Q6_K.gguf b/AceMath-1.5B-Instruct.i1-Q6_K.gguf new file mode 100644 index 0000000..cec4bac --- /dev/null +++ b/AceMath-1.5B-Instruct.i1-Q6_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:ac92c900d075b89a1115c768b394c41abd59f40a41473a49e5e27848bfc2676a +size 1464177568 diff --git a/README.md b/README.md new file mode 100644 index 0000000..01fe3be --- /dev/null +++ b/README.md @@ -0,0 +1,82 @@ +--- +base_model: nvidia/AceMath-1.5B-Instruct +language: +- en +library_name: transformers +license: cc-by-nc-4.0 +quantized_by: mradermacher +tags: +- nvidia +- AceMath +- math +- CoT +- pytorch +--- +## About + + + + + + +weighted/imatrix quants of https://huggingface.co/nvidia/AceMath-1.5B-Instruct + + +static quants are available at https://huggingface.co/mradermacher/AceMath-1.5B-Instruct-GGUF +## Usage + +If you are unsure how to use GGUF files, refer to one of [TheBloke's +READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for +more details, including on how to concatenate multi-part files. + +## Provided Quants + +(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants) + +| Link | Type | Size/GB | Notes | +|:-----|:-----|--------:|:------| +| [GGUF](https://huggingface.co/mradermacher/AceMath-1.5B-Instruct-i1-GGUF/resolve/main/AceMath-1.5B-Instruct.i1-IQ1_S.gguf) | i1-IQ1_S | 0.6 | for the desperate | +| [GGUF](https://huggingface.co/mradermacher/AceMath-1.5B-Instruct-i1-GGUF/resolve/main/AceMath-1.5B-Instruct.i1-IQ1_M.gguf) | i1-IQ1_M | 0.6 | mostly desperate | +| [GGUF](https://huggingface.co/mradermacher/AceMath-1.5B-Instruct-i1-GGUF/resolve/main/AceMath-1.5B-Instruct.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 0.7 | | +| [GGUF](https://huggingface.co/mradermacher/AceMath-1.5B-Instruct-i1-GGUF/resolve/main/AceMath-1.5B-Instruct.i1-IQ2_XS.gguf) | i1-IQ2_XS | 0.7 | | +| [GGUF](https://huggingface.co/mradermacher/AceMath-1.5B-Instruct-i1-GGUF/resolve/main/AceMath-1.5B-Instruct.i1-IQ2_S.gguf) | i1-IQ2_S | 0.8 | | +| [GGUF](https://huggingface.co/mradermacher/AceMath-1.5B-Instruct-i1-GGUF/resolve/main/AceMath-1.5B-Instruct.i1-IQ2_M.gguf) | i1-IQ2_M | 0.8 | | +| [GGUF](https://huggingface.co/mradermacher/AceMath-1.5B-Instruct-i1-GGUF/resolve/main/AceMath-1.5B-Instruct.i1-Q2_K_S.gguf) | i1-Q2_K_S | 0.8 | very low quality | +| [GGUF](https://huggingface.co/mradermacher/AceMath-1.5B-Instruct-i1-GGUF/resolve/main/AceMath-1.5B-Instruct.i1-Q2_K.gguf) | i1-Q2_K | 0.9 | IQ3_XXS probably better | +| [GGUF](https://huggingface.co/mradermacher/AceMath-1.5B-Instruct-i1-GGUF/resolve/main/AceMath-1.5B-Instruct.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 0.9 | lower quality | +| [GGUF](https://huggingface.co/mradermacher/AceMath-1.5B-Instruct-i1-GGUF/resolve/main/AceMath-1.5B-Instruct.i1-IQ3_XS.gguf) | i1-IQ3_XS | 0.9 | | +| [GGUF](https://huggingface.co/mradermacher/AceMath-1.5B-Instruct-i1-GGUF/resolve/main/AceMath-1.5B-Instruct.i1-Q3_K_S.gguf) | i1-Q3_K_S | 1.0 | IQ3_XS probably better | +| [GGUF](https://huggingface.co/mradermacher/AceMath-1.5B-Instruct-i1-GGUF/resolve/main/AceMath-1.5B-Instruct.i1-IQ3_S.gguf) | i1-IQ3_S | 1.0 | beats Q3_K* | +| [GGUF](https://huggingface.co/mradermacher/AceMath-1.5B-Instruct-i1-GGUF/resolve/main/AceMath-1.5B-Instruct.i1-IQ3_M.gguf) | i1-IQ3_M | 1.0 | | +| [GGUF](https://huggingface.co/mradermacher/AceMath-1.5B-Instruct-i1-GGUF/resolve/main/AceMath-1.5B-Instruct.i1-Q3_K_M.gguf) | i1-Q3_K_M | 1.0 | IQ3_S probably better | +| [GGUF](https://huggingface.co/mradermacher/AceMath-1.5B-Instruct-i1-GGUF/resolve/main/AceMath-1.5B-Instruct.i1-Q3_K_L.gguf) | i1-Q3_K_L | 1.1 | IQ3_M probably better | +| [GGUF](https://huggingface.co/mradermacher/AceMath-1.5B-Instruct-i1-GGUF/resolve/main/AceMath-1.5B-Instruct.i1-IQ4_XS.gguf) | i1-IQ4_XS | 1.1 | | +| [GGUF](https://huggingface.co/mradermacher/AceMath-1.5B-Instruct-i1-GGUF/resolve/main/AceMath-1.5B-Instruct.i1-IQ4_NL.gguf) | i1-IQ4_NL | 1.2 | prefer IQ4_XS | +| [GGUF](https://huggingface.co/mradermacher/AceMath-1.5B-Instruct-i1-GGUF/resolve/main/AceMath-1.5B-Instruct.i1-Q4_0.gguf) | i1-Q4_0 | 1.2 | fast, low quality | +| [GGUF](https://huggingface.co/mradermacher/AceMath-1.5B-Instruct-i1-GGUF/resolve/main/AceMath-1.5B-Instruct.i1-Q4_K_S.gguf) | i1-Q4_K_S | 1.2 | optimal size/speed/quality | +| [GGUF](https://huggingface.co/mradermacher/AceMath-1.5B-Instruct-i1-GGUF/resolve/main/AceMath-1.5B-Instruct.i1-Q4_K_M.gguf) | i1-Q4_K_M | 1.2 | fast, recommended | +| [GGUF](https://huggingface.co/mradermacher/AceMath-1.5B-Instruct-i1-GGUF/resolve/main/AceMath-1.5B-Instruct.i1-Q4_1.gguf) | i1-Q4_1 | 1.3 | | +| [GGUF](https://huggingface.co/mradermacher/AceMath-1.5B-Instruct-i1-GGUF/resolve/main/AceMath-1.5B-Instruct.i1-Q5_K_S.gguf) | i1-Q5_K_S | 1.4 | | +| [GGUF](https://huggingface.co/mradermacher/AceMath-1.5B-Instruct-i1-GGUF/resolve/main/AceMath-1.5B-Instruct.i1-Q5_K_M.gguf) | i1-Q5_K_M | 1.4 | | +| [GGUF](https://huggingface.co/mradermacher/AceMath-1.5B-Instruct-i1-GGUF/resolve/main/AceMath-1.5B-Instruct.i1-Q6_K.gguf) | i1-Q6_K | 1.6 | practically like static Q6_K | + +Here is a handy graph by ikawrakow comparing some lower-quality quant +types (lower is better): + +![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png) + +And here are Artefact2's thoughts on the matter: +https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9 + +## FAQ / Model Request + +See https://huggingface.co/mradermacher/model_requests for some answers to +questions you might have and/or if you want some other model quantized. + +## Thanks + +I thank my company, [nethype GmbH](https://www.nethype.de/), for letting +me use its servers and providing upgrades to my workstation to enable +this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to. + + diff --git a/imatrix.dat b/imatrix.dat new file mode 100644 index 0000000..a225482 --- /dev/null +++ b/imatrix.dat @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:4b743b31ec7c9cef3667ca2a9fa69570fd9a520b5ab3c00ca243ad1715883d98 +size 2042201