commit 9e19a4b4dc0a09c333f938b3c43c781ffb90e5c3 Author: ModelHub XC Date: Sun May 24 19:00:18 2026 +0800 初始化项目,由ModelHub XC社区提供模型 Model: mradermacher/Reasoning-Gen-8B-GGUF Source: Original Platform diff --git a/.gitattributes b/.gitattributes new file mode 100644 index 0000000..7b30b83 --- /dev/null +++ b/.gitattributes @@ -0,0 +1,47 @@ +*.7z filter=lfs diff=lfs merge=lfs -text +*.arrow filter=lfs diff=lfs merge=lfs -text +*.bin filter=lfs diff=lfs merge=lfs -text +*.bz2 filter=lfs diff=lfs merge=lfs -text +*.ckpt filter=lfs diff=lfs merge=lfs -text +*.ftz filter=lfs diff=lfs merge=lfs -text +*.gz filter=lfs diff=lfs merge=lfs -text +*.h5 filter=lfs diff=lfs merge=lfs -text +*.joblib filter=lfs diff=lfs merge=lfs -text +*.lfs.* filter=lfs diff=lfs merge=lfs -text +*.mlmodel filter=lfs diff=lfs merge=lfs -text +*.model filter=lfs diff=lfs merge=lfs -text +*.msgpack filter=lfs diff=lfs merge=lfs -text +*.npy filter=lfs diff=lfs merge=lfs -text +*.npz filter=lfs diff=lfs merge=lfs -text +*.onnx filter=lfs diff=lfs merge=lfs -text +*.ot filter=lfs diff=lfs merge=lfs -text +*.parquet filter=lfs diff=lfs merge=lfs -text +*.pb filter=lfs diff=lfs merge=lfs -text +*.pickle filter=lfs diff=lfs merge=lfs -text +*.pkl filter=lfs diff=lfs merge=lfs -text +*.pt filter=lfs diff=lfs merge=lfs -text +*.pth filter=lfs diff=lfs merge=lfs -text +*.rar filter=lfs diff=lfs merge=lfs -text +*.safetensors filter=lfs diff=lfs merge=lfs -text +saved_model/**/* filter=lfs diff=lfs merge=lfs -text +*.tar.* filter=lfs diff=lfs merge=lfs -text +*.tar filter=lfs diff=lfs merge=lfs -text +*.tflite filter=lfs diff=lfs merge=lfs -text +*.tgz filter=lfs diff=lfs merge=lfs -text +*.wasm filter=lfs diff=lfs merge=lfs -text +*.xz filter=lfs diff=lfs merge=lfs -text +*.zip filter=lfs diff=lfs merge=lfs -text +*.zst filter=lfs diff=lfs merge=lfs -text +*tfevents* filter=lfs diff=lfs merge=lfs -text +Reasoning-Gen-8B.Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Reasoning-Gen-8B.Q2_K.gguf filter=lfs diff=lfs merge=lfs -text +Reasoning-Gen-8B.Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text +Reasoning-Gen-8B.f16.gguf filter=lfs diff=lfs merge=lfs -text +Reasoning-Gen-8B.Q8_0.gguf filter=lfs diff=lfs merge=lfs -text +Reasoning-Gen-8B.Q6_K.gguf filter=lfs diff=lfs merge=lfs -text +Reasoning-Gen-8B.Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Reasoning-Gen-8B.Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text +Reasoning-Gen-8B.Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text +Reasoning-Gen-8B.Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Reasoning-Gen-8B.Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text +Reasoning-Gen-8B.IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text diff --git a/README.md b/README.md new file mode 100644 index 0000000..15ed744 --- /dev/null +++ b/README.md @@ -0,0 +1,63 @@ +--- +base_model: GusPuffy/Reasoning-Gen-8B +language: +- en +library_name: transformers +quantized_by: mradermacher +--- +## About + + + + + + +static quants of https://huggingface.co/GusPuffy/Reasoning-Gen-8B + + +weighted/imatrix quants seem not to be available (by me) at this time. If they do not show up a week or so after the static ones, I have probably not planned for them. Feel free to request them by opening a Community Discussion. +## Usage + +If you are unsure how to use GGUF files, refer to one of [TheBloke's +READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for +more details, including on how to concatenate multi-part files. + +## Provided Quants + +(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants) + +| Link | Type | Size/GB | Notes | +|:-----|:-----|--------:|:------| +| [GGUF](https://huggingface.co/mradermacher/Reasoning-Gen-8B-GGUF/resolve/main/Reasoning-Gen-8B.Q2_K.gguf) | Q2_K | 3.4 | | +| [GGUF](https://huggingface.co/mradermacher/Reasoning-Gen-8B-GGUF/resolve/main/Reasoning-Gen-8B.Q3_K_S.gguf) | Q3_K_S | 3.9 | | +| [GGUF](https://huggingface.co/mradermacher/Reasoning-Gen-8B-GGUF/resolve/main/Reasoning-Gen-8B.Q3_K_M.gguf) | Q3_K_M | 4.2 | lower quality | +| [GGUF](https://huggingface.co/mradermacher/Reasoning-Gen-8B-GGUF/resolve/main/Reasoning-Gen-8B.Q3_K_L.gguf) | Q3_K_L | 4.5 | | +| [GGUF](https://huggingface.co/mradermacher/Reasoning-Gen-8B-GGUF/resolve/main/Reasoning-Gen-8B.IQ4_XS.gguf) | IQ4_XS | 4.7 | | +| [GGUF](https://huggingface.co/mradermacher/Reasoning-Gen-8B-GGUF/resolve/main/Reasoning-Gen-8B.Q4_K_S.gguf) | Q4_K_S | 4.9 | fast, recommended | +| [GGUF](https://huggingface.co/mradermacher/Reasoning-Gen-8B-GGUF/resolve/main/Reasoning-Gen-8B.Q4_K_M.gguf) | Q4_K_M | 5.1 | fast, recommended | +| [GGUF](https://huggingface.co/mradermacher/Reasoning-Gen-8B-GGUF/resolve/main/Reasoning-Gen-8B.Q5_K_S.gguf) | Q5_K_S | 5.8 | | +| [GGUF](https://huggingface.co/mradermacher/Reasoning-Gen-8B-GGUF/resolve/main/Reasoning-Gen-8B.Q5_K_M.gguf) | Q5_K_M | 6.0 | | +| [GGUF](https://huggingface.co/mradermacher/Reasoning-Gen-8B-GGUF/resolve/main/Reasoning-Gen-8B.Q6_K.gguf) | Q6_K | 6.8 | very good quality | +| [GGUF](https://huggingface.co/mradermacher/Reasoning-Gen-8B-GGUF/resolve/main/Reasoning-Gen-8B.Q8_0.gguf) | Q8_0 | 8.8 | fast, best quality | +| [GGUF](https://huggingface.co/mradermacher/Reasoning-Gen-8B-GGUF/resolve/main/Reasoning-Gen-8B.f16.gguf) | f16 | 16.5 | 16 bpw, overkill | + +Here is a handy graph by ikawrakow comparing some lower-quality quant +types (lower is better): + +![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png) + +And here are Artefact2's thoughts on the matter: +https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9 + +## FAQ / Model Request + +See https://huggingface.co/mradermacher/model_requests for some answers to +questions you might have and/or if you want some other model quantized. + +## Thanks + +I thank my company, [nethype GmbH](https://www.nethype.de/), for letting +me use its servers and providing upgrades to my workstation to enable +this work in my free time. + + diff --git a/Reasoning-Gen-8B.IQ4_XS.gguf b/Reasoning-Gen-8B.IQ4_XS.gguf new file mode 100644 index 0000000..429f6f5 --- /dev/null +++ b/Reasoning-Gen-8B.IQ4_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:6b550810994f4b2b98b2f836942537e409636d75ac74733012a252fee4180c04 +size 4593297280 diff --git a/Reasoning-Gen-8B.Q2_K.gguf b/Reasoning-Gen-8B.Q2_K.gguf new file mode 100644 index 0000000..0e76d8e --- /dev/null +++ b/Reasoning-Gen-8B.Q2_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:501c95ad4e991e7f0f3c0764b2e95973e4f39e3148b396437df1d10cc5a285dd +size 3281733504 diff --git a/Reasoning-Gen-8B.Q3_K_L.gguf b/Reasoning-Gen-8B.Q3_K_L.gguf new file mode 100644 index 0000000..cc9d4e5 --- /dev/null +++ b/Reasoning-Gen-8B.Q3_K_L.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:f6d58d21c4e57f59487e3ff7c7743f5b0111cc8b4ec25d3d615fca805c00fac8 +size 4431394688 diff --git a/Reasoning-Gen-8B.Q3_K_M.gguf b/Reasoning-Gen-8B.Q3_K_M.gguf new file mode 100644 index 0000000..b816e2f --- /dev/null +++ b/Reasoning-Gen-8B.Q3_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:162a412a4e0031e611dbb7d222052ad0bbda4490fc1daf201f0c746d45757e49 +size 4124161920 diff --git a/Reasoning-Gen-8B.Q3_K_S.gguf b/Reasoning-Gen-8B.Q3_K_S.gguf new file mode 100644 index 0000000..99e1c2c --- /dev/null +++ b/Reasoning-Gen-8B.Q3_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:7b937a583ad24d9cc1fd251170584278f157631f0c3b8492ab3018618266cda4 +size 3769612160 diff --git a/Reasoning-Gen-8B.Q4_K_M.gguf b/Reasoning-Gen-8B.Q4_K_M.gguf new file mode 100644 index 0000000..40e1705 --- /dev/null +++ b/Reasoning-Gen-8B.Q4_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:4a411992521df9b07f9c7c4e92cf60a28bf6df0c5374b97c14bac09a285b4eed +size 5027784576 diff --git a/Reasoning-Gen-8B.Q4_K_S.gguf b/Reasoning-Gen-8B.Q4_K_S.gguf new file mode 100644 index 0000000..e45b180 --- /dev/null +++ b/Reasoning-Gen-8B.Q4_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:e2bf38be7b50753994fb6895b37a4cf538cf826d31cd4d32601a28be9f9d0003 +size 4802013056 diff --git a/Reasoning-Gen-8B.Q5_K_M.gguf b/Reasoning-Gen-8B.Q5_K_M.gguf new file mode 100644 index 0000000..64e2dab --- /dev/null +++ b/Reasoning-Gen-8B.Q5_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:d14eafcf6d3e84bceb45e4b8ef71f774c205ce442b75c34367a211c64afb9fea +size 5851113344 diff --git a/Reasoning-Gen-8B.Q5_K_S.gguf b/Reasoning-Gen-8B.Q5_K_S.gguf new file mode 100644 index 0000000..160a4a6 --- /dev/null +++ b/Reasoning-Gen-8B.Q5_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:5ecea1a2f1afc8ff366b6eebb6aec418f6b24416471f66fc4643463c3783b5bb +size 5720762240 diff --git a/Reasoning-Gen-8B.Q6_K.gguf b/Reasoning-Gen-8B.Q6_K.gguf new file mode 100644 index 0000000..72d6c73 --- /dev/null +++ b/Reasoning-Gen-8B.Q6_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:2fb7f31b473d11e3e0e1330d61c8631d7625acc1a3a298952e955a92b32e0793 +size 6725900160 diff --git a/Reasoning-Gen-8B.Q8_0.gguf b/Reasoning-Gen-8B.Q8_0.gguf new file mode 100644 index 0000000..eeea78e --- /dev/null +++ b/Reasoning-Gen-8B.Q8_0.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:cadbd33f23ccdef9a0033033b77d4c8efffd53b9c5b2664b5c61733925a9ac39 +size 8709519232 diff --git a/Reasoning-Gen-8B.f16.gguf b/Reasoning-Gen-8B.f16.gguf new file mode 100644 index 0000000..e37bec3 --- /dev/null +++ b/Reasoning-Gen-8B.f16.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:aff6d61b3b3bb1ea00e9eea34d826c5a68ab31705cc2b3f4ef95ee30e0de4938 +size 16388044672