commit a864bd6eab9137cde4a3a0cec8243e280975d0d4 Author: ModelHub XC Date: Sun Apr 26 21:51:31 2026 +0800 初始化项目,由ModelHub XC社区提供模型 Model: mradermacher/Yumo-nano-GGUF Source: Original Platform diff --git a/.gitattributes b/.gitattributes new file mode 100644 index 0000000..21d7bb8 --- /dev/null +++ b/.gitattributes @@ -0,0 +1,47 @@ +*.7z filter=lfs diff=lfs merge=lfs -text +*.arrow filter=lfs diff=lfs merge=lfs -text +*.bin filter=lfs diff=lfs merge=lfs -text +*.bz2 filter=lfs diff=lfs merge=lfs -text +*.ckpt filter=lfs diff=lfs merge=lfs -text +*.ftz filter=lfs diff=lfs merge=lfs -text +*.gz filter=lfs diff=lfs merge=lfs -text +*.h5 filter=lfs diff=lfs merge=lfs -text +*.joblib filter=lfs diff=lfs merge=lfs -text +*.lfs.* filter=lfs diff=lfs merge=lfs -text +*.mlmodel filter=lfs diff=lfs merge=lfs -text +*.model filter=lfs diff=lfs merge=lfs -text +*.msgpack filter=lfs diff=lfs merge=lfs -text +*.npy filter=lfs diff=lfs merge=lfs -text +*.npz filter=lfs diff=lfs merge=lfs -text +*.onnx filter=lfs diff=lfs merge=lfs -text +*.ot filter=lfs diff=lfs merge=lfs -text +*.parquet filter=lfs diff=lfs merge=lfs -text +*.pb filter=lfs diff=lfs merge=lfs -text +*.pickle filter=lfs diff=lfs merge=lfs -text +*.pkl filter=lfs diff=lfs merge=lfs -text +*.pt filter=lfs diff=lfs merge=lfs -text +*.pth filter=lfs diff=lfs merge=lfs -text +*.rar filter=lfs diff=lfs merge=lfs -text +*.safetensors filter=lfs diff=lfs merge=lfs -text +saved_model/**/* filter=lfs diff=lfs merge=lfs -text +*.tar.* filter=lfs diff=lfs merge=lfs -text +*.tar filter=lfs diff=lfs merge=lfs -text +*.tflite filter=lfs diff=lfs merge=lfs -text +*.tgz filter=lfs diff=lfs merge=lfs -text +*.wasm filter=lfs diff=lfs merge=lfs -text +*.xz filter=lfs diff=lfs merge=lfs -text +*.zip filter=lfs diff=lfs merge=lfs -text +*.zst filter=lfs diff=lfs merge=lfs -text +*tfevents* filter=lfs diff=lfs merge=lfs -text +Yumo-nano.IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text +Yumo-nano.Q2_K.gguf filter=lfs diff=lfs merge=lfs -text +Yumo-nano.Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text +Yumo-nano.Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text +Yumo-nano.Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Yumo-nano.Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text +Yumo-nano.Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Yumo-nano.Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text +Yumo-nano.Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Yumo-nano.Q6_K.gguf filter=lfs diff=lfs merge=lfs -text +Yumo-nano.Q8_0.gguf filter=lfs diff=lfs merge=lfs -text +Yumo-nano.f16.gguf filter=lfs diff=lfs merge=lfs -text diff --git a/README.md b/README.md new file mode 100644 index 0000000..9d7c5c1 --- /dev/null +++ b/README.md @@ -0,0 +1,87 @@ +--- +base_model: YU-MO/Yumo-nano +datasets: +- YU-MO/Yumo-dataset +- EleutherAI/hendrycks_math +language: +- en +- es +library_name: transformers +license: apache-2.0 +mradermacher: + readme_rev: 1 +quantized_by: mradermacher +tags: +- reasoning +- unsloth +- pytorch +- bilingual +- opceanai +- yumo +- fine-tuned +- chat +- deepseek +- qwen2 +--- +## About + + + + + + + + + +static quants of https://huggingface.co/YU-MO/Yumo-nano + + + +***For a convenient overview and download list, visit our [model page for this model](https://hf.tst.eu/model#Yumo-nano-GGUF).*** + +weighted/imatrix quants seem not to be available (by me) at this time. If they do not show up a week or so after the static ones, I have probably not planned for them. Feel free to request them by opening a Community Discussion. +## Usage + +If you are unsure how to use GGUF files, refer to one of [TheBloke's +READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for +more details, including on how to concatenate multi-part files. + +## Provided Quants + +(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants) + +| Link | Type | Size/GB | Notes | +|:-----|:-----|--------:|:------| +| [GGUF](https://huggingface.co/mradermacher/Yumo-nano-GGUF/resolve/main/Yumo-nano.Q2_K.gguf) | Q2_K | 0.9 | | +| [GGUF](https://huggingface.co/mradermacher/Yumo-nano-GGUF/resolve/main/Yumo-nano.Q3_K_S.gguf) | Q3_K_S | 1.0 | | +| [GGUF](https://huggingface.co/mradermacher/Yumo-nano-GGUF/resolve/main/Yumo-nano.Q3_K_M.gguf) | Q3_K_M | 1.0 | lower quality | +| [GGUF](https://huggingface.co/mradermacher/Yumo-nano-GGUF/resolve/main/Yumo-nano.Q3_K_L.gguf) | Q3_K_L | 1.1 | | +| [GGUF](https://huggingface.co/mradermacher/Yumo-nano-GGUF/resolve/main/Yumo-nano.IQ4_XS.gguf) | IQ4_XS | 1.1 | | +| [GGUF](https://huggingface.co/mradermacher/Yumo-nano-GGUF/resolve/main/Yumo-nano.Q4_K_S.gguf) | Q4_K_S | 1.2 | fast, recommended | +| [GGUF](https://huggingface.co/mradermacher/Yumo-nano-GGUF/resolve/main/Yumo-nano.Q4_K_M.gguf) | Q4_K_M | 1.2 | fast, recommended | +| [GGUF](https://huggingface.co/mradermacher/Yumo-nano-GGUF/resolve/main/Yumo-nano.Q5_K_S.gguf) | Q5_K_S | 1.4 | | +| [GGUF](https://huggingface.co/mradermacher/Yumo-nano-GGUF/resolve/main/Yumo-nano.Q5_K_M.gguf) | Q5_K_M | 1.4 | | +| [GGUF](https://huggingface.co/mradermacher/Yumo-nano-GGUF/resolve/main/Yumo-nano.Q6_K.gguf) | Q6_K | 1.6 | very good quality | +| [GGUF](https://huggingface.co/mradermacher/Yumo-nano-GGUF/resolve/main/Yumo-nano.Q8_0.gguf) | Q8_0 | 2.0 | fast, best quality | +| [GGUF](https://huggingface.co/mradermacher/Yumo-nano-GGUF/resolve/main/Yumo-nano.f16.gguf) | f16 | 3.7 | 16 bpw, overkill | + +Here is a handy graph by ikawrakow comparing some lower-quality quant +types (lower is better): + +![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png) + +And here are Artefact2's thoughts on the matter: +https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9 + +## FAQ / Model Request + +See https://huggingface.co/mradermacher/model_requests for some answers to +questions you might have and/or if you want some other model quantized. + +## Thanks + +I thank my company, [nethype GmbH](https://www.nethype.de/), for letting +me use its servers and providing upgrades to my workstation to enable +this work in my free time. + + diff --git a/Yumo-nano.IQ4_XS.gguf b/Yumo-nano.IQ4_XS.gguf new file mode 100644 index 0000000..15e035f --- /dev/null +++ b/Yumo-nano.IQ4_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:416b52491040f883522e1a1f3a50d2b10de84076aa8c5088f5be6a39b00732c6 +size 1026163360 diff --git a/Yumo-nano.Q2_K.gguf b/Yumo-nano.Q2_K.gguf new file mode 100644 index 0000000..cc0831a --- /dev/null +++ b/Yumo-nano.Q2_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:54f579018bd82527f4d2d625bb8db3b2d51c77f6ea003cfdd9a9a4eb4fd8583a +size 752881312 diff --git a/Yumo-nano.Q3_K_L.gguf b/Yumo-nano.Q3_K_L.gguf new file mode 100644 index 0000000..ecd8950 --- /dev/null +++ b/Yumo-nano.Q3_K_L.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:ae58dd9212e457f660293f800fb19a61c0dfdae1e0364ee57ee564849bec2625 +size 980441248 diff --git a/Yumo-nano.Q3_K_M.gguf b/Yumo-nano.Q3_K_M.gguf new file mode 100644 index 0000000..b41e49a --- /dev/null +++ b/Yumo-nano.Q3_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:f25d266335a1652581f1792b17ea72ac81e1b907e5dc41142c8493e7b4a7aa68 +size 924457120 diff --git a/Yumo-nano.Q3_K_S.gguf b/Yumo-nano.Q3_K_S.gguf new file mode 100644 index 0000000..39e609c --- /dev/null +++ b/Yumo-nano.Q3_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:1ef7e524342bd208fb728c25a1c34c65ecb62e9b3c1ce1283147759e83af98ee +size 861223072 diff --git a/Yumo-nano.Q4_K_M.gguf b/Yumo-nano.Q4_K_M.gguf new file mode 100644 index 0000000..44a37bd --- /dev/null +++ b/Yumo-nano.Q4_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8ecf1bf8343df704b921abd4cc8538f4e6ee163d7961a26780161c9246f94e3c +size 1117321888 diff --git a/Yumo-nano.Q4_K_S.gguf b/Yumo-nano.Q4_K_S.gguf new file mode 100644 index 0000000..f131c31 --- /dev/null +++ b/Yumo-nano.Q4_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:7ae17afb8cadf7882f7fb52a0c1eaf2fc308b6217d71e4cfab6359f5fea3cd85 +size 1071585952 diff --git a/Yumo-nano.Q5_K_M.gguf b/Yumo-nano.Q5_K_M.gguf new file mode 100644 index 0000000..e3dede3 --- /dev/null +++ b/Yumo-nano.Q5_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:3dbcbb6830206205e18dc0e9caff354408aefbef8733ea0e0d0f27293f683158 +size 1285495456 diff --git a/Yumo-nano.Q5_K_S.gguf b/Yumo-nano.Q5_K_S.gguf new file mode 100644 index 0000000..725b2ae --- /dev/null +++ b/Yumo-nano.Q5_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:5ff65b68290f74518f17ff1664ccb94c647938cb71ee1297cf4020c74e0d67bb +size 1259174560 diff --git a/Yumo-nano.Q6_K.gguf b/Yumo-nano.Q6_K.gguf new file mode 100644 index 0000000..b704918 --- /dev/null +++ b/Yumo-nano.Q6_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:e3b8980f5e5d390de90ad16d2831d7e0a016bf57e3972e6c82e5c623d62a43b4 +size 1464179872 diff --git a/Yumo-nano.Q8_0.gguf b/Yumo-nano.Q8_0.gguf new file mode 100644 index 0000000..12a78e3 --- /dev/null +++ b/Yumo-nano.Q8_0.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:3086148273cecda088fad7b825f9c70ad9a85bef40932523ccaa06d32d478a49 +size 1894533280 diff --git a/Yumo-nano.f16.gguf b/Yumo-nano.f16.gguf new file mode 100644 index 0000000..d296a9d --- /dev/null +++ b/Yumo-nano.f16.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:3f16b73bef2afac2977d402cf6a9fb6800a3bd0e46d26d9d9c7733ddad22b737 +size 3560417440