commit 9ac8df3f6d096a1432d0dca632e93be41cca64b4 Author: ModelHub XC Date: Wed May 27 05:00:17 2026 +0800 Initialize project; model provided by the ModelHub XC community Model: mradermacher/UME-R1-2B-GGUF Source: Original Platform diff --git a/.gitattributes b/.gitattributes new file mode 100644 index 0000000..4a1ef29 --- /dev/null +++ b/.gitattributes @@ -0,0 +1,49 @@ +*.7z filter=lfs diff=lfs merge=lfs -text +*.arrow filter=lfs diff=lfs merge=lfs -text +*.bin filter=lfs diff=lfs merge=lfs -text +*.bz2 filter=lfs diff=lfs merge=lfs -text +*.ckpt filter=lfs diff=lfs merge=lfs -text +*.ftz filter=lfs diff=lfs merge=lfs -text +*.gz filter=lfs diff=lfs merge=lfs -text +*.h5 filter=lfs diff=lfs merge=lfs -text +*.joblib filter=lfs diff=lfs merge=lfs -text +*.lfs.* filter=lfs diff=lfs merge=lfs -text +*.mlmodel filter=lfs diff=lfs merge=lfs -text +*.model filter=lfs diff=lfs merge=lfs -text +*.msgpack filter=lfs diff=lfs merge=lfs -text +*.npy filter=lfs diff=lfs merge=lfs -text +*.npz filter=lfs diff=lfs merge=lfs -text +*.onnx filter=lfs diff=lfs merge=lfs -text +*.ot filter=lfs diff=lfs merge=lfs -text +*.parquet filter=lfs diff=lfs merge=lfs -text +*.pb filter=lfs diff=lfs merge=lfs -text +*.pickle filter=lfs diff=lfs merge=lfs -text +*.pkl filter=lfs diff=lfs merge=lfs -text +*.pt filter=lfs diff=lfs merge=lfs -text +*.pth filter=lfs diff=lfs merge=lfs -text +*.rar filter=lfs diff=lfs merge=lfs -text +*.safetensors filter=lfs diff=lfs merge=lfs -text +saved_model/**/* filter=lfs diff=lfs merge=lfs -text +*.tar.* filter=lfs diff=lfs merge=lfs -text +*.tar filter=lfs diff=lfs merge=lfs -text +*.tflite filter=lfs diff=lfs merge=lfs -text +*.tgz filter=lfs diff=lfs merge=lfs -text +*.wasm filter=lfs diff=lfs merge=lfs -text +*.xz filter=lfs diff=lfs merge=lfs -text +*.zip filter=lfs diff=lfs merge=lfs -text +*.zst filter=lfs diff=lfs merge=lfs -text +*tfevents* filter=lfs diff=lfs merge=lfs -text +UME-R1-2B.mmproj-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text +UME-R1-2B.mmproj-f16.gguf filter=lfs diff=lfs 
merge=lfs -text +UME-R1-2B.IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text +UME-R1-2B.Q2_K.gguf filter=lfs diff=lfs merge=lfs -text +UME-R1-2B.Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text +UME-R1-2B.Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text +UME-R1-2B.Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text +UME-R1-2B.Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text +UME-R1-2B.Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text +UME-R1-2B.Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text +UME-R1-2B.Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text +UME-R1-2B.Q6_K.gguf filter=lfs diff=lfs merge=lfs -text +UME-R1-2B.Q8_0.gguf filter=lfs diff=lfs merge=lfs -text +UME-R1-2B.f16.gguf filter=lfs diff=lfs merge=lfs -text diff --git a/README.md b/README.md new file mode 100644 index 0000000..30258a2 --- /dev/null +++ b/README.md @@ -0,0 +1,79 @@ +--- +base_model: zhibinlan/UME-R1-2B +language: +- en +library_name: transformers +license: apache-2.0 +mradermacher: + readme_rev: 1 +quantized_by: mradermacher +tags: +- Sentence Similarity +- Embedding +- zero-shot-image-classification +- video-text-to-text +--- +## About + + + + + + + + + +static quants of https://huggingface.co/zhibinlan/UME-R1-2B + + + +***For a convenient overview and download list, visit our [model page for this model](https://hf.tst.eu/model#UME-R1-2B-GGUF).*** + +weighted/imatrix quants seem not to be available (by me) at this time. If they do not show up a week or so after the static ones, I have probably not planned for them. Feel free to request them by opening a Community Discussion. +## Usage + +If you are unsure how to use GGUF files, refer to one of [TheBloke's +READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for +more details, including on how to concatenate multi-part files. + +## Provided Quants + +(sorted by size, not necessarily quality. 
IQ-quants are often preferable over similar sized non-IQ quants) + +| Link | Type | Size/GB | Notes | +|:-----|:-----|--------:|:------| +| [GGUF](https://huggingface.co/mradermacher/UME-R1-2B-GGUF/resolve/main/UME-R1-2B.Q2_K.gguf) | Q2_K | 0.8 | | +| [GGUF](https://huggingface.co/mradermacher/UME-R1-2B-GGUF/resolve/main/UME-R1-2B.mmproj-Q8_0.gguf) | mmproj-Q8_0 | 0.8 | multi-modal supplement | +| [GGUF](https://huggingface.co/mradermacher/UME-R1-2B-GGUF/resolve/main/UME-R1-2B.Q3_K_S.gguf) | Q3_K_S | 0.9 | | +| [GGUF](https://huggingface.co/mradermacher/UME-R1-2B-GGUF/resolve/main/UME-R1-2B.Q3_K_M.gguf) | Q3_K_M | 0.9 | lower quality | +| [GGUF](https://huggingface.co/mradermacher/UME-R1-2B-GGUF/resolve/main/UME-R1-2B.Q3_K_L.gguf) | Q3_K_L | 1.0 | | +| [GGUF](https://huggingface.co/mradermacher/UME-R1-2B-GGUF/resolve/main/UME-R1-2B.IQ4_XS.gguf) | IQ4_XS | 1.0 | | +| [GGUF](https://huggingface.co/mradermacher/UME-R1-2B-GGUF/resolve/main/UME-R1-2B.Q4_K_S.gguf) | Q4_K_S | 1.0 | fast, recommended | +| [GGUF](https://huggingface.co/mradermacher/UME-R1-2B-GGUF/resolve/main/UME-R1-2B.Q4_K_M.gguf) | Q4_K_M | 1.1 | fast, recommended | +| [GGUF](https://huggingface.co/mradermacher/UME-R1-2B-GGUF/resolve/main/UME-R1-2B.Q5_K_S.gguf) | Q5_K_S | 1.2 | | +| [GGUF](https://huggingface.co/mradermacher/UME-R1-2B-GGUF/resolve/main/UME-R1-2B.Q5_K_M.gguf) | Q5_K_M | 1.2 | | +| [GGUF](https://huggingface.co/mradermacher/UME-R1-2B-GGUF/resolve/main/UME-R1-2B.Q6_K.gguf) | Q6_K | 1.4 | very good quality | +| [GGUF](https://huggingface.co/mradermacher/UME-R1-2B-GGUF/resolve/main/UME-R1-2B.mmproj-f16.gguf) | mmproj-f16 | 1.4 | multi-modal supplement | +| [GGUF](https://huggingface.co/mradermacher/UME-R1-2B-GGUF/resolve/main/UME-R1-2B.Q8_0.gguf) | Q8_0 | 1.7 | fast, best quality | +| [GGUF](https://huggingface.co/mradermacher/UME-R1-2B-GGUF/resolve/main/UME-R1-2B.f16.gguf) | f16 | 3.2 | 16 bpw, overkill | + +Here is a handy graph by ikawrakow comparing some lower-quality quant +types (lower 
is better): + +![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png) + +And here are Artefact2's thoughts on the matter: +https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9 + +## FAQ / Model Request + +See https://huggingface.co/mradermacher/model_requests for some answers to +questions you might have and/or if you want some other model quantized. + +## Thanks + +I thank my company, [nethype GmbH](https://www.nethype.de/), for letting +me use its servers and providing upgrades to my workstation to enable +this work in my free time. + + diff --git a/UME-R1-2B.IQ4_XS.gguf b/UME-R1-2B.IQ4_XS.gguf new file mode 100644 index 0000000..498db24 --- /dev/null +++ b/UME-R1-2B.IQ4_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:7cde012a81708d46b094aa0f763532cbc62ac54647b1756309bf0ac9392d1b99 +size 902181600 diff --git a/UME-R1-2B.Q2_K.gguf b/UME-R1-2B.Q2_K.gguf new file mode 100644 index 0000000..0923779 --- /dev/null +++ b/UME-R1-2B.Q2_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:5624466499cd352c20e6f7bee51ff3c22167e936b46dcf1c7a5d9e89249ba03c +size 676303584 diff --git a/UME-R1-2B.Q3_K_L.gguf b/UME-R1-2B.Q3_K_L.gguf new file mode 100644 index 0000000..ab046ec --- /dev/null +++ b/UME-R1-2B.Q3_K_L.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:2be3bea5550bab512f875bc389a56d2d2475a98c07226a05b80e3d0f236602f3 +size 880161504 diff --git a/UME-R1-2B.Q3_K_M.gguf b/UME-R1-2B.Q3_K_M.gguf new file mode 100644 index 0000000..6c90dc8 --- /dev/null +++ b/UME-R1-2B.Q3_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c6420e627c740e9cdb3b821300e64c0e977dbed7fb8b86d0467908af25020898 +size 824177376 diff --git a/UME-R1-2B.Q3_K_S.gguf b/UME-R1-2B.Q3_K_S.gguf new file mode 100644 index 0000000..7c76f8d --- /dev/null +++ b/UME-R1-2B.Q3_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid 
sha256:d9fe768d625bb03c6f1f16c04a0c09e1ad8cdb94354f24c3131c104a2158e2ce +size 760943328 diff --git a/UME-R1-2B.Q4_K_M.gguf b/UME-R1-2B.Q4_K_M.gguf new file mode 100644 index 0000000..d815ba0 --- /dev/null +++ b/UME-R1-2B.Q4_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:ddddf8da41d98e9aec6fc3ddb57d99d4a295cb89e97712332b1d3bc640b5ee69 +size 986047200 diff --git a/UME-R1-2B.Q4_K_S.gguf b/UME-R1-2B.Q4_K_S.gguf new file mode 100644 index 0000000..3df05ce --- /dev/null +++ b/UME-R1-2B.Q4_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:ba7e7a42605d397bd5891f3c5f6abf49092abdf7a5b19575e2809d46b34c21af +size 940311264 diff --git a/UME-R1-2B.Q5_K_M.gguf b/UME-R1-2B.Q5_K_M.gguf new file mode 100644 index 0000000..727e5ee --- /dev/null +++ b/UME-R1-2B.Q5_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c39230da1100159bec5bd0ebd161b022e76bf70a9b446b8fe37d0e11dec8d2e7 +size 1125049056 diff --git a/UME-R1-2B.Q5_K_S.gguf b/UME-R1-2B.Q5_K_S.gguf new file mode 100644 index 0000000..689fc01 --- /dev/null +++ b/UME-R1-2B.Q5_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:6624a47d09da47017c9f98626dc28116fe488b8e41803f132073288e95d4e303 +size 1098728160 diff --git a/UME-R1-2B.Q6_K.gguf b/UME-R1-2B.Q6_K.gguf new file mode 100644 index 0000000..4213e42 --- /dev/null +++ b/UME-R1-2B.Q6_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:82c80f7af3cadc6b2539b0c9b1f73fc69324d4a1efc7c1dff3fd390749185747 +size 1272738528 diff --git a/UME-R1-2B.Q8_0.gguf b/UME-R1-2B.Q8_0.gguf new file mode 100644 index 0000000..597cc81 --- /dev/null +++ b/UME-R1-2B.Q8_0.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:907a40fdb8445af515f28b75a7145f1998758ef884b0839905012bfb2df57542 +size 1646571744 diff --git a/UME-R1-2B.f16.gguf b/UME-R1-2B.f16.gguf new file mode 100644 index 0000000..a9a6eea --- /dev/null +++ 
b/UME-R1-2B.f16.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:6ade394b377ff0b62715152cb666b915a7ec916baba85bc9404a7e5dacf9251f +size 3093668064 diff --git a/UME-R1-2B.mmproj-Q8_0.gguf b/UME-R1-2B.mmproj-Q8_0.gguf new file mode 100644 index 0000000..594dc1e --- /dev/null +++ b/UME-R1-2B.mmproj-Q8_0.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:4194ecd65ccdd45bbcdd288752718c014300ee3f25cce31f6e66d59e3fb42c79 +size 712893792 diff --git a/UME-R1-2B.mmproj-f16.gguf b/UME-R1-2B.mmproj-f16.gguf new file mode 100644 index 0000000..941506e --- /dev/null +++ b/UME-R1-2B.mmproj-f16.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8f37b8f905dbfb2f72c8b6663ebae2f65e394448b16d6d667c87bb0db65ecb6e +size 1331656032
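Each `.gguf` entry in this commit is a Git LFS pointer (a `version`, an `oid sha256:...`, and a `size` line) rather than the model data itself. As a minimal sketch, assuming you have downloaded a quant separately and want to check it against the pointer recorded here, the following Python parses a pointer and compares size and SHA-256. The helper names `parse_lfs_pointer` and `verify_file` are hypothetical, not part of any Git LFS tooling; the pointer text is copied from the `UME-R1-2B.mmproj-f16.gguf` entry above.

```python
import hashlib
from pathlib import Path

# Pointer contents copied from the UME-R1-2B.mmproj-f16.gguf entry in this diff.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:8f37b8f905dbfb2f72c8b6663ebae2f65e394448b16d6d667c87bb0db65ecb6e
size 1331656032
"""


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer (hypothetical helper)."""
    fields = {}
    for line in text.splitlines():
        key, _, value = line.strip().partition(" ")
        if key:
            fields[key] = value
    # The oid has the form "<algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def verify_file(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` matches the pointer's size and digest."""
    path = Path(path)
    # Cheap check first: a size mismatch means the download is incomplete or wrong.
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # Read in chunks so multi-gigabyte files are not loaded into memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


pointer = parse_lfs_pointer(POINTER)
```

To use it, call `verify_file("UME-R1-2B.mmproj-f16.gguf", pointer)` on the downloaded file; the local path is an assumption about where you saved it. Checking the size before hashing avoids reading a truncated download in full.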