From 4f6ccf8dbe301ab9039840373f6363eaaf5a46f6 Mon Sep 17 00:00:00 2001 From: ModelHub XC Date: Sat, 11 Apr 2026 13:43:00 +0800 Subject: [PATCH] =?UTF-8?q?=E5=88=9D=E5=A7=8B=E5=8C=96=E9=A1=B9=E7=9B=AE?= =?UTF-8?q?=EF=BC=8C=E7=94=B1ModelHub=20XC=E7=A4=BE=E5=8C=BA=E6=8F=90?= =?UTF-8?q?=E4=BE=9B=E6=A8=A1=E5=9E=8B?= MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Model: mradermacher/rwkv7-0.4B-world-GGUF Source: Original Platform --- .gitattributes | 47 ++++++++++++++++++++++++ README.md | 71 ++++++++++++++++++++++++++++++++++++ rwkv7-0.4B-world.IQ4_XS.gguf | 3 ++ rwkv7-0.4B-world.Q2_K.gguf | 3 ++ rwkv7-0.4B-world.Q3_K_L.gguf | 3 ++ rwkv7-0.4B-world.Q3_K_M.gguf | 3 ++ rwkv7-0.4B-world.Q3_K_S.gguf | 3 ++ rwkv7-0.4B-world.Q4_K_M.gguf | 3 ++ rwkv7-0.4B-world.Q4_K_S.gguf | 3 ++ rwkv7-0.4B-world.Q5_K_M.gguf | 3 ++ rwkv7-0.4B-world.Q5_K_S.gguf | 3 ++ rwkv7-0.4B-world.Q6_K.gguf | 3 ++ rwkv7-0.4B-world.Q8_0.gguf | 3 ++ rwkv7-0.4B-world.f16.gguf | 3 ++ 14 files changed, 154 insertions(+) create mode 100644 .gitattributes create mode 100644 README.md create mode 100644 rwkv7-0.4B-world.IQ4_XS.gguf create mode 100644 rwkv7-0.4B-world.Q2_K.gguf create mode 100644 rwkv7-0.4B-world.Q3_K_L.gguf create mode 100644 rwkv7-0.4B-world.Q3_K_M.gguf create mode 100644 rwkv7-0.4B-world.Q3_K_S.gguf create mode 100644 rwkv7-0.4B-world.Q4_K_M.gguf create mode 100644 rwkv7-0.4B-world.Q4_K_S.gguf create mode 100644 rwkv7-0.4B-world.Q5_K_M.gguf create mode 100644 rwkv7-0.4B-world.Q5_K_S.gguf create mode 100644 rwkv7-0.4B-world.Q6_K.gguf create mode 100644 rwkv7-0.4B-world.Q8_0.gguf create mode 100644 rwkv7-0.4B-world.f16.gguf diff --git a/.gitattributes b/.gitattributes new file mode 100644 index 0000000..613c145 --- /dev/null +++ b/.gitattributes @@ -0,0 +1,47 @@ +*.7z filter=lfs diff=lfs merge=lfs -text +*.arrow filter=lfs diff=lfs merge=lfs -text +*.bin filter=lfs diff=lfs merge=lfs -text +*.bz2 filter=lfs diff=lfs merge=lfs -text +*.ckpt filter=lfs diff=lfs merge=lfs -text +*.ftz filter=lfs diff=lfs merge=lfs -text +*.gz filter=lfs diff=lfs merge=lfs -text +*.h5 filter=lfs diff=lfs merge=lfs -text +*.joblib filter=lfs diff=lfs merge=lfs -text +*.lfs.* filter=lfs diff=lfs merge=lfs -text +*.mlmodel filter=lfs diff=lfs merge=lfs -text +*.model filter=lfs diff=lfs merge=lfs -text +*.msgpack filter=lfs diff=lfs merge=lfs -text +*.npy filter=lfs diff=lfs merge=lfs -text +*.npz filter=lfs diff=lfs merge=lfs -text +*.onnx filter=lfs diff=lfs merge=lfs -text +*.ot filter=lfs diff=lfs merge=lfs -text +*.parquet filter=lfs diff=lfs merge=lfs -text +*.pb filter=lfs diff=lfs merge=lfs -text +*.pickle filter=lfs diff=lfs merge=lfs -text +*.pkl filter=lfs diff=lfs merge=lfs -text +*.pt filter=lfs diff=lfs merge=lfs -text +*.pth filter=lfs diff=lfs merge=lfs -text +*.rar filter=lfs diff=lfs merge=lfs -text +*.safetensors filter=lfs diff=lfs merge=lfs -text +saved_model/**/* filter=lfs diff=lfs merge=lfs -text +*.tar.* filter=lfs diff=lfs merge=lfs -text +*.tar filter=lfs diff=lfs merge=lfs -text +*.tflite filter=lfs diff=lfs merge=lfs -text +*.tgz filter=lfs diff=lfs merge=lfs -text +*.wasm filter=lfs diff=lfs merge=lfs -text +*.xz filter=lfs diff=lfs merge=lfs -text +*.zip filter=lfs diff=lfs merge=lfs -text +*.zst filter=lfs diff=lfs merge=lfs -text +*tfevents* filter=lfs diff=lfs merge=lfs -text +rwkv7-0.4B-world.IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text +rwkv7-0.4B-world.Q2_K.gguf filter=lfs diff=lfs merge=lfs -text +rwkv7-0.4B-world.Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text +rwkv7-0.4B-world.Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text +rwkv7-0.4B-world.Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text +rwkv7-0.4B-world.Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text +rwkv7-0.4B-world.Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text +rwkv7-0.4B-world.Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text +rwkv7-0.4B-world.Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text +rwkv7-0.4B-world.Q6_K.gguf filter=lfs diff=lfs merge=lfs -text +rwkv7-0.4B-world.Q8_0.gguf filter=lfs diff=lfs merge=lfs -text +rwkv7-0.4B-world.f16.gguf filter=lfs diff=lfs merge=lfs -text diff --git a/README.md b/README.md new file mode 100644 index 0000000..02f3a2e --- /dev/null +++ b/README.md @@ -0,0 +1,71 @@ +--- +base_model: fla-hub/rwkv7-0.4B-world +language: +- en +- zh +- ja +- ko +- fr +- ar +- es +- pt +library_name: transformers +license: apache-2.0 +quantized_by: mradermacher +--- +## About + + + + + + +static quants of https://huggingface.co/fla-hub/rwkv7-0.4B-world + + +weighted/imatrix quants are available at https://huggingface.co/mradermacher/rwkv7-0.4B-world-i1-GGUF +## Usage + +If you are unsure how to use GGUF files, refer to one of [TheBloke's +READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for +more details, including on how to concatenate multi-part files. + +## Provided Quants + +(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants) + +| Link | Type | Size/GB | Notes | +|:-----|:-----|--------:|:------| +| [GGUF](https://huggingface.co/mradermacher/rwkv7-0.4B-world-GGUF/resolve/main/rwkv7-0.4B-world.Q2_K.gguf) | Q2_K | 0.3 | | +| [GGUF](https://huggingface.co/mradermacher/rwkv7-0.4B-world-GGUF/resolve/main/rwkv7-0.4B-world.Q3_K_L.gguf) | Q3_K_L | 0.4 | | +| [GGUF](https://huggingface.co/mradermacher/rwkv7-0.4B-world-GGUF/resolve/main/rwkv7-0.4B-world.Q3_K_M.gguf) | Q3_K_M | 0.4 | lower quality | +| [GGUF](https://huggingface.co/mradermacher/rwkv7-0.4B-world-GGUF/resolve/main/rwkv7-0.4B-world.Q3_K_S.gguf) | Q3_K_S | 0.4 | | +| [GGUF](https://huggingface.co/mradermacher/rwkv7-0.4B-world-GGUF/resolve/main/rwkv7-0.4B-world.IQ4_XS.gguf) | IQ4_XS | 0.4 | | +| [GGUF](https://huggingface.co/mradermacher/rwkv7-0.4B-world-GGUF/resolve/main/rwkv7-0.4B-world.Q4_K_M.gguf) | Q4_K_M | 0.4 | fast, recommended | +| [GGUF](https://huggingface.co/mradermacher/rwkv7-0.4B-world-GGUF/resolve/main/rwkv7-0.4B-world.Q4_K_S.gguf) | Q4_K_S | 0.4 | fast, recommended | +| [GGUF](https://huggingface.co/mradermacher/rwkv7-0.4B-world-GGUF/resolve/main/rwkv7-0.4B-world.Q5_K_M.gguf) | Q5_K_M | 0.4 | | +| [GGUF](https://huggingface.co/mradermacher/rwkv7-0.4B-world-GGUF/resolve/main/rwkv7-0.4B-world.Q5_K_S.gguf) | Q5_K_S | 0.4 | | +| [GGUF](https://huggingface.co/mradermacher/rwkv7-0.4B-world-GGUF/resolve/main/rwkv7-0.4B-world.Q6_K.gguf) | Q6_K | 0.5 | very good quality | +| [GGUF](https://huggingface.co/mradermacher/rwkv7-0.4B-world-GGUF/resolve/main/rwkv7-0.4B-world.Q8_0.gguf) | Q8_0 | 0.6 | fast, best quality | +| [GGUF](https://huggingface.co/mradermacher/rwkv7-0.4B-world-GGUF/resolve/main/rwkv7-0.4B-world.f16.gguf) | f16 | 1.0 | 16 bpw, overkill | + +Here is a handy graph by ikawrakow comparing some lower-quality quant +types (lower is better): + +![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png) + +And here are Artefact2's thoughts on the matter: +https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9 + +## FAQ / Model Request + +See https://huggingface.co/mradermacher/model_requests for some answers to +questions you might have and/or if you want some other model quantized. + +## Thanks + +I thank my company, [nethype GmbH](https://www.nethype.de/), for letting +me use its servers and providing upgrades to my workstation to enable +this work in my free time. + + diff --git a/rwkv7-0.4B-world.IQ4_XS.gguf b/rwkv7-0.4B-world.IQ4_XS.gguf new file mode 100644 index 0000000..b5ca04e --- /dev/null +++ b/rwkv7-0.4B-world.IQ4_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:2510a97b909e1b551c03313aea7142721337e8e576cf36485bfc17c5a214c95a +size 289161120 diff --git a/rwkv7-0.4B-world.Q2_K.gguf b/rwkv7-0.4B-world.Q2_K.gguf new file mode 100644 index 0000000..bbb2616 --- /dev/null +++ b/rwkv7-0.4B-world.Q2_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:699abfa326c531b26f0eac21254194b0b1aa6a2654db900b8a466a752f377e98 +size 214187936 diff --git a/rwkv7-0.4B-world.Q3_K_L.gguf b/rwkv7-0.4B-world.Q3_K_L.gguf new file mode 100644 index 0000000..4a9d516 --- /dev/null +++ b/rwkv7-0.4B-world.Q3_K_L.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:a0ac9e56770cb3490c71e6f3f5aa90bc690444eaffb8215f8967bc772340344d +size 251674528 diff --git a/rwkv7-0.4B-world.Q3_K_M.gguf b/rwkv7-0.4B-world.Q3_K_M.gguf new file mode 100644 index 0000000..0cb3e90 --- /dev/null +++ b/rwkv7-0.4B-world.Q3_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:932fc75f345f8477813fbea55323de50e66703c6ceea687349c95e8bb7a42f7e +size 251674528 diff --git a/rwkv7-0.4B-world.Q3_K_S.gguf b/rwkv7-0.4B-world.Q3_K_S.gguf new file mode 100644 index 0000000..b46ff33 --- /dev/null +++ b/rwkv7-0.4B-world.Q3_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:33194ca092b9a8f2a59b03cc34ad4bee3f602a347fd7d3d18e57f7b30e8ace4c +size 251674528 diff --git a/rwkv7-0.4B-world.Q4_K_M.gguf b/rwkv7-0.4B-world.Q4_K_M.gguf new file mode 100644 index 0000000..9e80c28 --- /dev/null +++ b/rwkv7-0.4B-world.Q4_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:25a6de3e6c99d36540b844752435256d97e082d92b389e51fcdef2deee3c88c0 +size 300695456 diff --git a/rwkv7-0.4B-world.Q4_K_S.gguf b/rwkv7-0.4B-world.Q4_K_S.gguf new file mode 100644 index 0000000..6ff63c7 --- /dev/null +++ b/rwkv7-0.4B-world.Q4_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:ee7422aa427b0fa6de4a235a96ec5d6aae34565f3b452df0b1d8149b68cf7365 +size 300695456 diff --git a/rwkv7-0.4B-world.Q5_K_M.gguf b/rwkv7-0.4B-world.Q5_K_M.gguf new file mode 100644 index 0000000..298ee16 --- /dev/null +++ b/rwkv7-0.4B-world.Q5_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:e4bb04a39934e8e2aaca258fbf8f2d6bd6db41dce0580f8dc383edde8b97806b +size 346832800 diff --git a/rwkv7-0.4B-world.Q5_K_S.gguf b/rwkv7-0.4B-world.Q5_K_S.gguf new file mode 100644 index 0000000..83037ac --- /dev/null +++ b/rwkv7-0.4B-world.Q5_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:30731e76fc188d089215daabf6a1c983f44761742e0fe54b17c50fa2b87665f5 +size 346832800 diff --git a/rwkv7-0.4B-world.Q6_K.gguf b/rwkv7-0.4B-world.Q6_K.gguf new file mode 100644 index 0000000..a3d7e20 --- /dev/null +++ b/rwkv7-0.4B-world.Q6_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:fb9f05ba7dd0b92ccb19b085925d01a4d13501edb6beecc9c3d5d56e8cb1a6da +size 395853728 diff --git a/rwkv7-0.4B-world.Q8_0.gguf b/rwkv7-0.4B-world.Q8_0.gguf new file mode 100644 index 0000000..ae8125c --- /dev/null +++ b/rwkv7-0.4B-world.Q8_0.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8ab668e21f5da2495e791c35d55e83b09a63e53a0649e5483182cc1778543f49 +size 501497760 diff --git a/rwkv7-0.4B-world.f16.gguf b/rwkv7-0.4B-world.f16.gguf new file mode 100644 index 0000000..163173e --- /dev/null +++ b/rwkv7-0.4B-world.f16.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:b71325c810de195c634276d9c0b124d7a24adab539e551168daf07c92ec1d179 +size 910442400