commit a37218c49c8ae4e1437e682341225e52f3e3439d
Author: ModelHub XC
Date:   Sat Jun 6 20:04:18 2026 +0800

    Initialize project; model provided by the ModelHub XC community

    Model: mradermacher/WilR-mini-i1-GGUF
    Source: Original Platform

diff --git a/.gitattributes b/.gitattributes
new file mode 100644
index 0000000..9c1ca47
--- /dev/null
+++ b/.gitattributes
@@ -0,0 +1,60 @@
+*.7z filter=lfs diff=lfs merge=lfs -text
+*.arrow filter=lfs diff=lfs merge=lfs -text
+*.bin filter=lfs diff=lfs merge=lfs -text
+*.bz2 filter=lfs diff=lfs merge=lfs -text
+*.ckpt filter=lfs diff=lfs merge=lfs -text
+*.ftz filter=lfs diff=lfs merge=lfs -text
+*.gz filter=lfs diff=lfs merge=lfs -text
+*.h5 filter=lfs diff=lfs merge=lfs -text
+*.joblib filter=lfs diff=lfs merge=lfs -text
+*.lfs.* filter=lfs diff=lfs merge=lfs -text
+*.mlmodel filter=lfs diff=lfs merge=lfs -text
+*.model filter=lfs diff=lfs merge=lfs -text
+*.msgpack filter=lfs diff=lfs merge=lfs -text
+*.npy filter=lfs diff=lfs merge=lfs -text
+*.npz filter=lfs diff=lfs merge=lfs -text
+*.onnx filter=lfs diff=lfs merge=lfs -text
+*.ot filter=lfs diff=lfs merge=lfs -text
+*.parquet filter=lfs diff=lfs merge=lfs -text
+*.pb filter=lfs diff=lfs merge=lfs -text
+*.pickle filter=lfs diff=lfs merge=lfs -text
+*.pkl filter=lfs diff=lfs merge=lfs -text
+*.pt filter=lfs diff=lfs merge=lfs -text
+*.pth filter=lfs diff=lfs merge=lfs -text
+*.rar filter=lfs diff=lfs merge=lfs -text
+*.safetensors filter=lfs diff=lfs merge=lfs -text
+saved_model/**/* filter=lfs diff=lfs merge=lfs -text
+*.tar.* filter=lfs diff=lfs merge=lfs -text
+*.tar filter=lfs diff=lfs merge=lfs -text
+*.tflite filter=lfs diff=lfs merge=lfs -text
+*.tgz filter=lfs diff=lfs merge=lfs -text
+*.wasm filter=lfs diff=lfs merge=lfs -text
+*.xz filter=lfs diff=lfs merge=lfs -text
+*.zip filter=lfs diff=lfs merge=lfs -text
+*.zst filter=lfs diff=lfs merge=lfs -text
+*tfevents* filter=lfs diff=lfs merge=lfs -text
+WilR-mini.imatrix.gguf filter=lfs diff=lfs merge=lfs -text
+WilR-mini.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text
+WilR-mini.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text
+WilR-mini.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text
+WilR-mini.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text
+WilR-mini.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text
+WilR-mini.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text
+WilR-mini.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text
+WilR-mini.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text
+WilR-mini.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text
+WilR-mini.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text
+WilR-mini.i1-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text
+WilR-mini.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
+WilR-mini.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
+WilR-mini.i1-Q2_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+WilR-mini.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
+WilR-mini.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+WilR-mini.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+WilR-mini.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
+WilR-mini.i1-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text
+WilR-mini.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+WilR-mini.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+WilR-mini.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+WilR-mini.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+WilR-mini.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
diff --git a/README.md b/README.md
new file mode 100644
index 0000000..d6a0ac9
--- /dev/null
+++ b/README.md
@@ -0,0 +1,88 @@
+---
+base_model: James7765/WilR-mini
+language:
+- en
+library_name: transformers
+license: apache-2.0
+mradermacher:
+  readme_rev: 1
+quantized_by: mradermacher
+tags:
+- text-generation-inference
+- transformers
+---
+## About
+
+
+
+
+
+
+
+
+
+weighted/imatrix quants of https://huggingface.co/James7765/WilR-mini
+
+
+
+***For a convenient overview and download list, visit our [model page for this model](https://hf.tst.eu/model#WilR-mini-i1-GGUF).***
+
+static quants are available at https://huggingface.co/mradermacher/WilR-mini-GGUF
+## Usage
+
+If you are unsure how to use GGUF files, refer to one of [TheBloke's
+READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for
+more details, including how to concatenate multi-part files.
+
+## Provided Quants
+
+(sorted by size, not necessarily quality. IQ-quants are often preferable to similar-sized non-IQ quants)
+
+| Link | Type | Size/GB | Notes |
+|:-----|:-----|--------:|:------|
+| [GGUF](https://huggingface.co/mradermacher/WilR-mini-i1-GGUF/resolve/main/WilR-mini.imatrix.gguf) | imatrix | 0.1 | imatrix file (for creating your own quants) |
+| [GGUF](https://huggingface.co/mradermacher/WilR-mini-i1-GGUF/resolve/main/WilR-mini.i1-IQ1_S.gguf) | i1-IQ1_S | 1.2 | for the desperate |
+| [GGUF](https://huggingface.co/mradermacher/WilR-mini-i1-GGUF/resolve/main/WilR-mini.i1-IQ1_M.gguf) | i1-IQ1_M | 1.3 | mostly desperate |
+| [GGUF](https://huggingface.co/mradermacher/WilR-mini-i1-GGUF/resolve/main/WilR-mini.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 1.4 |  |
+| [GGUF](https://huggingface.co/mradermacher/WilR-mini-i1-GGUF/resolve/main/WilR-mini.i1-IQ2_XS.gguf) | i1-IQ2_XS | 1.5 |  |
+| [GGUF](https://huggingface.co/mradermacher/WilR-mini-i1-GGUF/resolve/main/WilR-mini.i1-IQ2_S.gguf) | i1-IQ2_S | 1.5 |  |
+| [GGUF](https://huggingface.co/mradermacher/WilR-mini-i1-GGUF/resolve/main/WilR-mini.i1-IQ2_M.gguf) | i1-IQ2_M | 1.6 |  |
+| [GGUF](https://huggingface.co/mradermacher/WilR-mini-i1-GGUF/resolve/main/WilR-mini.i1-Q2_K_S.gguf) | i1-Q2_K_S | 1.7 | very low quality |
+| [GGUF](https://huggingface.co/mradermacher/WilR-mini-i1-GGUF/resolve/main/WilR-mini.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 1.8 | lower quality |
+| [GGUF](https://huggingface.co/mradermacher/WilR-mini-i1-GGUF/resolve/main/WilR-mini.i1-Q2_K.gguf) | i1-Q2_K | 1.8 | IQ3_XXS probably better |
+| [GGUF](https://huggingface.co/mradermacher/WilR-mini-i1-GGUF/resolve/main/WilR-mini.i1-IQ3_XS.gguf) | i1-IQ3_XS | 2.0 |  |
+| [GGUF](https://huggingface.co/mradermacher/WilR-mini-i1-GGUF/resolve/main/WilR-mini.i1-IQ3_S.gguf) | i1-IQ3_S | 2.0 | beats Q3_K* |
+| [GGUF](https://huggingface.co/mradermacher/WilR-mini-i1-GGUF/resolve/main/WilR-mini.i1-Q3_K_S.gguf) | i1-Q3_K_S | 2.0 | IQ3_XS probably better |
+| [GGUF](https://huggingface.co/mradermacher/WilR-mini-i1-GGUF/resolve/main/WilR-mini.i1-IQ3_M.gguf) | i1-IQ3_M | 2.1 |  |
+| [GGUF](https://huggingface.co/mradermacher/WilR-mini-i1-GGUF/resolve/main/WilR-mini.i1-Q3_K_M.gguf) | i1-Q3_K_M | 2.2 | IQ3_S probably better |
+| [GGUF](https://huggingface.co/mradermacher/WilR-mini-i1-GGUF/resolve/main/WilR-mini.i1-Q3_K_L.gguf) | i1-Q3_K_L | 2.3 | IQ3_M probably better |
+| [GGUF](https://huggingface.co/mradermacher/WilR-mini-i1-GGUF/resolve/main/WilR-mini.i1-IQ4_XS.gguf) | i1-IQ4_XS | 2.4 |  |
+| [GGUF](https://huggingface.co/mradermacher/WilR-mini-i1-GGUF/resolve/main/WilR-mini.i1-IQ4_NL.gguf) | i1-IQ4_NL | 2.5 | prefer IQ4_XS |
+| [GGUF](https://huggingface.co/mradermacher/WilR-mini-i1-GGUF/resolve/main/WilR-mini.i1-Q4_0.gguf) | i1-Q4_0 | 2.5 | fast, low quality |
+| [GGUF](https://huggingface.co/mradermacher/WilR-mini-i1-GGUF/resolve/main/WilR-mini.i1-Q4_K_S.gguf) | i1-Q4_K_S | 2.5 | optimal size/speed/quality |
+| [GGUF](https://huggingface.co/mradermacher/WilR-mini-i1-GGUF/resolve/main/WilR-mini.i1-Q4_K_M.gguf) | i1-Q4_K_M | 2.6 | fast, recommended |
+| [GGUF](https://huggingface.co/mradermacher/WilR-mini-i1-GGUF/resolve/main/WilR-mini.i1-Q4_1.gguf) | i1-Q4_1 | 2.7 |  |
+| [GGUF](https://huggingface.co/mradermacher/WilR-mini-i1-GGUF/resolve/main/WilR-mini.i1-Q5_K_S.gguf) | i1-Q5_K_S | 2.9 |  |
+| [GGUF](https://huggingface.co/mradermacher/WilR-mini-i1-GGUF/resolve/main/WilR-mini.i1-Q5_K_M.gguf) | i1-Q5_K_M | 2.9 |  |
+| [GGUF](https://huggingface.co/mradermacher/WilR-mini-i1-GGUF/resolve/main/WilR-mini.i1-Q6_K.gguf) | i1-Q6_K | 3.3 | practically like static Q6_K |
+
+Here is a handy graph by ikawrakow comparing some lower-quality quant
+types (lower is better):
+
+![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png)
+
+And here are Artefact2's thoughts on the matter:
+https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9
+
+## FAQ / Model Request
+
+See https://huggingface.co/mradermacher/model_requests for some answers to
+questions you might have and/or if you want some other model quantized.
+
+## Thanks
+
+I thank my company, [nethype GmbH](https://www.nethype.de/), for letting
+me use its servers and providing upgrades to my workstation to enable
+this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to.
+
+
diff --git a/WilR-mini.i1-IQ1_M.gguf b/WilR-mini.i1-IQ1_M.gguf
new file mode 100644
index 0000000..cc5f944
--- /dev/null
+++ b/WilR-mini.i1-IQ1_M.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:8e6e63890c3c08a5da2d864796452a6471878defc455b322acf9b7374af926a5
+size 1199571520
diff --git a/WilR-mini.i1-IQ1_S.gguf b/WilR-mini.i1-IQ1_S.gguf
new file mode 100644
index 0000000..c07c37f
--- /dev/null
+++ b/WilR-mini.i1-IQ1_S.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:9711ebbc858b288829507d04ea03a86f58dfb61f19c3feb65da73e9802047104
+size 1133093440
diff --git a/WilR-mini.i1-IQ2_M.gguf b/WilR-mini.i1-IQ2_M.gguf
new file mode 100644
index 0000000..1e6d5ea
--- /dev/null
+++ b/WilR-mini.i1-IQ2_M.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:e86389a4714bb8a8c7d6b90685f45ffe66c6d0a16e3522cd595d81606908108c
+size 1537983040
diff --git a/WilR-mini.i1-IQ2_S.gguf b/WilR-mini.i1-IQ2_S.gguf
new file mode 100644
index 0000000..32f49dd
--- /dev/null
+++ b/WilR-mini.i1-IQ2_S.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:c8d1b3169fdf0808bd07afdd8ee63b30c41f51050f4908cad2957dbdc54086cf
+size 1449345600
diff --git a/WilR-mini.i1-IQ2_XS.gguf b/WilR-mini.i1-IQ2_XS.gguf
new file mode 100644
index 0000000..1301f5e
--- /dev/null
+++ b/WilR-mini.i1-IQ2_XS.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:146383ae72fa00a868e2e47edd7ba4a6a29f08b312f11d94eede2ec5795cc9f5
+size 1404576320
diff --git a/WilR-mini.i1-IQ2_XXS.gguf b/WilR-mini.i1-IQ2_XXS.gguf
new file mode 100644
index 0000000..5c63120
--- /dev/null
+++ b/WilR-mini.i1-IQ2_XXS.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:a784c13f44f637e6ba4d978e9d7486c10b6307c0be46f8938386597c5f654bdc
+size 1310368320
diff --git a/WilR-mini.i1-IQ3_M.gguf b/WilR-mini.i1-IQ3_M.gguf
new file mode 100644
index 0000000..bacc52d
--- /dev/null
+++ b/WilR-mini.i1-IQ3_M.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:9fbc27692a143b71424f4d0ac0fdc2ed7e0ddd834f39dd47a4da16922008e9db
+size 1986803520
diff --git a/WilR-mini.i1-IQ3_S.gguf b/WilR-mini.i1-IQ3_S.gguf
new file mode 100644
index 0000000..5890a9b
--- /dev/null
+++ b/WilR-mini.i1-IQ3_S.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:37479ec66ea004bda6669c7c92341570f4267feb2049189360a391f3d01fb1d9
+size 1937364800
diff --git a/WilR-mini.i1-IQ3_XS.gguf b/WilR-mini.i1-IQ3_XS.gguf
new file mode 100644
index 0000000..28ee1c7
--- /dev/null
+++ b/WilR-mini.i1-IQ3_XS.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:b0c057da034d8c736fa4a3ba224c209bba530858d274b073a7f77db8d4eb3809
+size 1863391040
diff --git a/WilR-mini.i1-IQ3_XXS.gguf b/WilR-mini.i1-IQ3_XXS.gguf
new file mode 100644
index 0000000..a13dd45
--- /dev/null
+++ b/WilR-mini.i1-IQ3_XXS.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:19611279c8652aa5fba00cbc2fda37cfb5be0f71127239d7edb21eab815729b3
+size 1689453120
diff --git a/WilR-mini.i1-IQ4_NL.gguf b/WilR-mini.i1-IQ4_NL.gguf
new file mode 100644
index 0000000..e0477c1
--- /dev/null
+++ b/WilR-mini.i1-IQ4_NL.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:a96c6f3c55a71db6d55978e15875c0a4924b8bc2ebb2b8ebd9009c13130e3cc9
+size 2363512640
diff --git a/WilR-mini.i1-IQ4_XS.gguf b/WilR-mini.i1-IQ4_XS.gguf
new file mode 100644
index 0000000..084f37a
--- /dev/null
+++ b/WilR-mini.i1-IQ4_XS.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:da53ba81386a3c827c50e3f2df1661e9e2a4c99a172cf2f39f865c3e2c1371f9
+size 2263242560
diff --git a/WilR-mini.i1-Q2_K.gguf b/WilR-mini.i1-Q2_K.gguf
new file mode 100644
index 0000000..bbb5d78
--- /dev/null
+++ b/WilR-mini.i1-Q2_K.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:bbe750efbc5fb1504d1cdf6c5d3deef8b378b8f803c0234861fd64704c8ea61c
+size 1729165120
diff --git a/WilR-mini.i1-Q2_K_S.gguf b/WilR-mini.i1-Q2_K_S.gguf
new file mode 100644
index 0000000..8345493
--- /dev/null
+++ b/WilR-mini.i1-Q2_K_S.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:2b036ad277132d0c4e9b49d1af2ea7c0d842523982cac2a5885971c5062f6335
+size 1636063040
diff --git a/WilR-mini.i1-Q3_K_L.gguf b/WilR-mini.i1-Q3_K_L.gguf
new file mode 100644
index 0000000..4c02576
--- /dev/null
+++ b/WilR-mini.i1-Q3_K_L.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:41b8c2774c5a27fb977ddb8bbaaac149694c39a85d0a9ecd5996a1753af25e5d
+size 2236086080
diff --git a/WilR-mini.i1-Q3_K_M.gguf b/WilR-mini.i1-Q3_K_M.gguf
new file mode 100644
index 0000000..4833869
--- /dev/null
+++ b/WilR-mini.i1-Q3_K_M.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:64fff0cd03b79ee48bf1ba28f9abd0b069655814fbb7702cb165531443e91fc8
+size 2098460480
diff --git a/WilR-mini.i1-Q3_K_S.gguf b/WilR-mini.i1-Q3_K_S.gguf
new file mode 100644
index 0000000..fdc81c5
--- /dev/null
+++ b/WilR-mini.i1-Q3_K_S.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:1125ebbb273b12c94ff09fe14c366f7db2fb4c83a741f534a4c8bc563872a985
+size 1937364800
diff --git a/WilR-mini.i1-Q4_0.gguf b/WilR-mini.i1-Q4_0.gguf
new file mode 100644
index 0000000..35db5e5
--- /dev/null
+++ b/WilR-mini.i1-Q4_0.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:06ea7c5145dd1a53431a1a556c9c34a9ff1cc1b3acad527eaf493ca8b2c01031
+size 2370066240
diff --git a/WilR-mini.i1-Q4_1.gguf b/WilR-mini.i1-Q4_1.gguf
new file mode 100644
index 0000000..b75dc56
--- /dev/null
+++ b/WilR-mini.i1-Q4_1.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:2378f804a841020d90b5e14fca9c40a4614fa48124b2118c99e4cbe8a47f41db
+size 2564052800
diff --git a/WilR-mini.i1-Q4_K_M.gguf b/WilR-mini.i1-Q4_K_M.gguf
new file mode 100644
index 0000000..82949d2
--- /dev/null
+++ b/WilR-mini.i1-Q4_K_M.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:a416bc812cf59535a900a0a5c11a24c375fc599007534c53d8a2465eaf1cb846
+size 2489894720
diff --git a/WilR-mini.i1-Q4_K_S.gguf b/WilR-mini.i1-Q4_K_S.gguf
new file mode 100644
index 0000000..171f4c1
--- /dev/null
+++ b/WilR-mini.i1-Q4_K_S.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:55b850b5bdfd34558b2dddee05a769e954da4f8d5247faf8f2e68f33ab84f011
+size 2377930560
diff --git a/WilR-mini.i1-Q5_K_M.gguf b/WilR-mini.i1-Q5_K_M.gguf
new file mode 100644
index 0000000..acadd2f
--- /dev/null
+++ b/WilR-mini.i1-Q5_K_M.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:84319092a5fc64c55fa978e6ed7f14878fcd60b90c6704962902e3959d1e38eb
+size 2829698880
diff --git a/WilR-mini.i1-Q5_K_S.gguf b/WilR-mini.i1-Q5_K_S.gguf
new file mode 100644
index 0000000..880ec24
--- /dev/null
+++ b/WilR-mini.i1-Q5_K_S.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:da3c8a83f2689f34525bb00d1c71ed45acf3a76abc3b9e6f6c83f2c2037f9d94
+size 2764592960
diff --git a/WilR-mini.i1-Q6_K.gguf b/WilR-mini.i1-Q6_K.gguf
new file mode 100644
index 0000000..02d0eb8
--- /dev/null
+++ b/WilR-mini.i1-Q6_K.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:2536d3dedfe0083cfeca59ae16e8447f9bf2d83ec8e11a1dfaa4e40f3c8eeae8
+size 3190740800
diff --git a/WilR-mini.imatrix.gguf b/WilR-mini.imatrix.gguf
new file mode 100644
index 0000000..1c8b8ff
--- /dev/null
+++ b/WilR-mini.imatrix.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:c98011b273c2bfcc0dde3ac28f21c2dd600a74903d625d4a5d05d8ddc75f8701
+size 3448608
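
Each `.gguf` entry in this commit is a Git LFS pointer (a `version` line, an `oid sha256:` digest, and a `size` in bytes) rather than the model file itself. As a minimal sketch, assuming you have downloaded a file separately and want to check it against its pointer, the Python below parses a pointer and compares the size and digest. The names `parse_lfs_pointer`, `verify_download`, and `IMATRIX_POINTER` are hypothetical helpers written for this illustration and are not part of the repository or of any Git LFS tooling.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form "<algorithm>:<hex digest>", e.g. "sha256:abc...".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def verify_download(path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` matches the pointer's size and digest."""
    path = Path(path)
    # Cheap check first: a size mismatch means the file cannot match.
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # Read in chunks so multi-gigabyte files are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer contents copied from the WilR-mini.imatrix.gguf entry in this commit.
IMATRIX_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:c98011b273c2bfcc0dde3ac28f21c2dd600a74903d625d4a5d05d8ddc75f8701
size 3448608
"""
```

With a local copy of the file in the working directory (an assumption), `verify_download("WilR-mini.imatrix.gguf", parse_lfs_pointer(IMATRIX_POINTER))` would report whether the download matches the pointer recorded here. The size is compared before hashing so that truncated downloads are rejected without reading the whole file.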