commit 11bc744dd1791e8cacb0a9c205d5e273f087a670
Author: ModelHub XC
Date:   Sat Apr 11 19:03:55 2026 +0800

    Initialize project; model provided by the ModelHub XC community
    Model: mradermacher/TinyRP2-i1-GGUF
    Source: Original Platform

diff --git a/.gitattributes b/.gitattributes
new file mode 100644
index 0000000..632408c
--- /dev/null
+++ b/.gitattributes
@@ -0,0 +1,60 @@
+*.7z filter=lfs diff=lfs merge=lfs -text
+*.arrow filter=lfs diff=lfs merge=lfs -text
+*.bin filter=lfs diff=lfs merge=lfs -text
+*.bz2 filter=lfs diff=lfs merge=lfs -text
+*.ckpt filter=lfs diff=lfs merge=lfs -text
+*.ftz filter=lfs diff=lfs merge=lfs -text
+*.gz filter=lfs diff=lfs merge=lfs -text
+*.h5 filter=lfs diff=lfs merge=lfs -text
+*.joblib filter=lfs diff=lfs merge=lfs -text
+*.lfs.* filter=lfs diff=lfs merge=lfs -text
+*.mlmodel filter=lfs diff=lfs merge=lfs -text
+*.model filter=lfs diff=lfs merge=lfs -text
+*.msgpack filter=lfs diff=lfs merge=lfs -text
+*.npy filter=lfs diff=lfs merge=lfs -text
+*.npz filter=lfs diff=lfs merge=lfs -text
+*.onnx filter=lfs diff=lfs merge=lfs -text
+*.ot filter=lfs diff=lfs merge=lfs -text
+*.parquet filter=lfs diff=lfs merge=lfs -text
+*.pb filter=lfs diff=lfs merge=lfs -text
+*.pickle filter=lfs diff=lfs merge=lfs -text
+*.pkl filter=lfs diff=lfs merge=lfs -text
+*.pt filter=lfs diff=lfs merge=lfs -text
+*.pth filter=lfs diff=lfs merge=lfs -text
+*.rar filter=lfs diff=lfs merge=lfs -text
+*.safetensors filter=lfs diff=lfs merge=lfs -text
+saved_model/**/* filter=lfs diff=lfs merge=lfs -text
+*.tar.* filter=lfs diff=lfs merge=lfs -text
+*.tar filter=lfs diff=lfs merge=lfs -text
+*.tflite filter=lfs diff=lfs merge=lfs -text
+*.tgz filter=lfs diff=lfs merge=lfs -text
+*.wasm filter=lfs diff=lfs merge=lfs -text
+*.xz filter=lfs diff=lfs merge=lfs -text
+*.zip filter=lfs diff=lfs merge=lfs -text
+*.zst filter=lfs diff=lfs merge=lfs -text
+*tfevents* filter=lfs diff=lfs merge=lfs -text
+TinyRP2.imatrix.gguf filter=lfs diff=lfs merge=lfs -text
+TinyRP2.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text
+TinyRP2.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text
+TinyRP2.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text
+TinyRP2.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text
+TinyRP2.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text
+TinyRP2.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text
+TinyRP2.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text
+TinyRP2.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text
+TinyRP2.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text
+TinyRP2.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text
+TinyRP2.i1-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text
+TinyRP2.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
+TinyRP2.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
+TinyRP2.i1-Q2_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+TinyRP2.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
+TinyRP2.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+TinyRP2.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+TinyRP2.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
+TinyRP2.i1-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text
+TinyRP2.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+TinyRP2.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+TinyRP2.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+TinyRP2.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+TinyRP2.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
diff --git a/README.md b/README.md
new file mode 100644
index 0000000..06a884b
--- /dev/null
+++ b/README.md
@@ -0,0 +1,95 @@
+---
+base_model: hamzah0asadullah/TinyRP2
+language:
+- en
+library_name: transformers
+license: apache-2.0
+mradermacher:
+  readme_rev: 1
+quantized_by: mradermacher
+tags:
+- rp
+- roleplay
+- roleplaying
+- role-play
+- role-playing
+- reasoning
+- reason
+- thinking
+- think
+---
+## About
+
+
+
+
+
+
+
+
+
+weighted/imatrix quants of https://huggingface.co/hamzah0asadullah/TinyRP2
+
+
+
+***For a convenient overview and download list, visit our [model page for this model](https://hf.tst.eu/model#TinyRP2-i1-GGUF).***
+
+static quants are available at https://huggingface.co/mradermacher/TinyRP2-GGUF
+## Usage
+
+If you are unsure how to use GGUF files, refer to one of [TheBloke's
+READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for
+more details, including on how to concatenate multi-part files.
+
+## Provided Quants
+
+(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)
+
+| Link | Type | Size/GB | Notes |
+|:-----|:-----|--------:|:------|
+| [GGUF](https://huggingface.co/mradermacher/TinyRP2-i1-GGUF/resolve/main/TinyRP2.imatrix.gguf) | imatrix | 0.1 | imatrix file (for creating your own quants) |
+| [GGUF](https://huggingface.co/mradermacher/TinyRP2-i1-GGUF/resolve/main/TinyRP2.i1-IQ1_S.gguf) | i1-IQ1_S | 0.3 | for the desperate |
+| [GGUF](https://huggingface.co/mradermacher/TinyRP2-i1-GGUF/resolve/main/TinyRP2.i1-IQ1_M.gguf) | i1-IQ1_M | 0.3 | mostly desperate |
+| [GGUF](https://huggingface.co/mradermacher/TinyRP2-i1-GGUF/resolve/main/TinyRP2.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 0.3 |  |
+| [GGUF](https://huggingface.co/mradermacher/TinyRP2-i1-GGUF/resolve/main/TinyRP2.i1-IQ2_XS.gguf) | i1-IQ2_XS | 0.3 |  |
+| [GGUF](https://huggingface.co/mradermacher/TinyRP2-i1-GGUF/resolve/main/TinyRP2.i1-IQ2_S.gguf) | i1-IQ2_S | 0.4 |  |
+| [GGUF](https://huggingface.co/mradermacher/TinyRP2-i1-GGUF/resolve/main/TinyRP2.i1-IQ2_M.gguf) | i1-IQ2_M | 0.4 |  |
+| [GGUF](https://huggingface.co/mradermacher/TinyRP2-i1-GGUF/resolve/main/TinyRP2.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 0.4 | lower quality |
+| [GGUF](https://huggingface.co/mradermacher/TinyRP2-i1-GGUF/resolve/main/TinyRP2.i1-Q2_K_S.gguf) | i1-Q2_K_S | 0.4 | very low quality |
+| [GGUF](https://huggingface.co/mradermacher/TinyRP2-i1-GGUF/resolve/main/TinyRP2.i1-Q2_K.gguf) | i1-Q2_K | 0.4 | IQ3_XXS probably better |
+| [GGUF](https://huggingface.co/mradermacher/TinyRP2-i1-GGUF/resolve/main/TinyRP2.i1-IQ3_XS.gguf) | i1-IQ3_XS | 0.4 |  |
+| [GGUF](https://huggingface.co/mradermacher/TinyRP2-i1-GGUF/resolve/main/TinyRP2.i1-IQ3_S.gguf) | i1-IQ3_S | 0.4 | beats Q3_K* |
+| [GGUF](https://huggingface.co/mradermacher/TinyRP2-i1-GGUF/resolve/main/TinyRP2.i1-Q3_K_S.gguf) | i1-Q3_K_S | 0.4 | IQ3_XS probably better |
+| [GGUF](https://huggingface.co/mradermacher/TinyRP2-i1-GGUF/resolve/main/TinyRP2.i1-IQ3_M.gguf) | i1-IQ3_M | 0.4 |  |
+| [GGUF](https://huggingface.co/mradermacher/TinyRP2-i1-GGUF/resolve/main/TinyRP2.i1-Q3_K_M.gguf) | i1-Q3_K_M | 0.4 | IQ3_S probably better |
+| [GGUF](https://huggingface.co/mradermacher/TinyRP2-i1-GGUF/resolve/main/TinyRP2.i1-IQ4_XS.gguf) | i1-IQ4_XS | 0.5 |  |
+| [GGUF](https://huggingface.co/mradermacher/TinyRP2-i1-GGUF/resolve/main/TinyRP2.i1-Q3_K_L.gguf) | i1-Q3_K_L | 0.5 | IQ3_M probably better |
+| [GGUF](https://huggingface.co/mradermacher/TinyRP2-i1-GGUF/resolve/main/TinyRP2.i1-IQ4_NL.gguf) | i1-IQ4_NL | 0.5 | prefer IQ4_XS |
+| [GGUF](https://huggingface.co/mradermacher/TinyRP2-i1-GGUF/resolve/main/TinyRP2.i1-Q4_0.gguf) | i1-Q4_0 | 0.5 | fast, low quality |
+| [GGUF](https://huggingface.co/mradermacher/TinyRP2-i1-GGUF/resolve/main/TinyRP2.i1-Q4_K_S.gguf) | i1-Q4_K_S | 0.5 | optimal size/speed/quality |
+| [GGUF](https://huggingface.co/mradermacher/TinyRP2-i1-GGUF/resolve/main/TinyRP2.i1-Q4_K_M.gguf) | i1-Q4_K_M | 0.5 | fast, recommended |
+| [GGUF](https://huggingface.co/mradermacher/TinyRP2-i1-GGUF/resolve/main/TinyRP2.i1-Q4_1.gguf) | i1-Q4_1 | 0.5 |  |
+| [GGUF](https://huggingface.co/mradermacher/TinyRP2-i1-GGUF/resolve/main/TinyRP2.i1-Q5_K_S.gguf) | i1-Q5_K_S | 0.5 |  |
+| [GGUF](https://huggingface.co/mradermacher/TinyRP2-i1-GGUF/resolve/main/TinyRP2.i1-Q5_K_M.gguf) | i1-Q5_K_M | 0.5 |  |
+| [GGUF](https://huggingface.co/mradermacher/TinyRP2-i1-GGUF/resolve/main/TinyRP2.i1-Q6_K.gguf) | i1-Q6_K | 0.6 | practically like static Q6_K |
+
+Here is a handy graph by ikawrakow comparing some lower-quality quant
+types (lower is better):
+
+![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png)
+
+And here are Artefact2's thoughts on the matter:
+https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9
+
+## FAQ / Model Request
+
+See https://huggingface.co/mradermacher/model_requests for some answers to
+questions you might have and/or if you want some other model quantized.
+
+## Thanks
+
+I thank my company, [nethype GmbH](https://www.nethype.de/), for letting
+me use its servers and providing upgrades to my workstation to enable
+this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to.
+
+
diff --git a/TinyRP2.i1-IQ1_M.gguf b/TinyRP2.i1-IQ1_M.gguf
new file mode 100644
index 0000000..c850fa3
--- /dev/null
+++ b/TinyRP2.i1-IQ1_M.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:2adba55ade5d0d834505ffd43aded1f01c19e313ebad12d558b04e3d394b9e65
+size 216052512
diff --git a/TinyRP2.i1-IQ1_S.gguf b/TinyRP2.i1-IQ1_S.gguf
new file mode 100644
index 0000000..fabbe09
--- /dev/null
+++ b/TinyRP2.i1-IQ1_S.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:482d62d4db412f4b5344f08dc1e1f32178457c9066226b03052af5b1d971ea3f
+size 208016160
diff --git a/TinyRP2.i1-IQ2_M.gguf b/TinyRP2.i1-IQ2_M.gguf
new file mode 100644
index 0000000..653769e
--- /dev/null
+++ b/TinyRP2.i1-IQ2_M.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:fe24a80075ef544aec1709c35ba9f44e4d3a078b4c03ba658564ca024b307a4d
+size 264909600
diff --git a/TinyRP2.i1-IQ2_S.gguf b/TinyRP2.i1-IQ2_S.gguf
new file mode 100644
index 0000000..f3ac123
--- /dev/null
+++ b/TinyRP2.i1-IQ2_S.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:afab23e6983eccf2ff109468b872cdd617608acab8d4de6f9cc0e499b271d5fd
+size 254194464
diff --git a/TinyRP2.i1-IQ2_XS.gguf b/TinyRP2.i1-IQ2_XS.gguf
new file mode 100644
index 0000000..c3a1a85
--- /dev/null
+++ b/TinyRP2.i1-IQ2_XS.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:75b3064646b038ac57d709cbb6c76752efe610b08282da3d104aac2379900aef
+size 241996576
diff --git a/TinyRP2.i1-IQ2_XXS.gguf b/TinyRP2.i1-IQ2_XXS.gguf
new file mode 100644
index 0000000..3b030c1
--- /dev/null
+++ b/TinyRP2.i1-IQ2_XXS.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:d2b4950f5cf1a8227dc48f38ecdad360d31da30c7f5a2d11f430e0dce5a9cad5
+size 229446432
diff --git a/TinyRP2.i1-IQ3_M.gguf b/TinyRP2.i1-IQ3_M.gguf
new file mode 100644
index 0000000..f39e800
--- /dev/null
+++ b/TinyRP2.i1-IQ3_M.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:5a8b15a1c00e2827a2e6d5e10a0b52e94e9ca7966ff1176e4e58e7af9fdebfb4
+size 336027424
diff --git a/TinyRP2.i1-IQ3_S.gguf b/TinyRP2.i1-IQ3_S.gguf
new file mode 100644
index 0000000..b37d178
--- /dev/null
+++ b/TinyRP2.i1-IQ3_S.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:8a3edd9b81bd2549dee922ab4f043fae04c5f88191bc81150f9a9b26da82fac4
+size 323075872
diff --git a/TinyRP2.i1-IQ3_XS.gguf b/TinyRP2.i1-IQ3_XS.gguf
new file mode 100644
index 0000000..887467f
--- /dev/null
+++ b/TinyRP2.i1-IQ3_XS.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:c4b13a094275559d26dc2fa2f6202606058aba64be494c6b77ffd4f3bdb7c743
+size 312753952
diff --git a/TinyRP2.i1-IQ3_XXS.gguf b/TinyRP2.i1-IQ3_XXS.gguf
new file mode 100644
index 0000000..3205d49
--- /dev/null
+++ b/TinyRP2.i1-IQ3_XXS.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:eab4c207ddb1c91f7eac4cdc8f6a2dde00ba68d7e8dd637e5872bc218582cb29
+size 279016224
diff --git a/TinyRP2.i1-IQ4_NL.gguf b/TinyRP2.i1-IQ4_NL.gguf
new file mode 100644
index 0000000..f90b26c
--- /dev/null
+++ b/TinyRP2.i1-IQ4_NL.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:0171248b4e3db5ccc8f7f8c7ed6e53fa880e58aba85664c5a56361d824be846f
+size 381566752
diff --git a/TinyRP2.i1-IQ4_XS.gguf b/TinyRP2.i1-IQ4_XS.gguf
new file mode 100644
index 0000000..6b1f9ae
--- /dev/null
+++ b/TinyRP2.i1-IQ4_XS.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:7f408fabed1d25616b17c63166ec90ddcc4a44ac7c9cb8cd0321a3c05052eb21
+size 367804192
diff --git a/TinyRP2.i1-Q2_K.gguf b/TinyRP2.i1-Q2_K.gguf
new file mode 100644
index 0000000..42f8a31
--- /dev/null
+++ b/TinyRP2.i1-Q2_K.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:dec1734bcb30109f4fd8387f97a5034aaaf66a109277328fdd007a23fd470a4a
+size 296238880
diff --git a/TinyRP2.i1-Q2_K_S.gguf b/TinyRP2.i1-Q2_K_S.gguf
new file mode 100644
index 0000000..ab416ba
--- /dev/null
+++ b/TinyRP2.i1-Q2_K_S.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:e13b1a553ffc4b2155074d9b053feaf3d3315d5dfd7b2d1055d9ada8528e9825
+size 280559392
diff --git a/TinyRP2.i1-Q3_K_L.gguf b/TinyRP2.i1-Q3_K_L.gguf
new file mode 100644
index 0000000..9dae2bd
--- /dev/null
+++ b/TinyRP2.i1-Q3_K_L.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:87d67ef17796cf913fee1c49facf700b4b17dcade710a390c0585d3d2897083b
+size 368492320
diff --git a/TinyRP2.i1-Q3_K_M.gguf b/TinyRP2.i1-Q3_K_M.gguf
new file mode 100644
index 0000000..368881a
--- /dev/null
+++ b/TinyRP2.i1-Q3_K_M.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:708e947550bdca15aca003eb3738ccc8c5a79028601fb5e889ab6994ba8a05db
+size 347127584
diff --git a/TinyRP2.i1-Q3_K_S.gguf b/TinyRP2.i1-Q3_K_S.gguf
new file mode 100644
index 0000000..4ab869b
--- /dev/null
+++ b/TinyRP2.i1-Q3_K_S.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:e22b0ffdb9800c6435d07150fc5e453463ebf07ee7dbce0683526cd426b1dd80
+size 323075872
diff --git a/TinyRP2.i1-Q4_0.gguf b/TinyRP2.i1-Q4_0.gguf
new file mode 100644
index 0000000..eac8020
--- /dev/null
+++ b/TinyRP2.i1-Q4_0.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:2a264ca571da3f674b1a63204b6044f6a9fce40af7e78ae4668525d4bcae856f
+size 382156576
diff --git a/TinyRP2.i1-Q4_1.gguf b/TinyRP2.i1-Q4_1.gguf
new file mode 100644
index 0000000..db886cc
--- /dev/null
+++ b/TinyRP2.i1-Q4_1.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:51f74d63921480abd4bc6f500513b3226cdabd002a4ce07729d3c189534d6274
+size 409091872
diff --git a/TinyRP2.i1-Q4_K_M.gguf b/TinyRP2.i1-Q4_K_M.gguf
new file mode 100644
index 0000000..6e46e26
--- /dev/null
+++ b/TinyRP2.i1-Q4_K_M.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:cb61bf89798fad87f952fb17a1921f50f254aaeb4e32c0c8eb92d3f1914df0e7
+size 396705568
diff --git a/TinyRP2.i1-Q4_K_S.gguf b/TinyRP2.i1-Q4_K_S.gguf
new file mode 100644
index 0000000..20fd423
--- /dev/null
+++ b/TinyRP2.i1-Q4_K_S.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:aa409fc4651c1a4c57e69341baed955db5a86226fa6cfed6b7755bb1b3b0a276
+size 383270688
diff --git a/TinyRP2.i1-Q5_K_M.gguf b/TinyRP2.i1-Q5_K_M.gguf
new file mode 100644
index 0000000..86a1798
--- /dev/null
+++ b/TinyRP2.i1-Q5_K_M.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:fb0de7d9e337291e0a5aeb6a860bb89ac3819439e01ff7cb7ad6bd14883dc2d2
+size 444415776
diff --git a/TinyRP2.i1-Q5_K_S.gguf b/TinyRP2.i1-Q5_K_S.gguf
new file mode 100644
index 0000000..f33cf07
--- /dev/null
+++ b/TinyRP2.i1-Q5_K_S.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:8280af493362d203e61b4e8dd887f314a6f635000799d50658b89667d7374545
+size 436616992
diff --git a/TinyRP2.i1-Q6_K.gguf b/TinyRP2.i1-Q6_K.gguf
new file mode 100644
index 0000000..20b4fc8
--- /dev/null
+++ b/TinyRP2.i1-Q6_K.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:ab940fd3c90f10fe166322b816436a9b3e3d8b41295c47abedc0b5c6b91d7eca
+size 495107872
diff --git a/TinyRP2.imatrix.gguf b/TinyRP2.imatrix.gguf
new file mode 100644
index 0000000..6906239
--- /dev/null
+++ b/TinyRP2.imatrix.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:14eb175d7ad7a4731cac6ae5de75f0a126052e96acbb3b14b1b8b12a14e1dd51
+size 1177056