Initialize project; model provided by the ModelHub XC community
Model: mradermacher/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B-i1-GGUF Source: Original Platform
60
.gitattributes
vendored
Normal file
@@ -0,0 +1,60 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
imatrix.dat filter=lfs diff=lfs merge=lfs -text
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q2_K_S.gguf filter=lfs diff=lfs merge=lfs -text
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
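The `.gitattributes` entries above route matching paths through Git LFS. As a rough illustration only, the sketch below checks a path against a handful of those patterns. It uses Python's `fnmatch`, which only approximates git's own pattern rules (for example, git lets `/**/` match zero directories, `fnmatch` does not), and the function name `is_lfs_tracked` is hypothetical.

```python
from fnmatch import fnmatchcase
from pathlib import PurePosixPath

# A few of the patterns from the .gitattributes above (not the full set).
LFS_PATTERNS = [
    "*.bin",
    "*.safetensors",
    "*.tar.*",
    "*tfevents*",
    "saved_model/**/*",
    "imatrix.dat",
    "Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q4_K_M.gguf",
]


def is_lfs_tracked(path: str, patterns=LFS_PATTERNS) -> bool:
    """Approximate check: does `path` match any of the LFS patterns?

    Patterns without a slash are compared against the file name only;
    patterns with a slash are compared against the whole relative path.
    This is an fnmatch-based approximation, not git's exact matching.
    """
    posix = PurePosixPath(path)
    for pattern in patterns:
        target = str(posix) if "/" in pattern else posix.name
        if fnmatchcase(target, pattern):
            return True
    return False
```

Note that this commit lists every `.gguf` file by name rather than using a `*.gguf` wildcard, so under these patterns an arbitrary new `.gguf` file would not be picked up.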
3
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ1_M.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:63acf702ebdef0f1924ea39890b0beaad79dc96e414f93c6f357f4cc8a8937aa
size 464465248
3
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ1_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:27059586c28a1057d7032de44c867b21819314c18c3d2bfd5b6ba84982860542
size 436531552
3
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ2_M.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:c92de2c80ecf63eea7a403dc84e07c15fc07ec3522f703af796e7ee0828e79fb
size 601058656
3
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ2_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:c2b9056542c229a49f120bc802ef1e86054c6b09264a78d5d5f7f5fad6fc2e56
size 563813728
3
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ2_XS.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:1c5f4a29fd24ba25c46da28bc4988b6b9bc695bd8a23814ae51ff0f262d16ada
size 550330720
3
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ2_XXS.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:367d9496d89e5ad06678dbd9b2b4cadfd4c44684e9548951e43e716651d05105
size 511021408
3
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ3_M.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:ef33d7db8352af23ab557c7acf03f2a05107e9490f8939fd31e83f96cca1329e
size 776668000
3
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ3_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:f95de5c72f03c2a6edd872a739e679f83be659bae6265c38333dff3c05936d11
size 762410848
3
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ3_XS.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:0efd7994ad0b880e320e899cb15c5830c107647cfb82d029dc18256d7309c28e
size 731703136
3
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ3_XXS.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:e25885bfd0b86693f2f5394f389f1d743d988a748edd805135d9839c8a69c814
size 668796256
3
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ4_NL.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:61134641bef0ae178d37a3b336d5ffe60b7fb1227c3d4a19476fa539ee9dd5bc
size 936335200
3
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ4_XS.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:2b1d49ab71fa9f5c5b7891ca16e2724e6fa69ada8617d5c33bafad3eff73cd66
size 895735648
3
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q2_K.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:4e1637f7eb9a6029b4f5ea4a13389e77d3a86c103a909720d7847f7e1b5cdbd3
size 676308832
3
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q2_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:e172e9168b7d4a66eb07efb523335b76ccf4916fc8045401d451b94ce2d4174b
size 640139104
3
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q3_K_L.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:807ed1e359070feb547a21cc03bedde19c1e4726e504f4a8b7b8f620686fc432
size 880166752
3
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q3_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:334874ee5e19417a88fef5f707c658b22b2d01027f70bbac8ed83d1f8b2dba61
size 824182624
3
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q3_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:afaf316a27c093a954396600117447be18d8ce21f8ad1a7e4eb38a99f849f2fb
size 760948576
3
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q4_0.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:373d6f300c7c7470c257d735021be2d1dc3f2b009f5b7ab697e0bd7183d1d3b8
size 937539424
3
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q4_1.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:1628e2aabf6ea2e28456f485ca344c6cbb36c653cf8626a9787d454e2dbda829
size 1016846176
3
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q4_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:5a10daed47209c3bd274f7a6d9a95fef9695237fb58251730b92ce55df58f788
size 986052448
3
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q4_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:4b1d5f6dc8477a3078c8905041ec928ac467d0a9cdc1dca0b0f148855d26f92b
size 940316512
3
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q5_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a65d608c7f6769c38a2d7f39ff4cfab8e4fa223f1b869e5badf1462678062861
size 1125054304
3
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q5_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:e3c3f0b55b818c72c45b0c04f3f1b4b6769d4d7d22c7a95566b2a714834aa989
size 1098733408
3
Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q6_K.gguf
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:cd9201736f51f25b22d6dfea77c053ac5b174d92394f3aa633c069ab1d913950
size 1272743776
95
README.md
Normal file
@@ -0,0 +1,95 @@
---
base_model: BeaverAI/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B
datasets:
- PJMixers-Dev/allura-org_gryphe-sonnet-3.5-charcards-names-added-qwq-all-aphrodite-Shuffled
- PJMixers-Dev/anthracite-org_c2_logs_32k_llama3_qwen2_v1.3-qwq-all-aphrodite-Shuffled
- PJMixers-Dev/grimulkan_aicg-logs-augmented-system-qwq-all-aphrodite-Shuffled
- PJMixers-Dev/grimulkan_jannie-log-augmented-system-qwq-all-aphrodite-Shuffled
- PJMixers-Dev/grimulkan_PIPPA-augmented-dedup-system-qwq-all-aphrodite-Shuffled
- PJMixers-Dev/lemonilia_LimaRP-Only-NonSus-Simple-CustomShareGPT-qwq-all-aphrodite-Shuffled
- PJMixers-Dev/MinervaAI_Aesir-Preview-Anon-qwq-all-aphrodite-Shuffled
- PJMixers-Dev/NyxKrage_chub-logs-sharegpt-longest-CustomShareGPT-qwq-all-aphrodite-Shuffled
- PJMixers-Dev/PocketDoc_Dans-Prosemaxx-Cowriter-XL-8192-shrunk-l3-qwq-all-aphrodite-Shuffled
- PJMixers-Dev/PocketDoc_Dans-Personamaxx-Rainy-qwq-all-aphrodite-Shuffled
language:
- en
library_name: transformers
license: apache-2.0
mradermacher:
  readme_rev: 1
quantized_by: mradermacher
tags:
- axolotl
- generated_from_trainer
---
## About

<!-- ### quantize_version: 2 -->
<!-- ### output_tensor_quantised: 1 -->
<!-- ### convert_type: hf -->
<!-- ### vocab_type: -->
<!-- ### tags: nicoboss -->
Weighted/imatrix quants of https://huggingface.co/BeaverAI/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B

<!-- provided-files -->

***For a convenient overview and download list, visit our [model page for this model](https://hf.tst.eu/model#Qwen2.5-QwQ-RP-Draft-v0.2-1.5B-i1-GGUF).***

Static quants are available at https://huggingface.co/mradermacher/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B-GGUF
## Usage

If you are unsure how to use GGUF files, refer to one of [TheBloke's
READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for
more details, including how to concatenate multi-part files.

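None of the files in this commit are split, but since the Usage note mentions concatenating multi-part files, here is a minimal sketch of what that step can look like. It assumes a hypothetical `<name>.partNofM` naming scheme and plain byte-for-byte concatenation in numeric order; check the linked README for the scheme a given repository actually uses. The function name `concatenate_parts` is illustrative only.

```python
import re
import shutil
from pathlib import Path

# Hypothetical naming scheme: "<name>.part1of2", "<name>.part2of2", ...
PART_RE = re.compile(r"^(?P<base>.+)\.part(?P<index>\d+)of(?P<total>\d+)$")


def concatenate_parts(directory: Path, base_name: str) -> Path:
    """Join all parts of `base_name` found in `directory` into one file."""
    parts = {}
    total = None
    for path in directory.iterdir():
        match = PART_RE.match(path.name)
        if match and match["base"] == base_name:
            parts[int(match["index"])] = path
            total = int(match["total"])
    # Refuse to write a truncated file if any part is missing.
    if total is None or sorted(parts) != list(range(1, total + 1)):
        raise FileNotFoundError(f"missing parts for {base_name}")

    output = directory / base_name
    with output.open("wb") as out:
        # Append the parts in numeric order (so part10 comes after part9).
        for index in range(1, total + 1):
            with parts[index].open("rb") as src:
                shutil.copyfileobj(src, out)
    return output
```

Sorting by the parsed integer index rather than by file name avoids the usual lexicographic pitfall where `part10` would sort before `part2`.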
## Provided Quants

(Sorted by size, not necessarily by quality. IQ quants are often preferable to similarly sized non-IQ quants.)

| Link | Type | Size/GB | Notes |
|:-----|:-----|--------:|:------|
| [GGUF](https://huggingface.co/mradermacher/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B-i1-GGUF/resolve/main/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ1_S.gguf) | i1-IQ1_S | 0.5 | for the desperate |
| [GGUF](https://huggingface.co/mradermacher/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B-i1-GGUF/resolve/main/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ1_M.gguf) | i1-IQ1_M | 0.6 | mostly desperate |
| [GGUF](https://huggingface.co/mradermacher/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B-i1-GGUF/resolve/main/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 0.6 |  |
| [GGUF](https://huggingface.co/mradermacher/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B-i1-GGUF/resolve/main/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ2_XS.gguf) | i1-IQ2_XS | 0.7 |  |
| [GGUF](https://huggingface.co/mradermacher/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B-i1-GGUF/resolve/main/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ2_S.gguf) | i1-IQ2_S | 0.7 |  |
| [GGUF](https://huggingface.co/mradermacher/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B-i1-GGUF/resolve/main/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ2_M.gguf) | i1-IQ2_M | 0.7 |  |
| [GGUF](https://huggingface.co/mradermacher/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B-i1-GGUF/resolve/main/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q2_K_S.gguf) | i1-Q2_K_S | 0.7 | very low quality |
| [GGUF](https://huggingface.co/mradermacher/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B-i1-GGUF/resolve/main/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 0.8 | lower quality |
| [GGUF](https://huggingface.co/mradermacher/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B-i1-GGUF/resolve/main/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q2_K.gguf) | i1-Q2_K | 0.8 | IQ3_XXS probably better |
| [GGUF](https://huggingface.co/mradermacher/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B-i1-GGUF/resolve/main/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ3_XS.gguf) | i1-IQ3_XS | 0.8 |  |
| [GGUF](https://huggingface.co/mradermacher/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B-i1-GGUF/resolve/main/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q3_K_S.gguf) | i1-Q3_K_S | 0.9 | IQ3_XS probably better |
| [GGUF](https://huggingface.co/mradermacher/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B-i1-GGUF/resolve/main/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ3_S.gguf) | i1-IQ3_S | 0.9 | beats Q3_K* |
| [GGUF](https://huggingface.co/mradermacher/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B-i1-GGUF/resolve/main/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ3_M.gguf) | i1-IQ3_M | 0.9 |  |
| [GGUF](https://huggingface.co/mradermacher/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B-i1-GGUF/resolve/main/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q3_K_M.gguf) | i1-Q3_K_M | 0.9 | IQ3_S probably better |
| [GGUF](https://huggingface.co/mradermacher/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B-i1-GGUF/resolve/main/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q3_K_L.gguf) | i1-Q3_K_L | 1.0 | IQ3_M probably better |
| [GGUF](https://huggingface.co/mradermacher/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B-i1-GGUF/resolve/main/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ4_XS.gguf) | i1-IQ4_XS | 1.0 |  |
| [GGUF](https://huggingface.co/mradermacher/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B-i1-GGUF/resolve/main/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-IQ4_NL.gguf) | i1-IQ4_NL | 1.0 | prefer IQ4_XS |
| [GGUF](https://huggingface.co/mradermacher/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B-i1-GGUF/resolve/main/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q4_0.gguf) | i1-Q4_0 | 1.0 | fast, low quality |
| [GGUF](https://huggingface.co/mradermacher/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B-i1-GGUF/resolve/main/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q4_K_S.gguf) | i1-Q4_K_S | 1.0 | optimal size/speed/quality |
| [GGUF](https://huggingface.co/mradermacher/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B-i1-GGUF/resolve/main/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q4_K_M.gguf) | i1-Q4_K_M | 1.1 | fast, recommended |
| [GGUF](https://huggingface.co/mradermacher/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B-i1-GGUF/resolve/main/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q4_1.gguf) | i1-Q4_1 | 1.1 |  |
| [GGUF](https://huggingface.co/mradermacher/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B-i1-GGUF/resolve/main/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q5_K_S.gguf) | i1-Q5_K_S | 1.2 |  |
| [GGUF](https://huggingface.co/mradermacher/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B-i1-GGUF/resolve/main/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q5_K_M.gguf) | i1-Q5_K_M | 1.2 |  |
| [GGUF](https://huggingface.co/mradermacher/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B-i1-GGUF/resolve/main/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B.i1-Q6_K.gguf) | i1-Q6_K | 1.4 | practically like static Q6_K |

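Every link in the table follows the same pattern, so a download URL can be derived from the value in the Type column. The sketch below only reproduces that pattern as it appears in the table; the helper name `quant_url` and its defaults are illustrative, not part of any tool.

```python
REPO_ID = "mradermacher/Qwen2.5-QwQ-RP-Draft-v0.2-1.5B-i1-GGUF"
MODEL_NAME = "Qwen2.5-QwQ-RP-Draft-v0.2-1.5B"


def quant_url(quant_type: str, repo_id: str = REPO_ID,
              model_name: str = MODEL_NAME) -> str:
    """Build the download URL for a table entry such as "i1-Q4_K_M"."""
    filename = f"{model_name}.{quant_type}.gguf"
    return f"https://huggingface.co/{repo_id}/resolve/main/{filename}"


if __name__ == "__main__":
    # Print the link for the quant the table marks as "fast, recommended".
    print(quant_url("i1-Q4_K_M"))
```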
Here is a handy graph by ikawrakow comparing some lower-quality quant
types (lower is better):

And here are Artefact2's thoughts on the matter:
https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9

## FAQ / Model Request

See https://huggingface.co/mradermacher/model_requests for some answers to
questions you might have and/or if you want some other model quantized.

## Thanks

I thank my company, [nethype GmbH](https://www.nethype.de/), for letting
me use its servers and providing upgrades to my workstation to enable
this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to.

<!-- end -->
3
imatrix.dat
Normal file
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:feaf07b1096b0ebc1b3c887d23ba12b3c6c666d698d1a832eb75bf8cd319f966
size 2042201
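Each binary file in this commit is stored as a three-line Git LFS pointer (`version`, `oid sha256:…`, `size`). A downloaded file can be checked against its pointer by comparing byte size and SHA-256 digest. The following is a minimal sketch under that assumption; the function names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of Git LFS itself.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the "key value" lines of a Git LFS pointer file."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algorithm, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algorithm": algorithm,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: Path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Check a local file's size and SHA-256 against a parsed pointer."""
    if pointer["algorithm"] != "sha256":
        raise ValueError(f"unsupported hash: {pointer['algorithm']}")
    # Cheap check first: a size mismatch avoids hashing a large file.
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.sha256()
    with path.open("rb") as handle:
        # Hash in chunks so multi-gigabyte files are not loaded into memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]
```

For example, the `imatrix.dat` pointer above would be parsed with `parse_lfs_pointer(...)` and then compared against the downloaded file with `matches_pointer(Path("imatrix.dat"), pointer)`.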