commit fcf2db3c91eff2cd456fe1c8ab335a62bc213e98 Author: ModelHub XC Date: Sat May 9 23:50:09 2026 +0800 初始化项目,由ModelHub XC社区提供模型 Model: mradermacher/QWQ-500M-i1-GGUF Source: Original Platform diff --git a/.gitattributes b/.gitattributes new file mode 100644 index 0000000..d93ec69 --- /dev/null +++ b/.gitattributes @@ -0,0 +1,59 @@ +*.7z filter=lfs diff=lfs merge=lfs -text +*.arrow filter=lfs diff=lfs merge=lfs -text +*.bin filter=lfs diff=lfs merge=lfs -text +*.bz2 filter=lfs diff=lfs merge=lfs -text +*.ckpt filter=lfs diff=lfs merge=lfs -text +*.ftz filter=lfs diff=lfs merge=lfs -text +*.gz filter=lfs diff=lfs merge=lfs -text +*.h5 filter=lfs diff=lfs merge=lfs -text +*.joblib filter=lfs diff=lfs merge=lfs -text +*.lfs.* filter=lfs diff=lfs merge=lfs -text +*.mlmodel filter=lfs diff=lfs merge=lfs -text +*.model filter=lfs diff=lfs merge=lfs -text +*.msgpack filter=lfs diff=lfs merge=lfs -text +*.npy filter=lfs diff=lfs merge=lfs -text +*.npz filter=lfs diff=lfs merge=lfs -text +*.onnx filter=lfs diff=lfs merge=lfs -text +*.ot filter=lfs diff=lfs merge=lfs -text +*.parquet filter=lfs diff=lfs merge=lfs -text +*.pb filter=lfs diff=lfs merge=lfs -text +*.pickle filter=lfs diff=lfs merge=lfs -text +*.pkl filter=lfs diff=lfs merge=lfs -text +*.pt filter=lfs diff=lfs merge=lfs -text +*.pth filter=lfs diff=lfs merge=lfs -text +*.rar filter=lfs diff=lfs merge=lfs -text +*.safetensors filter=lfs diff=lfs merge=lfs -text +saved_model/**/* filter=lfs diff=lfs merge=lfs -text +*.tar.* filter=lfs diff=lfs merge=lfs -text +*.tar filter=lfs diff=lfs merge=lfs -text +*.tflite filter=lfs diff=lfs merge=lfs -text +*.tgz filter=lfs diff=lfs merge=lfs -text +*.wasm filter=lfs diff=lfs merge=lfs -text +*.xz filter=lfs diff=lfs merge=lfs -text +*.zip filter=lfs diff=lfs merge=lfs -text +*.zst filter=lfs diff=lfs merge=lfs -text +*tfevents* filter=lfs diff=lfs merge=lfs -text +QWQ-500M.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text +QWQ-500M.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text +QWQ-500M.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text +QWQ-500M.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text +QWQ-500M.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text +QWQ-500M.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text +QWQ-500M.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text +QWQ-500M.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text +QWQ-500M.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text +QWQ-500M.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text +QWQ-500M.i1-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text +QWQ-500M.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text +QWQ-500M.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text +QWQ-500M.i1-Q2_K_S.gguf filter=lfs diff=lfs merge=lfs -text +QWQ-500M.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text +QWQ-500M.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text +QWQ-500M.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text +QWQ-500M.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text +QWQ-500M.i1-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text +QWQ-500M.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text +QWQ-500M.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text +QWQ-500M.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text +QWQ-500M.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text +QWQ-500M.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text diff --git a/QWQ-500M.i1-IQ1_M.gguf b/QWQ-500M.i1-IQ1_M.gguf new file mode 100644 index 0000000..586e23b --- /dev/null +++ b/QWQ-500M.i1-IQ1_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:3e8c18e5c0988c6fdcd6accf943d17bf28b107a2f3435509c5980994657e3351 +size 317975648 diff --git a/QWQ-500M.i1-IQ1_S.gguf b/QWQ-500M.i1-IQ1_S.gguf new file mode 100644 index 0000000..bc8bca8 --- /dev/null +++ b/QWQ-500M.i1-IQ1_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:d2b8a99255a98a24467e77b09d7f2824eca56a03244c2ddb1cac44eaab09dceb +size 315830624 diff --git a/QWQ-500M.i1-IQ2_M.gguf b/QWQ-500M.i1-IQ2_M.gguf new file mode 100644 index 0000000..c050b3c --- /dev/null +++ b/QWQ-500M.i1-IQ2_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:d9791db1fcde1d5fdae1db604d3d947c67c7576a05e4674425e74cd7ccf4d11e +size 328598624 diff --git a/QWQ-500M.i1-IQ2_S.gguf b/QWQ-500M.i1-IQ2_S.gguf new file mode 100644 index 0000000..c784dd9 --- /dev/null +++ b/QWQ-500M.i1-IQ2_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:291bea87b6789b2695e73b97d9171ab0f298bd61adabc8323a060f649edd6817 +size 325738592 diff --git a/QWQ-500M.i1-IQ2_XS.gguf b/QWQ-500M.i1-IQ2_XS.gguf new file mode 100644 index 0000000..a3cc8cc --- /dev/null +++ b/QWQ-500M.i1-IQ2_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:5a6e806ac8c9e0d3aef7607ac36194857b88d3699b3a4d20f9dea0d9ca815426 +size 324410720 diff --git a/QWQ-500M.i1-IQ2_XXS.gguf b/QWQ-500M.i1-IQ2_XXS.gguf new file mode 100644 index 0000000..89f77e1 --- /dev/null +++ b/QWQ-500M.i1-IQ2_XXS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:0393753b96a826de9328509dfe7d72e58194f92359b22265c88472d76f077e32 +size 321550688 diff --git a/QWQ-500M.i1-IQ3_M.gguf b/QWQ-500M.i1-IQ3_M.gguf new file mode 100644 index 0000000..855f460 --- /dev/null +++ b/QWQ-500M.i1-IQ3_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:950265405c4b779de5621b61bd33221a439ceb8a96c246f6152eafb2a1dedec4 +size 342753632 diff --git a/QWQ-500M.i1-IQ3_S.gguf b/QWQ-500M.i1-IQ3_S.gguf new file mode 100644 index 0000000..fa9d5b8 --- /dev/null +++ b/QWQ-500M.i1-IQ3_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c4b987f28d35435c231c4161b03ce998c0464305804497b4cd387ec4eeeeb465 +size 338608736 diff --git a/QWQ-500M.i1-IQ3_XS.gguf b/QWQ-500M.i1-IQ3_XS.gguf new file mode 100644 index 0000000..5cfe853 --- /dev/null +++ b/QWQ-500M.i1-IQ3_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:9ff5196e025e6da654daa1c0da345c3a109131d4337b79d5ef92319ab86d81c2 +size 338608736 diff --git a/QWQ-500M.i1-IQ3_XXS.gguf b/QWQ-500M.i1-IQ3_XXS.gguf new file mode 100644 index 0000000..10ac3ad --- /dev/null +++ b/QWQ-500M.i1-IQ3_XXS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:bb472a91f5842e053586f82e49926d7fdd41e04be2b76d6a8b5936c447547d2b +size 333705824 diff --git a/QWQ-500M.i1-IQ4_NL.gguf b/QWQ-500M.i1-IQ4_NL.gguf new file mode 100644 index 0000000..15451f6 --- /dev/null +++ b/QWQ-500M.i1-IQ4_NL.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:ccfcc9a09d5d6c083f4b86ee63671eea74c6f17ffd6585434100a69166f54f29 +size 352672352 diff --git a/QWQ-500M.i1-IQ4_XS.gguf b/QWQ-500M.i1-IQ4_XS.gguf new file mode 100644 index 0000000..38ce94a --- /dev/null +++ b/QWQ-500M.i1-IQ4_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:ed86d36f02d0cb074a8b3711c45054eac0fff3c7987f306bb7dd93226cacd45f +size 349403744 diff --git a/QWQ-500M.i1-Q2_K.gguf b/QWQ-500M.i1-Q2_K.gguf new file mode 100644 index 0000000..0398d90 --- /dev/null +++ b/QWQ-500M.i1-Q2_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c17bb95a789a9dedb3ef615c5d9ec2f4628a0d95bb725c5549d464c1e4c1860f +size 338608736 diff --git a/QWQ-500M.i1-Q2_K_S.gguf b/QWQ-500M.i1-Q2_K_S.gguf new file mode 100644 index 0000000..fba0ac0 --- /dev/null +++ b/QWQ-500M.i1-Q2_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:4b89e9473c11d3c0994b3827e389efc9bebc141b3ac908cbd295e62bb8f7e124 +size 331050080 diff --git a/QWQ-500M.i1-Q3_K_L.gguf b/QWQ-500M.i1-Q3_K_L.gguf new file mode 100644 index 0000000..7c3cfb8 --- /dev/null +++ b/QWQ-500M.i1-Q3_K_L.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c79630b0e9530638f2abb977dc86cc68f6f492e96034842e1a45d9a7a9eb8476 +size 369359456 diff --git a/QWQ-500M.i1-Q3_K_M.gguf b/QWQ-500M.i1-Q3_K_M.gguf new file mode 100644 index 0000000..f3d9580 --- /dev/null +++ b/QWQ-500M.i1-Q3_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:4f5747dd3f0743023290995690f93393ec4f89daa9a84875c28c121afa2a49fc +size 355467872 diff --git a/QWQ-500M.i1-Q3_K_S.gguf b/QWQ-500M.i1-Q3_K_S.gguf new file mode 100644 index 0000000..7d539e3 --- /dev/null +++ b/QWQ-500M.i1-Q3_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:e9d61b888660aff14e5c0e5b3400f2dfb7ea380a1389b3502643b156fbfad7ca +size 338264672 diff --git a/QWQ-500M.i1-Q4_0.gguf b/QWQ-500M.i1-Q4_0.gguf new file mode 100644 index 0000000..0dd1e93 --- /dev/null +++ b/QWQ-500M.i1-Q4_0.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c43b52075cd43e38a2c4de1a8018a51361c6b374172b6c7c9db6a61059d9dd7b +size 352973408 diff --git a/QWQ-500M.i1-Q4_1.gguf b/QWQ-500M.i1-Q4_1.gguf new file mode 100644 index 0000000..bcaa470 --- /dev/null +++ b/QWQ-500M.i1-Q4_1.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:79919ced73f0a370c1e517eab141e378545f85825c1f475791f7c5f262c1490a +size 374520416 diff --git a/QWQ-500M.i1-Q4_K_M.gguf b/QWQ-500M.i1-Q4_K_M.gguf new file mode 100644 index 0000000..0adc389 --- /dev/null +++ b/QWQ-500M.i1-Q4_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c7fa7a4586ab90051612604b5b9363aa06873f7610708aa7007bd7b36f5b5c57 +size 397809248 diff --git a/QWQ-500M.i1-Q4_K_S.gguf b/QWQ-500M.i1-Q4_K_S.gguf new file mode 100644 index 0000000..009208e --- /dev/null +++ b/QWQ-500M.i1-Q4_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:eda75270a9448060359768c7e12197da8a9c3d50b1d0d073d9e55c24f5266231 +size 385473120 diff --git a/QWQ-500M.i1-Q5_K_M.gguf b/QWQ-500M.i1-Q5_K_M.gguf new file mode 100644 index 0000000..03b8c63 --- /dev/null +++ b/QWQ-500M.i1-Q5_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:ffbe5c1f6c4efd3b82dec05f6737b6f1000e78945e811af5f045d2c7a2c60919 +size 420087392 diff --git a/QWQ-500M.i1-Q5_K_S.gguf b/QWQ-500M.i1-Q5_K_S.gguf new file mode 100644 index 0000000..7009791 --- /dev/null +++ b/QWQ-500M.i1-Q5_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:da567640ad3fd946ad12eff71f864220a1c3b5fd87967d2968b9fceaf4073404 +size 412711520 diff --git a/QWQ-500M.i1-Q6_K.gguf b/QWQ-500M.i1-Q6_K.gguf new file mode 100644 index 0000000..109e8bc --- /dev/null +++ b/QWQ-500M.i1-Q6_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8596001769cb98b636d9d5df709fb70aeb3497c7f19a0701b1c635ef2fb131b5 +size 505737824 diff --git a/README.md b/README.md new file mode 100644 index 0000000..7e541da --- /dev/null +++ b/README.md @@ -0,0 +1,83 @@ +--- +base_model: prithivMLmods/QWQ-500M +datasets: +- gghfez/QwQ-LongCoT-130K-cleaned +- qingy2024/QwQ-LongCoT-Verified-130K +- amphora/QwQ-LongCoT-130K +language: +- en +library_name: transformers +license: apache-2.0 +quantized_by: mradermacher +tags: +- qwq +- reasoning +--- +## About + + + + + + +weighted/imatrix quants of https://huggingface.co/prithivMLmods/QWQ-500M + + +static quants are available at https://huggingface.co/mradermacher/QWQ-500M-GGUF +## Usage + +If you are unsure how to use GGUF files, refer to one of [TheBloke's +READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for +more details, including on how to concatenate multi-part files. + +## Provided Quants + +(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants) + +| Link | Type | Size/GB | Notes | +|:-----|:-----|--------:|:------| +| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-IQ1_S.gguf) | i1-IQ1_S | 0.4 | for the desperate | +| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-IQ1_M.gguf) | i1-IQ1_M | 0.4 | mostly desperate | +| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 0.4 | | +| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-IQ2_XS.gguf) | i1-IQ2_XS | 0.4 | | +| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-IQ2_S.gguf) | i1-IQ2_S | 0.4 | | +| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-IQ2_M.gguf) | i1-IQ2_M | 0.4 | | +| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-Q2_K_S.gguf) | i1-Q2_K_S | 0.4 | very low quality | +| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 0.4 | lower quality | +| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-Q3_K_S.gguf) | i1-Q3_K_S | 0.4 | IQ3_XS probably better | +| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-IQ3_S.gguf) | i1-IQ3_S | 0.4 | beats Q3_K* | +| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-IQ3_XS.gguf) | i1-IQ3_XS | 0.4 | | +| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-Q2_K.gguf) | i1-Q2_K | 0.4 | IQ3_XXS probably better | +| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-IQ3_M.gguf) | i1-IQ3_M | 0.4 | | +| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-IQ4_XS.gguf) | i1-IQ4_XS | 0.4 | | +| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-IQ4_NL.gguf) | i1-IQ4_NL | 0.5 | prefer IQ4_XS | +| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-Q4_0.gguf) | i1-Q4_0 | 0.5 | fast, low quality | +| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-Q3_K_M.gguf) | i1-Q3_K_M | 0.5 | IQ3_S probably better | +| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-Q3_K_L.gguf) | i1-Q3_K_L | 0.5 | IQ3_M probably better | +| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-Q4_1.gguf) | i1-Q4_1 | 0.5 | | +| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-Q4_K_S.gguf) | i1-Q4_K_S | 0.5 | optimal size/speed/quality | +| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-Q4_K_M.gguf) | i1-Q4_K_M | 0.5 | fast, recommended | +| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-Q5_K_S.gguf) | i1-Q5_K_S | 0.5 | | +| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-Q5_K_M.gguf) | i1-Q5_K_M | 0.5 | | +| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-Q6_K.gguf) | i1-Q6_K | 0.6 | practically like static Q6_K | + +Here is a handy graph by ikawrakow comparing some lower-quality quant +types (lower is better): + +![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png) + +And here are Artefact2's thoughts on the matter: +https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9 + +## FAQ / Model Request + +See https://huggingface.co/mradermacher/model_requests for some answers to +questions you might have and/or if you want some other model quantized. + +## Thanks + +I thank my company, [nethype GmbH](https://www.nethype.de/), for letting +me use its servers and providing upgrades to my workstation to enable +this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to. + +