初始化项目,由ModelHub XC社区提供模型
Model: mradermacher/QWQ-500M-i1-GGUF Source: Original Platform
This commit is contained in:
59
.gitattributes
vendored
Normal file
59
.gitattributes
vendored
Normal file
@@ -0,0 +1,59 @@
|
||||
*.7z filter=lfs diff=lfs merge=lfs -text
|
||||
*.arrow filter=lfs diff=lfs merge=lfs -text
|
||||
*.bin filter=lfs diff=lfs merge=lfs -text
|
||||
*.bz2 filter=lfs diff=lfs merge=lfs -text
|
||||
*.ckpt filter=lfs diff=lfs merge=lfs -text
|
||||
*.ftz filter=lfs diff=lfs merge=lfs -text
|
||||
*.gz filter=lfs diff=lfs merge=lfs -text
|
||||
*.h5 filter=lfs diff=lfs merge=lfs -text
|
||||
*.joblib filter=lfs diff=lfs merge=lfs -text
|
||||
*.lfs.* filter=lfs diff=lfs merge=lfs -text
|
||||
*.mlmodel filter=lfs diff=lfs merge=lfs -text
|
||||
*.model filter=lfs diff=lfs merge=lfs -text
|
||||
*.msgpack filter=lfs diff=lfs merge=lfs -text
|
||||
*.npy filter=lfs diff=lfs merge=lfs -text
|
||||
*.npz filter=lfs diff=lfs merge=lfs -text
|
||||
*.onnx filter=lfs diff=lfs merge=lfs -text
|
||||
*.ot filter=lfs diff=lfs merge=lfs -text
|
||||
*.parquet filter=lfs diff=lfs merge=lfs -text
|
||||
*.pb filter=lfs diff=lfs merge=lfs -text
|
||||
*.pickle filter=lfs diff=lfs merge=lfs -text
|
||||
*.pkl filter=lfs diff=lfs merge=lfs -text
|
||||
*.pt filter=lfs diff=lfs merge=lfs -text
|
||||
*.pth filter=lfs diff=lfs merge=lfs -text
|
||||
*.rar filter=lfs diff=lfs merge=lfs -text
|
||||
*.safetensors filter=lfs diff=lfs merge=lfs -text
|
||||
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
||||
*.tar.* filter=lfs diff=lfs merge=lfs -text
|
||||
*.tar filter=lfs diff=lfs merge=lfs -text
|
||||
*.tflite filter=lfs diff=lfs merge=lfs -text
|
||||
*.tgz filter=lfs diff=lfs merge=lfs -text
|
||||
*.wasm filter=lfs diff=lfs merge=lfs -text
|
||||
*.xz filter=lfs diff=lfs merge=lfs -text
|
||||
*.zip filter=lfs diff=lfs merge=lfs -text
|
||||
*.zst filter=lfs diff=lfs merge=lfs -text
|
||||
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
||||
QWQ-500M.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
QWQ-500M.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
QWQ-500M.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
QWQ-500M.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
QWQ-500M.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
QWQ-500M.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
QWQ-500M.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
QWQ-500M.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
QWQ-500M.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
QWQ-500M.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
QWQ-500M.i1-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
QWQ-500M.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
QWQ-500M.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
QWQ-500M.i1-Q2_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
QWQ-500M.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
QWQ-500M.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
QWQ-500M.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
QWQ-500M.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
QWQ-500M.i1-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
QWQ-500M.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
QWQ-500M.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
QWQ-500M.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
QWQ-500M.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
QWQ-500M.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
3
QWQ-500M.i1-IQ1_M.gguf
Normal file
3
QWQ-500M.i1-IQ1_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:3e8c18e5c0988c6fdcd6accf943d17bf28b107a2f3435509c5980994657e3351
|
||||
size 317975648
|
||||
3
QWQ-500M.i1-IQ1_S.gguf
Normal file
3
QWQ-500M.i1-IQ1_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:d2b8a99255a98a24467e77b09d7f2824eca56a03244c2ddb1cac44eaab09dceb
|
||||
size 315830624
|
||||
3
QWQ-500M.i1-IQ2_M.gguf
Normal file
3
QWQ-500M.i1-IQ2_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:d9791db1fcde1d5fdae1db604d3d947c67c7576a05e4674425e74cd7ccf4d11e
|
||||
size 328598624
|
||||
3
QWQ-500M.i1-IQ2_S.gguf
Normal file
3
QWQ-500M.i1-IQ2_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:291bea87b6789b2695e73b97d9171ab0f298bd61adabc8323a060f649edd6817
|
||||
size 325738592
|
||||
3
QWQ-500M.i1-IQ2_XS.gguf
Normal file
3
QWQ-500M.i1-IQ2_XS.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:5a6e806ac8c9e0d3aef7607ac36194857b88d3699b3a4d20f9dea0d9ca815426
|
||||
size 324410720
|
||||
3
QWQ-500M.i1-IQ2_XXS.gguf
Normal file
3
QWQ-500M.i1-IQ2_XXS.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:0393753b96a826de9328509dfe7d72e58194f92359b22265c88472d76f077e32
|
||||
size 321550688
|
||||
3
QWQ-500M.i1-IQ3_M.gguf
Normal file
3
QWQ-500M.i1-IQ3_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:950265405c4b779de5621b61bd33221a439ceb8a96c246f6152eafb2a1dedec4
|
||||
size 342753632
|
||||
3
QWQ-500M.i1-IQ3_S.gguf
Normal file
3
QWQ-500M.i1-IQ3_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:c4b987f28d35435c231c4161b03ce998c0464305804497b4cd387ec4eeeeb465
|
||||
size 338608736
|
||||
3
QWQ-500M.i1-IQ3_XS.gguf
Normal file
3
QWQ-500M.i1-IQ3_XS.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:9ff5196e025e6da654daa1c0da345c3a109131d4337b79d5ef92319ab86d81c2
|
||||
size 338608736
|
||||
3
QWQ-500M.i1-IQ3_XXS.gguf
Normal file
3
QWQ-500M.i1-IQ3_XXS.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:bb472a91f5842e053586f82e49926d7fdd41e04be2b76d6a8b5936c447547d2b
|
||||
size 333705824
|
||||
3
QWQ-500M.i1-IQ4_NL.gguf
Normal file
3
QWQ-500M.i1-IQ4_NL.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:ccfcc9a09d5d6c083f4b86ee63671eea74c6f17ffd6585434100a69166f54f29
|
||||
size 352672352
|
||||
3
QWQ-500M.i1-IQ4_XS.gguf
Normal file
3
QWQ-500M.i1-IQ4_XS.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:ed86d36f02d0cb074a8b3711c45054eac0fff3c7987f306bb7dd93226cacd45f
|
||||
size 349403744
|
||||
3
QWQ-500M.i1-Q2_K.gguf
Normal file
3
QWQ-500M.i1-Q2_K.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:c17bb95a789a9dedb3ef615c5d9ec2f4628a0d95bb725c5549d464c1e4c1860f
|
||||
size 338608736
|
||||
3
QWQ-500M.i1-Q2_K_S.gguf
Normal file
3
QWQ-500M.i1-Q2_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:4b89e9473c11d3c0994b3827e389efc9bebc141b3ac908cbd295e62bb8f7e124
|
||||
size 331050080
|
||||
3
QWQ-500M.i1-Q3_K_L.gguf
Normal file
3
QWQ-500M.i1-Q3_K_L.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:c79630b0e9530638f2abb977dc86cc68f6f492e96034842e1a45d9a7a9eb8476
|
||||
size 369359456
|
||||
3
QWQ-500M.i1-Q3_K_M.gguf
Normal file
3
QWQ-500M.i1-Q3_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:4f5747dd3f0743023290995690f93393ec4f89daa9a84875c28c121afa2a49fc
|
||||
size 355467872
|
||||
3
QWQ-500M.i1-Q3_K_S.gguf
Normal file
3
QWQ-500M.i1-Q3_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:e9d61b888660aff14e5c0e5b3400f2dfb7ea380a1389b3502643b156fbfad7ca
|
||||
size 338264672
|
||||
3
QWQ-500M.i1-Q4_0.gguf
Normal file
3
QWQ-500M.i1-Q4_0.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:c43b52075cd43e38a2c4de1a8018a51361c6b374172b6c7c9db6a61059d9dd7b
|
||||
size 352973408
|
||||
3
QWQ-500M.i1-Q4_1.gguf
Normal file
3
QWQ-500M.i1-Q4_1.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:79919ced73f0a370c1e517eab141e378545f85825c1f475791f7c5f262c1490a
|
||||
size 374520416
|
||||
3
QWQ-500M.i1-Q4_K_M.gguf
Normal file
3
QWQ-500M.i1-Q4_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:c7fa7a4586ab90051612604b5b9363aa06873f7610708aa7007bd7b36f5b5c57
|
||||
size 397809248
|
||||
3
QWQ-500M.i1-Q4_K_S.gguf
Normal file
3
QWQ-500M.i1-Q4_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:eda75270a9448060359768c7e12197da8a9c3d50b1d0d073d9e55c24f5266231
|
||||
size 385473120
|
||||
3
QWQ-500M.i1-Q5_K_M.gguf
Normal file
3
QWQ-500M.i1-Q5_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:ffbe5c1f6c4efd3b82dec05f6737b6f1000e78945e811af5f045d2c7a2c60919
|
||||
size 420087392
|
||||
3
QWQ-500M.i1-Q5_K_S.gguf
Normal file
3
QWQ-500M.i1-Q5_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:da567640ad3fd946ad12eff71f864220a1c3b5fd87967d2968b9fceaf4073404
|
||||
size 412711520
|
||||
3
QWQ-500M.i1-Q6_K.gguf
Normal file
3
QWQ-500M.i1-Q6_K.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:8596001769cb98b636d9d5df709fb70aeb3497c7f19a0701b1c635ef2fb131b5
|
||||
size 505737824
|
||||
83
README.md
Normal file
83
README.md
Normal file
@@ -0,0 +1,83 @@
|
||||
---
|
||||
base_model: prithivMLmods/QWQ-500M
|
||||
datasets:
|
||||
- gghfez/QwQ-LongCoT-130K-cleaned
|
||||
- qingy2024/QwQ-LongCoT-Verified-130K
|
||||
- amphora/QwQ-LongCoT-130K
|
||||
language:
|
||||
- en
|
||||
library_name: transformers
|
||||
license: apache-2.0
|
||||
quantized_by: mradermacher
|
||||
tags:
|
||||
- qwq
|
||||
- reasoning
|
||||
---
|
||||
## About
|
||||
|
||||
<!-- ### quantize_version: 2 -->
|
||||
<!-- ### output_tensor_quantised: 1 -->
|
||||
<!-- ### convert_type: hf -->
|
||||
<!-- ### vocab_type: -->
|
||||
<!-- ### tags: nicoboss -->
|
||||
weighted/imatrix quants of https://huggingface.co/prithivMLmods/QWQ-500M
|
||||
|
||||
<!-- provided-files -->
|
||||
static quants are available at https://huggingface.co/mradermacher/QWQ-500M-GGUF
|
||||
## Usage
|
||||
|
||||
If you are unsure how to use GGUF files, refer to one of [TheBloke's
|
||||
READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for
|
||||
more details, including on how to concatenate multi-part files.
|
||||
|
||||
## Provided Quants
|
||||
|
||||
(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)
|
||||
|
||||
| Link | Type | Size/GB | Notes |
|
||||
|:-----|:-----|--------:|:------|
|
||||
| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-IQ1_S.gguf) | i1-IQ1_S | 0.4 | for the desperate |
|
||||
| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-IQ1_M.gguf) | i1-IQ1_M | 0.4 | mostly desperate |
|
||||
| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 0.4 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-IQ2_XS.gguf) | i1-IQ2_XS | 0.4 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-IQ2_S.gguf) | i1-IQ2_S | 0.4 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-IQ2_M.gguf) | i1-IQ2_M | 0.4 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-Q2_K_S.gguf) | i1-Q2_K_S | 0.4 | very low quality |
|
||||
| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 0.4 | lower quality |
|
||||
| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-Q3_K_S.gguf) | i1-Q3_K_S | 0.4 | IQ3_XS probably better |
|
||||
| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-IQ3_S.gguf) | i1-IQ3_S | 0.4 | beats Q3_K* |
|
||||
| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-IQ3_XS.gguf) | i1-IQ3_XS | 0.4 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-Q2_K.gguf) | i1-Q2_K | 0.4 | IQ3_XXS probably better |
|
||||
| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-IQ3_M.gguf) | i1-IQ3_M | 0.4 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-IQ4_XS.gguf) | i1-IQ4_XS | 0.4 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-IQ4_NL.gguf) | i1-IQ4_NL | 0.5 | prefer IQ4_XS |
|
||||
| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-Q4_0.gguf) | i1-Q4_0 | 0.5 | fast, low quality |
|
||||
| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-Q3_K_M.gguf) | i1-Q3_K_M | 0.5 | IQ3_S probably better |
|
||||
| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-Q3_K_L.gguf) | i1-Q3_K_L | 0.5 | IQ3_M probably better |
|
||||
| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-Q4_1.gguf) | i1-Q4_1 | 0.5 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-Q4_K_S.gguf) | i1-Q4_K_S | 0.5 | optimal size/speed/quality |
|
||||
| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-Q4_K_M.gguf) | i1-Q4_K_M | 0.5 | fast, recommended |
|
||||
| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-Q5_K_S.gguf) | i1-Q5_K_S | 0.5 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-Q5_K_M.gguf) | i1-Q5_K_M | 0.5 | |
|
||||
| [GGUF](https://huggingface.co/mradermacher/QWQ-500M-i1-GGUF/resolve/main/QWQ-500M.i1-Q6_K.gguf) | i1-Q6_K | 0.6 | practically like static Q6_K |
|
||||
|
||||
Here is a handy graph by ikawrakow comparing some lower-quality quant
|
||||
types (lower is better):
|
||||
|
||||

|
||||
|
||||
And here are Artefact2's thoughts on the matter:
|
||||
https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9
|
||||
|
||||
## FAQ / Model Request
|
||||
|
||||
See https://huggingface.co/mradermacher/model_requests for some answers to
|
||||
questions you might have and/or if you want some other model quantized.
|
||||
|
||||
## Thanks
|
||||
|
||||
I thank my company, [nethype GmbH](https://www.nethype.de/), for letting
|
||||
me use its servers and providing upgrades to my workstation to enable
|
||||
this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to.
|
||||
|
||||
<!-- end -->
|
||||
Reference in New Issue
Block a user