commit 3a5022b6ba017e8ecc6840fe1693fb7dcb8c894c Author: ModelHub XC Date: Wed Jun 17 16:04:18 2026 +0800 初始化项目,由ModelHub XC社区提供模型 Model: duyntnet/Qwen2.5-Coder-1.5B-imatrix-GGUF Source: Original Platform diff --git a/.gitattributes b/.gitattributes new file mode 100644 index 0000000..9c70e33 --- /dev/null +++ b/.gitattributes @@ -0,0 +1,62 @@ +*.7z filter=lfs diff=lfs merge=lfs -text +*.arrow filter=lfs diff=lfs merge=lfs -text +*.bin filter=lfs diff=lfs merge=lfs -text +*.bz2 filter=lfs diff=lfs merge=lfs -text +*.ckpt filter=lfs diff=lfs merge=lfs -text +*.ftz filter=lfs diff=lfs merge=lfs -text +*.gz filter=lfs diff=lfs merge=lfs -text +*.h5 filter=lfs diff=lfs merge=lfs -text +*.joblib filter=lfs diff=lfs merge=lfs -text +*.lfs.* filter=lfs diff=lfs merge=lfs -text +*.mlmodel filter=lfs diff=lfs merge=lfs -text +*.model filter=lfs diff=lfs merge=lfs -text +*.msgpack filter=lfs diff=lfs merge=lfs -text +*.npy filter=lfs diff=lfs merge=lfs -text +*.npz filter=lfs diff=lfs merge=lfs -text +*.onnx filter=lfs diff=lfs merge=lfs -text +*.ot filter=lfs diff=lfs merge=lfs -text +*.parquet filter=lfs diff=lfs merge=lfs -text +*.pb filter=lfs diff=lfs merge=lfs -text +*.pickle filter=lfs diff=lfs merge=lfs -text +*.pkl filter=lfs diff=lfs merge=lfs -text +*.pt filter=lfs diff=lfs merge=lfs -text +*.pth filter=lfs diff=lfs merge=lfs -text +*.rar filter=lfs diff=lfs merge=lfs -text +*.safetensors filter=lfs diff=lfs merge=lfs -text +saved_model/**/* filter=lfs diff=lfs merge=lfs -text +*.tar.* filter=lfs diff=lfs merge=lfs -text +*.tar filter=lfs diff=lfs merge=lfs -text +*.tflite filter=lfs diff=lfs merge=lfs -text +*.tgz filter=lfs diff=lfs merge=lfs -text +*.wasm filter=lfs diff=lfs merge=lfs -text +*.xz filter=lfs diff=lfs merge=lfs -text +*.zip filter=lfs diff=lfs merge=lfs -text +*.zst filter=lfs diff=lfs merge=lfs -text +*tfevents* filter=lfs diff=lfs merge=lfs -text +Qwen2.5-Coder-1.5B-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text +Qwen2.5-Coder-1.5B-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text +Qwen2.5-Coder-1.5B-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text +Qwen2.5-Coder-1.5B-Q2_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Qwen2.5-Coder-1.5B-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text +Qwen2.5-Coder-1.5B-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text +Qwen2.5-Coder-1.5B-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text +Qwen2.5-Coder-1.5B-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text +Qwen2.5-Coder-1.5B-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Qwen2.5-Coder-1.5B-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text +Qwen2.5-Coder-1.5B-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text +Qwen2.5-Coder-1.5B-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text +Qwen2.5-Coder-1.5B-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text +Qwen2.5-Coder-1.5B-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text +Qwen2.5-Coder-1.5B-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text +Qwen2.5-Coder-1.5B-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text +Qwen2.5-Coder-1.5B-Q4_1.gguf filter=lfs diff=lfs merge=lfs -text +Qwen2.5-Coder-1.5B-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text +Qwen2.5-Coder-1.5B-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Qwen2.5-Coder-1.5B-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text +Qwen2.5-Coder-1.5B-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text +Qwen2.5-Coder-1.5B-Q5_0.gguf filter=lfs diff=lfs merge=lfs -text +Qwen2.5-Coder-1.5B-Q5_1.gguf filter=lfs diff=lfs merge=lfs -text +Qwen2.5-Coder-1.5B-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text +Qwen2.5-Coder-1.5B-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Qwen2.5-Coder-1.5B-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text +Qwen2.5-Coder-1.5B-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text diff --git a/Qwen2.5-Coder-1.5B-IQ1_M.gguf b/Qwen2.5-Coder-1.5B-IQ1_M.gguf new file mode 100644 index 0000000..3fe01b9 --- /dev/null +++ b/Qwen2.5-Coder-1.5B-IQ1_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:ba7ce718790a28f96cc1c0c50693cde9cf1c99c5b65c3d1c301c14143f764a13 +size 464461952 diff --git a/Qwen2.5-Coder-1.5B-IQ1_S.gguf b/Qwen2.5-Coder-1.5B-IQ1_S.gguf new file mode 100644 index 0000000..3ea3b77 --- /dev/null +++ b/Qwen2.5-Coder-1.5B-IQ1_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:9b8f4470c01ae7a4ae01757a0d11d890d01805203cf975d7cd2d14da05aca729 +size 436528256 diff --git a/Qwen2.5-Coder-1.5B-IQ2_M.gguf b/Qwen2.5-Coder-1.5B-IQ2_M.gguf new file mode 100644 index 0000000..3727f7d --- /dev/null +++ b/Qwen2.5-Coder-1.5B-IQ2_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:09fa5392b4ee6838ad336e57156a5e49d19b3a97f5efc65246d4dbbee093661e +size 601055360 diff --git a/Qwen2.5-Coder-1.5B-IQ2_S.gguf b/Qwen2.5-Coder-1.5B-IQ2_S.gguf new file mode 100644 index 0000000..0a3e9ae --- /dev/null +++ b/Qwen2.5-Coder-1.5B-IQ2_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:550730976bbfc442d1a353ebd4276c3d23a1c216e14657159c7ab95af35ef846 +size 563810432 diff --git a/Qwen2.5-Coder-1.5B-IQ2_XS.gguf b/Qwen2.5-Coder-1.5B-IQ2_XS.gguf new file mode 100644 index 0000000..7d0dba1 --- /dev/null +++ b/Qwen2.5-Coder-1.5B-IQ2_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:9bc4f0cfafd4027f3c3b951b14c96a193b8fbb8003a6dfda45474534df177123 +size 550327424 diff --git a/Qwen2.5-Coder-1.5B-IQ2_XXS.gguf b/Qwen2.5-Coder-1.5B-IQ2_XXS.gguf new file mode 100644 index 0000000..2f05e95 --- /dev/null +++ b/Qwen2.5-Coder-1.5B-IQ2_XXS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:f0d7c52914edc63a1deef18ec06fb9c01928c3e6731f922c8934d2da6eb27895 +size 511018112 diff --git a/Qwen2.5-Coder-1.5B-IQ3_M.gguf b/Qwen2.5-Coder-1.5B-IQ3_M.gguf new file mode 100644 index 0000000..b9f653c --- /dev/null +++ b/Qwen2.5-Coder-1.5B-IQ3_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8acf2e5db980fe3b76edcd9afd485b79fefe2b6eec29a82cfa3b45863c594b22 +size 776664704 diff --git a/Qwen2.5-Coder-1.5B-IQ3_S.gguf b/Qwen2.5-Coder-1.5B-IQ3_S.gguf new file mode 100644 index 0000000..bbe688b --- /dev/null +++ b/Qwen2.5-Coder-1.5B-IQ3_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:85eb220b192e715ee8161b1e828eb7603c0292d9bc9d3acd3ce223dc9c38c4b0 +size 762407552 diff --git a/Qwen2.5-Coder-1.5B-IQ3_XS.gguf b/Qwen2.5-Coder-1.5B-IQ3_XS.gguf new file mode 100644 index 0000000..c8bd861 --- /dev/null +++ b/Qwen2.5-Coder-1.5B-IQ3_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:c55180d930a3fc2952f161295c1a2a8a6adda2fcffb603ffed9602bf769734bf +size 731699840 diff --git a/Qwen2.5-Coder-1.5B-IQ3_XXS.gguf b/Qwen2.5-Coder-1.5B-IQ3_XXS.gguf new file mode 100644 index 0000000..0b7c57b --- /dev/null +++ b/Qwen2.5-Coder-1.5B-IQ3_XXS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:b7a08471c6c8ecfa18cd677f28422e72cb9693dd0ee55c9f8d5fc5fe7aa8ea53 +size 668792960 diff --git a/Qwen2.5-Coder-1.5B-IQ4_NL.gguf b/Qwen2.5-Coder-1.5B-IQ4_NL.gguf new file mode 100644 index 0000000..03bd4ad --- /dev/null +++ b/Qwen2.5-Coder-1.5B-IQ4_NL.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:40c53f85c39959e6654d1a5f6a5b1d5902809667f1974cea2e6e418361500ca9 +size 936331904 diff --git a/Qwen2.5-Coder-1.5B-IQ4_XS.gguf b/Qwen2.5-Coder-1.5B-IQ4_XS.gguf new file mode 100644 index 0000000..07a2513 --- /dev/null +++ b/Qwen2.5-Coder-1.5B-IQ4_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:b6cc67be2fc93de3d52e419129d594e4efe17d88326ce19052060550eee526f1 +size 895732352 diff --git a/Qwen2.5-Coder-1.5B-Q2_K.gguf b/Qwen2.5-Coder-1.5B-Q2_K.gguf new file mode 100644 index 0000000..1b6f2a5 --- /dev/null +++ b/Qwen2.5-Coder-1.5B-Q2_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:1c0af205020efcc97041d03db84cb022b50e19a8998e47894cde11b99045f52e +size 676305536 diff --git a/Qwen2.5-Coder-1.5B-Q2_K_S.gguf b/Qwen2.5-Coder-1.5B-Q2_K_S.gguf new file mode 100644 index 0000000..d4d8121 --- /dev/null +++ b/Qwen2.5-Coder-1.5B-Q2_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:fa3ed744fb0a2ac465649348da8ac45033f5d94206b55bd93dca0c565f5e24cb +size 640135808 diff --git a/Qwen2.5-Coder-1.5B-Q3_K_L.gguf b/Qwen2.5-Coder-1.5B-Q3_K_L.gguf new file mode 100644 index 0000000..1486aa2 --- /dev/null +++ b/Qwen2.5-Coder-1.5B-Q3_K_L.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:49501ac38f45399e076562f9584c9ff17ef59a7569d7aeee94d2c3f8a17ad4f1 +size 880163456 diff --git a/Qwen2.5-Coder-1.5B-Q3_K_M.gguf b/Qwen2.5-Coder-1.5B-Q3_K_M.gguf new file mode 100644 index 0000000..67767ce --- /dev/null +++ b/Qwen2.5-Coder-1.5B-Q3_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:4cf13d3cc93a58a07b6f08dbb52e38b532e7763d9a5723d5bc40a233f2d18f39 +size 824179328 diff --git a/Qwen2.5-Coder-1.5B-Q3_K_S.gguf b/Qwen2.5-Coder-1.5B-Q3_K_S.gguf new file mode 100644 index 0000000..13d11ee --- /dev/null +++ b/Qwen2.5-Coder-1.5B-Q3_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8176883b711194f2c69ad90217f06e1033d7106cb4e1ec0c938c9d56d2fbd521 +size 760945280 diff --git a/Qwen2.5-Coder-1.5B-Q4_0.gguf b/Qwen2.5-Coder-1.5B-Q4_0.gguf new file mode 100644 index 0000000..560d81f --- /dev/null +++ b/Qwen2.5-Coder-1.5B-Q4_0.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:fb696064e565fd281ad0ee2b5f989048a32f0f7ccd8118cdec82f62d604cd729 +size 937536128 diff --git a/Qwen2.5-Coder-1.5B-Q4_1.gguf b/Qwen2.5-Coder-1.5B-Q4_1.gguf new file mode 100644 index 0000000..1e56901 --- /dev/null +++ b/Qwen2.5-Coder-1.5B-Q4_1.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:abff6afee560d6a65caffa885d8d2835e312531eee4320e98d778453b6d5720d +size 1016842880 diff --git a/Qwen2.5-Coder-1.5B-Q4_K_M.gguf b/Qwen2.5-Coder-1.5B-Q4_K_M.gguf new file mode 100644 index 0000000..cc1f716 --- /dev/null +++ b/Qwen2.5-Coder-1.5B-Q4_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:1e837152fd3135263fa88b4f513efadbb5dde2b5c229a10587dddedc0c0578c8 +size 986049152 diff --git a/Qwen2.5-Coder-1.5B-Q4_K_S.gguf b/Qwen2.5-Coder-1.5B-Q4_K_S.gguf new file mode 100644 index 0000000..27fc370 --- /dev/null +++ b/Qwen2.5-Coder-1.5B-Q4_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:150c3732d2e6c5c18146ddd7e1a9f797a73df6bb8419c2c1b90ae05818dbc562 +size 940313216 diff --git a/Qwen2.5-Coder-1.5B-Q5_0.gguf b/Qwen2.5-Coder-1.5B-Q5_0.gguf new file mode 100644 index 0000000..c14e293 --- /dev/null +++ b/Qwen2.5-Coder-1.5B-Q5_0.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:01a4baad236b43dbb17ccf2603ff6b64bcc5da49eaec33b678dcb36cdf0d5d78 +size 1101310592 diff --git a/Qwen2.5-Coder-1.5B-Q5_1.gguf b/Qwen2.5-Coder-1.5B-Q5_1.gguf new file mode 100644 index 0000000..8cd4a0f --- /dev/null +++ b/Qwen2.5-Coder-1.5B-Q5_1.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:da858b301d0f0dd2a2507a7ea38a8fef0123a8fb209566fc47aaf5f628de63d3 +size 1180617344 diff --git a/Qwen2.5-Coder-1.5B-Q5_K_M.gguf b/Qwen2.5-Coder-1.5B-Q5_K_M.gguf new file mode 100644 index 0000000..b703dc3 --- /dev/null +++ b/Qwen2.5-Coder-1.5B-Q5_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:3d4ef61405fee8c937681141da97a625889439a1e850cc9f43dc3b0521302efe +size 1125051008 diff --git a/Qwen2.5-Coder-1.5B-Q5_K_S.gguf b/Qwen2.5-Coder-1.5B-Q5_K_S.gguf new file mode 100644 index 0000000..6ab8803 --- /dev/null +++ b/Qwen2.5-Coder-1.5B-Q5_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:116fc3bc98ec69d94f245ed88e747fedf89c407f5af995aa639240b151b1fd7a +size 1098730112 diff --git a/Qwen2.5-Coder-1.5B-Q6_K.gguf b/Qwen2.5-Coder-1.5B-Q6_K.gguf new file mode 100644 index 0000000..7782ac9 --- /dev/null +++ b/Qwen2.5-Coder-1.5B-Q6_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:ab04eb20379db64e4f2ead89b2441bc31f1844790115de089bf46683b88c61ee +size 1272740480 diff --git a/Qwen2.5-Coder-1.5B-Q8_0.gguf b/Qwen2.5-Coder-1.5B-Q8_0.gguf new file mode 100644 index 0000000..1a0d590 --- /dev/null +++ b/Qwen2.5-Coder-1.5B-Q8_0.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:d227aa6d958d92550d3b30846f7fdad0459262e016a2bc4b3cda52ea7d24230b +size 1646573696 diff --git a/README.md b/README.md new file mode 100644 index 0000000..55d6952 --- /dev/null +++ b/README.md @@ -0,0 +1,55 @@ +--- +license: other +language: +- en +pipeline_tag: text-generation +inference: false +tags: +- transformers +- gguf +- imatrix +- Qwen2.5-Coder-1.5B +--- +Quantizations of https://huggingface.co/Qwen/Qwen2.5-Coder-1.5B + + +### Inference Clients/UIs +* [llama.cpp](https://github.com/ggerganov/llama.cpp) +* [KoboldCPP](https://github.com/LostRuins/koboldcpp) +* [ollama](https://github.com/ollama/ollama) +* [text-generation-webui](https://github.com/oobabooga/text-generation-webui) +* [GPT4All](https://github.com/nomic-ai/gpt4all) +* [jan](https://github.com/janhq/jan) +--- + +# From original readme + +## Introduction + +Qwen2.5-Coder is the latest series of Code-Specific Qwen large language models (formerly known as CodeQwen). As of now, Qwen2.5-Coder has covered six mainstream model sizes, 0.5, 1.5, 3, 7, 14, 32 billion parameters, to meet the needs of different developers. Qwen2.5-Coder brings the following improvements upon CodeQwen1.5: + +- Significantly improvements in **code generation**, **code reasoning** and **code fixing**. Base on the strong Qwen2.5, we scale up the training tokens into 5.5 trillion including source code, text-code grounding, Synthetic data, etc. Qwen2.5-Coder-32B has become the current state-of-the-art open-source codeLLM, with its coding abilities matching those of GPT-4o. +- A more comprehensive foundation for real-world applications such as **Code Agents**. Not only enhancing coding capabilities but also maintaining its strengths in mathematics and general competencies. + +**This repo contains the 1.5B Qwen2.5-Coder model**, which has the following features: +- Type: Causal Language Models +- Training Stage: Pretraining +- Architecture: transformers with RoPE, SwiGLU, RMSNorm, Attention QKV bias and tied word embeddings +- Number of Parameters: 1.54B +- Number of Paramaters (Non-Embedding): 1.31B +- Number of Layers: 28 +- Number of Attention Heads (GQA): 12 for Q and 2 for KV +- Context Length: Full 32,768 tokens + +**We do not recommend using base language models for conversations.** Instead, you can apply post-training, e.g., SFT, RLHF, continued pretraining, etc., or fill in the middle tasks on this model. + +For more details, please refer to our [blog](https://qwenlm.github.io/blog/qwen2.5-coder-family/), [GitHub](https://github.com/QwenLM/Qwen2.5-Coder), [Documentation](https://qwen.readthedocs.io/en/latest/), [Arxiv](https://arxiv.org/abs/2409.12186). + +## Requirements + +The code of Qwen2.5-Coder has been in the latest Hugging face `transformers` and we advise you to use the latest version of `transformers`. + +With `transformers<4.37.0`, you will encounter the following error: +``` +KeyError: 'qwen2' +``` \ No newline at end of file