---
base_model: unsloth/Qwen2.5-Coder-7B-Instruct-bnb-4bit
tags:
- text-generation-inference
- transformers
- unsloth
- qwen2
- swift
- python
- code
- gguf
license: cc-by-nc-4.0
language:
- en
- ja
datasets:
- igarin/swift-python-textbook-20260219
---
# Uploaded finetuned model
- **Developed by:** igarin
- **License:** cc-by-nc-4.0
- **Finetuned from model:** unsloth/Qwen2.5-Coder-7B-Instruct-bnb-4bit
# Qwen2.5-Coder-7B-20260219-GGUF : GGUF
This model was finetuned and converted to GGUF format using [Unsloth](https://github.com/unslothai/unsloth).
**Example usage**:
- For text-only LLMs (such as this model): `./llama.cpp/llama-cli -hf igarin/Qwen2.5-Coder-7B-20260219-GGUF --jinja`
- For multimodal models: `./llama.cpp/llama-mtmd-cli -hf igarin/Qwen2.5-Coder-7B-20260219-GGUF --jinja`
## Available model files
- `qwen2.5-coder-7b-instruct.F16.gguf`
- `qwen2.5-coder-7b-instruct.Q2_K.gguf`
- `qwen2.5-coder-7b-instruct.Q4_K_M.gguf`
- `qwen2.5-coder-7b-instruct.Q5_K_M.gguf`
- `qwen2.5-coder-7b-instruct.Q6_K.gguf`
- `qwen2.5-coder-7b-instruct.Q8_0.gguf`
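
To pick one of the quantizations listed above, recent llama.cpp builds accept a `:quant` suffix on the `-hf` argument. The sketch below assumes that suffix matches the quantization tag in the file names (here `Q4_K_M`) and that `llama-cli` lives at `./llama.cpp/llama-cli`, as in the usage examples; treat it as a starting point rather than a verified recipe for this repository.

```shell
# Repository name taken from this README; the quantization tag is one of the listed files.
REPO="igarin/Qwen2.5-Coder-7B-20260219-GGUF"
QUANT="Q4_K_M"   # swap for F16, Q2_K, Q5_K_M, Q6_K, or Q8_0

# llama.cpp's -hf flag takes <user>/<model>[:quant].
MODEL_REF="${REPO}:${QUANT}"

# Assumed binary location; only run if it actually exists.
LLAMA_CLI="./llama.cpp/llama-cli"
if [ -x "$LLAMA_CLI" ]; then
  "$LLAMA_CLI" -hf "$MODEL_REF" --jinja
else
  echo "llama-cli not found at $LLAMA_CLI; would run: $LLAMA_CLI -hf $MODEL_REF --jinja"
fi
```

Lower-bit files such as `Q2_K` are smaller and faster to load but generally lose more quality than `Q6_K` or `Q8_0`; `F16` is the unquantized conversion.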
## Ollama
An Ollama Modelfile is included for easy deployment.
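
A minimal sketch of using that Modelfile with the Ollama CLI follows. It assumes the file is named `Modelfile` and sits in the current directory next to the GGUF file it references; the local model name `qwen2.5-coder-7b-20260219` and the prompt are hypothetical choices, not something this repository defines.

```shell
# Hypothetical local name for the model inside Ollama.
MODEL_NAME="qwen2.5-coder-7b-20260219"
# Assumed file name and location of the included Modelfile.
MODELFILE="./Modelfile"

if command -v ollama >/dev/null 2>&1 && [ -f "$MODELFILE" ]; then
  # Register the model with Ollama from the Modelfile, then send it one prompt.
  ollama create "$MODEL_NAME" -f "$MODELFILE"
  ollama run "$MODEL_NAME" "Write a Swift function that reverses a string."
else
  echo "ollama or $MODELFILE not found; skipping"
fi
```

If the Modelfile's `FROM` line points at a specific quantization, that file must be downloaded to the referenced path before running `ollama create`.
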

This model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth).
[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)