---
license: apache-2.0
tags:
- 'Thinking: Disabled'
- text-generation-inference
language:
- en
base_model:
- prithivMLmods/Omega-Qwen2.5-Coder-3B
pipeline_tag: text-generation
library_name: transformers
---

# **Omega-Qwen2.5-Coder-3B-GGUF**

> Omega-Qwen2.5-Coder-3B is a compact and high-efficiency code-focused model fine-tuned on Qwen2.5-Coder-3B-Instruct, using the symbolic-rich Open-Omega-Forge-1M dataset. Designed specifically for hard-coded tasks and deterministic computation, this model runs in a "thinking-disabled" mode—delivering precise, structured outputs with minimal hallucination, making it ideal for rigorous coding workflows and embedded logic applications.

## Model Files

| File Name | Size | Precision |
|-----------|------|-----------|
| Omega-Qwen2.5-Coder-3B.BF16.gguf | 6.18 GB | BF16 |
| Omega-Qwen2.5-Coder-3B.F16.gguf | 6.18 GB | F16 |
| Omega-Qwen2.5-Coder-3B.F32.gguf | 12.3 GB | F32 |
| Omega-Qwen2.5-Coder-3B.Q2_K.gguf | 1.27 GB | Q2_K |
| Omega-Qwen2.5-Coder-3B.Q3_K_L.gguf | 1.71 GB | Q3_K_L |
| Omega-Qwen2.5-Coder-3B.Q3_K_M.gguf | 1.59 GB | Q3_K_M |
| Omega-Qwen2.5-Coder-3B.Q3_K_S.gguf | 1.45 GB | Q3_K_S |
| Omega-Qwen2.5-Coder-3B.Q4_K_M.gguf | 1.93 GB | Q4_K_M |
| Omega-Qwen2.5-Coder-3B.Q4_K_S.gguf | 1.83 GB | Q4_K_S |
| Omega-Qwen2.5-Coder-3B.Q5_K_M.gguf | 2.22 GB | Q5_K_M |
| Omega-Qwen2.5-Coder-3B.Q5_K_S.gguf | 2.17 GB | Q5_K_S |
| Omega-Qwen2.5-Coder-3B.Q6_K.gguf | 2.54 GB | Q6_K |
| Omega-Qwen2.5-Coder-3B.Q8_0.gguf | 3.29 GB | Q8_0 |

## Quants Usage 

(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)

Here is a handy graph by ikawrakow comparing some lower-quality quant
types (lower is better):

![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png)