Qwen3 is the latest generation of large language models in the Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built on extensive training, Qwen3 delivers advances in reasoning, instruction following, agent capabilities, and multilingual support.
Model Files
| File Name | Size | Quantization | Format | Description |
| --- | --- | --- | --- | --- |
| Qwen3_1.7B.F32.gguf | 6.89 GB | FP32 | GGUF | Full precision (float32) version |
| Qwen3_1.7B.BF16.gguf | 3.45 GB | BF16 | GGUF | BFloat16 precision version |
| Qwen3_1.7B.F16.gguf | 3.45 GB | FP16 | GGUF | Float16 precision version |
| Qwen3_1.7B.Q3_K_M.gguf | 940 MB | Q3_K_M | GGUF | 3-bit quantized (K_M variant) |
| Qwen3_1.7B.Q3_K_S.gguf | 867 MB | Q3_K_S | GGUF | 3-bit quantized (K_S variant) |
| Qwen3_1.7B.Q4_K_M.gguf | 1.11 GB | Q4_K_M | GGUF | 4-bit quantized (K_M variant) |
| Qwen3_1.7B.Q4_K_S.gguf | 1.06 GB | Q4_K_S | GGUF | 4-bit quantized (K_S variant) |
| Qwen3_1.7B.Q5_K_M.gguf | 1.26 GB | Q5_K_M | GGUF | 5-bit quantized (K_M variant) |
| Qwen3_1.7B.Q8_0.gguf | 1.83 GB | Q8_0 | GGUF | 8-bit quantized |
| .gitattributes | 2.04 kB | - | - | Git LFS tracking file |
| config.json | 31 B | - | - | Configuration placeholder |
| README.md | 3.63 kB | - | - | Model documentation |
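
The nominal bit width in a quantization name does not tell you exactly how much space each weight takes on disk. The sketch below estimates an average bits-per-weight figure from the listed file sizes. It assumes the F32 file stores exactly 32 bits per weight, that all sizes are in decimal units (so 940 MB is 0.940 GB), and that file metadata is negligible. The dictionary and the function name `effective_bits_per_weight` are illustrative, not part of any library.

```python
# File sizes in GB, copied from the Model Files table (MB values converted to GB).
SIZES_GB = {
    "F32": 6.89,
    "BF16": 3.45,
    "F16": 3.45,
    "Q3_K_S": 0.867,
    "Q3_K_M": 0.940,
    "Q4_K_S": 1.06,
    "Q4_K_M": 1.11,
    "Q5_K_M": 1.26,
    "Q8_0": 1.83,
}


def effective_bits_per_weight(quant, sizes=SIZES_GB):
    """Estimate average bits per weight, using the F32 file as the 32-bit reference.

    Assumption: the F32 file size is dominated by weights stored at 32 bits each.
    """
    return 32.0 * sizes[quant] / sizes["F32"]


if __name__ == "__main__":
    # Print the files from smallest to largest with their estimated bits per weight.
    for name in sorted(SIZES_GB, key=SIZES_GB.get):
        bpw = effective_bits_per_weight(name)
        print(f"{name:8s} {SIZES_GB[name]:5.2f} GB  ~{bpw:4.1f} bits/weight")
```

With these numbers, Q8_0 comes out at roughly 8.5 bits per weight and Q3_K_S at roughly 4, noticeably above the nominal 3 bits. A likely explanation is that K-quant files keep some tensors at higher precision, but treat the figures as estimates rather than exact format properties.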
Quants Usage
(Sorted by size, not necessarily quality. IQ-quants are often preferable to similarly sized non-IQ quants.)
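
Because size is the easiest property to compare, one common starting point is to pick the largest file that fits the memory you have. The sketch below does that for the quantized files listed under Model Files. The function name `largest_quant_within` and the `headroom_gb` parameter (space reserved for the context cache and runtime overhead) are hypothetical, and the 0.5 GB default is an arbitrary assumption, not a measured value. As the note above says, a larger file is not guaranteed to be the better-quality choice.

```python
# Quantized file sizes in GB, taken from the Model Files table (MB converted to GB).
QUANT_SIZES_GB = {
    "Q3_K_S": 0.867,
    "Q3_K_M": 0.940,
    "Q4_K_S": 1.06,
    "Q4_K_M": 1.11,
    "Q5_K_M": 1.26,
    "Q8_0": 1.83,
}


def largest_quant_within(budget_gb, headroom_gb=0.5, sizes=QUANT_SIZES_GB):
    """Return the name of the largest quant that fits in budget_gb minus headroom_gb.

    Returns None when no listed file fits. headroom_gb is an assumed allowance
    for runtime memory beyond the weights themselves.
    """
    usable = budget_gb - headroom_gb
    fitting = [(size, name) for name, size in sizes.items() if size <= usable]
    return max(fitting)[1] if fitting else None


if __name__ == "__main__":
    for budget in (1.0, 2.0, 4.0):
        print(f"{budget:.1f} GB budget -> {largest_quant_within(budget)}")
```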