90 lines
3.1 KiB
Markdown
90 lines
3.1 KiB
Markdown
---
|
|
language:
|
|
- uz
|
|
- en
|
|
license: cc-by-nc-4.0
|
|
datasets:
|
|
- yakhyo/uz-wiki
|
|
- tahrirchi/uz-books-v2
|
|
- tahrirchi/uz-crawl
|
|
- saillab/alpaca_uzbek_taco
|
|
- behbudiy/alpaca-cleaned-uz
|
|
- UAzimov/uzbek-instruct-llm
|
|
- CohereLabs/aya_collection_language_split
|
|
- med-alex/qa_mt_ru_to_uzn
|
|
- med-alex/qa_mt_tr_to_uzn
|
|
library_name: gguf
|
|
pipeline_tag: text-generation
|
|
base_model: inspirebek/qwen3-4b-uzbek-v2
|
|
tags:
|
|
- uzbek
|
|
- qwen3
|
|
- quantized
|
|
- gguf
|
|
- llama.cpp
|
|
- ollama
|
|
---
|
|
|
|
# qwen3-4b-uzbek-v2-gguf
|
|
|
|
gguf suite for [`inspirebek/qwen3-4b-uzbek-v2`](https://huggingface.co/inspirebek/qwen3-4b-uzbek-v2). cpu / apple silicon / vulkan / rocm via `llama.cpp`, ollama, lm studio, etc.
|
|
|
|
## files
|
|
|
|
| quant | size | notes |
|
|
|---|---|---|
|
|
| `f16` | 8.8 gb | reference fp16 |
|
|
| `Q8_0` | 4.7 gb | near-lossless |
|
|
| `Q6_K` | 3.6 gb | recommended for quality |
|
|
| `Q5_K_M` | 3.2 gb | balanced |
|
|
| `Q5_K_S` | 3.1 gb | slightly lighter |
|
|
| `Q4_K_M` | 2.7 gb | **recommended for most users** |
|
|
| `Q4_K_S` | 2.6 gb | smaller, slight quality loss |
|
|
| `Q3_K_M` | 2.2 gb | aggressive |
|
|
| `Q2_K` | 1.8 gb | edge / low-ram only |
|
|
|
|
## usage
|
|
|
|
**llama.cpp:**
|
|
|
|
```bash
|
|
llama-cli -m qwen3-4b-uzbek-v2-q4_k_m.gguf -p "Salom! Qalaysan?" -cnv
|
|
```
|
|
|
|
**ollama:**
|
|
|
|
```bash
|
|
ollama run hf.co/inspirebek/qwen3-4b-uzbek-v2-GGUF:Q4_K_M
|
|
```
|
|
|
|
## quantization
|
|
|
|
converted from the bf16 merged model via `llama.cpp`'s `convert_hf_to_gguf.py` → `llama-quantize`. no calibration data (k-quants are statistics-only).
|
|
|
|
## datasets
|
|
|
|
**stage a — fluency (continued pretraining):**
|
|
|
|
- [`yakhyo/uz-wiki`](https://huggingface.co/datasets/yakhyo/uz-wiki) · MIT
|
|
- [`tahrirchi/uz-books-v2`](https://huggingface.co/datasets/tahrirchi/uz-books-v2) · MIT
|
|
- [`tahrirchi/uz-crawl`](https://huggingface.co/datasets/tahrirchi/uz-crawl) · Apache-2.0
|
|
|
|
**stage b — instruct (sft):**
|
|
|
|
- [`saillab/alpaca_uzbek_taco`](https://huggingface.co/datasets/saillab/alpaca_uzbek_taco) · CC-BY-NC-4.0
|
|
- [`behbudiy/alpaca-cleaned-uz`](https://huggingface.co/datasets/behbudiy/alpaca-cleaned-uz) · CC-BY-4.0
|
|
- [`UAzimov/uzbek-instruct-llm`](https://huggingface.co/datasets/UAzimov/uzbek-instruct-llm) · Apache-2.0
|
|
- [`CohereLabs/aya_collection_language_split`](https://huggingface.co/datasets/CohereLabs/aya_collection_language_split) · Apache-2.0
|
|
- [`med-alex/qa_mt_ru_to_uzn`](https://huggingface.co/datasets/med-alex/qa_mt_ru_to_uzn) · unspecified
|
|
- [`med-alex/qa_mt_tr_to_uzn`](https://huggingface.co/datasets/med-alex/qa_mt_tr_to_uzn) · unspecified
|
|
|
|
> ⚠️ licensing note: `saillab/alpaca_uzbek_taco` is cc-by-nc-4.0, which restricts commercial use of derivative models. downstream users who need a fully permissive license should retrain without that subset.
|
|
|
|
## sibling formats
|
|
|
|
- [`inspirebek/qwen3-4b-uzbek-v2`](https://huggingface.co/inspirebek/qwen3-4b-uzbek-v2)
|
|
- [`inspirebek/qwen3-4b-uzbek-v2-lora`](https://huggingface.co/inspirebek/qwen3-4b-uzbek-v2-lora)
|
|
- [`inspirebek/qwen3-4b-uzbek-v2-bnb-4bit`](https://huggingface.co/inspirebek/qwen3-4b-uzbek-v2-bnb-4bit)
|
|
- [`inspirebek/qwen3-4b-uzbek-v2-awq`](https://huggingface.co/inspirebek/qwen3-4b-uzbek-v2-awq)
|
|
- [`inspirebek/qwen3-4b-uzbek-v2-GGUF`](https://huggingface.co/inspirebek/qwen3-4b-uzbek-v2-GGUF)
|