Files
Lyra-Uz/README.md
ModelHub XC 171015779a 初始化项目,由ModelHub XC社区提供模型
Model: Abduqodir06/Lyra-Uz
Source: Original Platform
2026-05-19 03:06:54 +08:00

66 lines
2.4 KiB
Markdown
Raw Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

---
license: apache-2.0
language:
- uz
- en
base_model: mistralai/Mistral-7B-Instruct-v0.3
library_name: transformers
tags:
- text-generation-inference
- summarization
- translation
- question-answering
pipeline_tag: text-generation
---
# LYRA-Uz
**LYRA-Uz** — ozbek tilida yuqori sifatli korsatmalarni bajaruvchi (instruction-tuned) ochiq manbali til modeli. **Mistral-7B-Instruct-v0.3** arxitekturasi asosida ozbek va ingliz tillaridagi maʼlumotlar bilan maxsus oʻqitilgan. Apache 2.0 litsenziyasi bilan erkin foydalanish mumkin.
## Asosiy imkoniyatlari
- **Savol-javob** — ozbek tilidagi umumiy bilim savollariga javob beradi
- **Matnni umumlashtirish** — uzoq matnlarni qisqacha bayon qiladi
- **Tarjima** — ozbek ↔ ingliz tillari ortasida tarjima qiladi
- **Matn tasnifi** — yangiliklar, hissiy tahlil va boshqa toifalarga ajratadi
- **Korsatmalarni bajarish** — berilgan vazifani ingliz va ozbek tillarida tushunib, bajaradi
## Tezkor ishlatish
```markdown
```python
from transformers import AutoTokenizer, AutoModelForCausalLM
tokenizer = AutoTokenizer.from_pretrained("Abduqodir06/Lyra-Uz")
model = AutoModelForCausalLM.from_pretrained(
"Abduqodir06/Lyra-Uz",
load_in_4bit=True,
device_map="auto"
)
prompt = "O'zbekiston poytaxti qaysi?"
inputs = tokenizer(prompt, return_tensors="pt").to("cuda")
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```
## LYRA loyihasi haqida
Ushbu model **LYRA (Large Uzbek Language Reasoning Architecture)** loyihasining bir qismidir. Toliq loyiha quyidagi bosqichlarni oz ichiga oladi (hozirda ustida ish olib borilmoqda):
1. **Tokenizer optimallashtirish** — ozbek tili morfologiyasiga mos BPE
2. **RAG va veb-qidiruv** — bilimlarni real vaqtda qidirib javob berish
3. **Deploy** — FastAPI, Telegram bot va ommaviy foydalanish
> **Hozirgi holat**: Ushbu repodagi model LYRA loyihasining birinchi tayyor komponentidir. Qoshimcha funksiyalar (RAG, veb-qidiruv, maxsus tokenizator) ustida ish olib borilmoqda.
## Texnik maʼlumotlar
| Xususiyat | Qiymat |
|-----------|--------|
| **Parametrlar soni** | 7 milliard |
| **Arxitektura** | Mistral-7B-Instruct-v0.3 |
| **Litsenziya** | Apache 2.0 |
| **GPU talabi (FP16)** | ~14.5 GB VRAM |
| **GPU talabi (4-bit)** | ~4.5 GB VRAM |
| **Tillari** | ozbek (asosiy), ingliz |