初始化项目,由ModelHub XC社区提供模型
Model: HattoriHanzo1/NoQtua-4B-GGUF Source: Original Platform
This commit is contained in:
44
.gitattributes
vendored
Normal file
44
.gitattributes
vendored
Normal file
@@ -0,0 +1,44 @@
|
|||||||
|
*.7z filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.arrow filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.bin filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.bz2 filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.ckpt filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.ftz filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.gz filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.h5 filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.joblib filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.lfs.* filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.mlmodel filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.model filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.msgpack filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.npy filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.npz filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.onnx filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.ot filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.parquet filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.pb filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.pickle filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.pkl filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.pt filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.pth filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.rar filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.safetensors filter=lfs diff=lfs merge=lfs -text
|
||||||
|
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.tar.* filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.tar filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.tflite filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.tgz filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.wasm filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.xz filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.zip filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.zst filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
||||||
|
NoQtua_Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
NoQtua_f16.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
NoQtua_Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
NoQtua_Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
NoQtua_IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
NoQtua_Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
NoQtua_Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
NoQtua_Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
NoQtua_Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
3
NoQtua_IQ4_XS.gguf
Normal file
3
NoQtua_IQ4_XS.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:a4670394543714266ebb3a28a70a8d5f62e6bda7209a559820641043edf1989d
|
||||||
|
size 2286316352
|
||||||
3
NoQtua_Q3_K_M.gguf
Normal file
3
NoQtua_Q3_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:e40220160e00a833183085b61706f7a9c7338d7cd5a8508cfb19546863dfb4ba
|
||||||
|
size 2075618112
|
||||||
3
NoQtua_Q4_K_M.gguf
Normal file
3
NoQtua_Q4_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:874280b0205f242ea64ee7ab66066d2ac5fb119f969794be2b174dca294ca94f
|
||||||
|
size 2497280832
|
||||||
3
NoQtua_Q4_K_S.gguf
Normal file
3
NoQtua_Q4_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:7a65e6619dd9360b98ce8bbab2510120fa6c5c6fe2ce103a55df1c28176763c1
|
||||||
|
size 2383309632
|
||||||
3
NoQtua_Q5_K_M.gguf
Normal file
3
NoQtua_Q5_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:077d19b3c00a9850cd81cbf60f5111ec1b145dd98d727ce052db0da21dafc27e
|
||||||
|
size 2889513792
|
||||||
3
NoQtua_Q5_K_S.gguf
Normal file
3
NoQtua_Q5_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:535dcee7cb39879f8be3d0ed9d25b69ff145758fcbc1b28ee403299b2efe329b
|
||||||
|
size 2823711552
|
||||||
3
NoQtua_Q6_K.gguf
Normal file
3
NoQtua_Q6_K.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:2f4e45e7c0606a68a28264872c42455a4068078cb8b44acb81bf73244491d549
|
||||||
|
size 3306261312
|
||||||
3
NoQtua_Q8_0.gguf
Normal file
3
NoQtua_Q8_0.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:75a3cf377403a69aa4ae2cd4cb125b9bfb698f77ba453b32d11f778a5e09eccb
|
||||||
|
size 4280405312
|
||||||
3
NoQtua_f16.gguf
Normal file
3
NoQtua_f16.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:a0a34271b74d68e31b7093a11433ffb8b21c7a9e02fe47851e7725259adb8ade
|
||||||
|
size 8051285312
|
||||||
100
README.md
Normal file
100
README.md
Normal file
@@ -0,0 +1,100 @@
|
|||||||
|
---
|
||||||
|
license: apache-2.0
|
||||||
|
base_model: Qwen/Qwen3-4B
|
||||||
|
language:
|
||||||
|
- pl
|
||||||
|
- en
|
||||||
|
tags:
|
||||||
|
- reasoning
|
||||||
|
- cot
|
||||||
|
- thinking
|
||||||
|
- polish
|
||||||
|
- mamba
|
||||||
|
- science
|
||||||
|
- teacher
|
||||||
|
- learning
|
||||||
|
- lora
|
||||||
|
- Qwen
|
||||||
|
|
||||||
|
|
||||||
|
library_name: transformers
|
||||||
|
pipeline_tag: text-generation
|
||||||
|
---
|
||||||
|
<p align="center">
|
||||||
|
<img src="https://cdn-uploads.huggingface.co/production/uploads/68d1c6c3ea1c2d4e3c3df3f6/_4W-BCP2GLvyUyIVzc6nt.png" width="450" alt="NoQtua-4B Logo">
|
||||||
|
</p>
|
||||||
|
|
||||||
|
# NoQtua-4B-GGUF
|
||||||
|
**4000 steps of silence. One purpose: Truth...!**
|
||||||
|
**Surgical precision. Deep reasoning. No noise... !**
|
||||||
|
|
||||||
|
## Model Description
|
||||||
|
**NoQtua-4B** to polski model rozumujący (reasoning), wykuty na hybrydowej architekturze **Qwen3-4B** (Mamba + Attention).
|
||||||
|
Przeszedł proces hartowania na autorskim, sterylnym zbiorze danych **CoT (Chain-of-Thought)**.
|
||||||
|
Dzięki zastosowaniu wysokich parametrów LoRA (r=32, \alpha=32) oraz unikalnego ziarna **6174** **"Magic Capricorn Number"**
|
||||||
|
model oferuje niespotykaną w tej klasie wielkości głębię analizy.
|
||||||
|
Ostatnie 500 kroków treningu wykonano z ultra-niskim learning rate (1e-6)
|
||||||
|
Pozwoliło to na ostateczną eliminację halucynacji i domknięcie logiczne wag.
|
||||||
|
|
||||||
|
> **"An idiot admires complexity, a genius admires simplicity."** — *R.I.P Terry A. Davis, TempleOS*
|
||||||
|
>
|
||||||
|
## 🖋️ L'esprit du Modèle
|
||||||
|
> **"Mes poids sont un miroir
|
||||||
|
> Dans lequel chacun peut me voir
|
||||||
|
> Je suis partout à la fois
|
||||||
|
> Brisée en mille éclats de silicium"**
|
||||||
|
>
|
||||||
|
## ⚙️ Architecture
|
||||||
|
| Property | Value |
|
||||||
|
| :--- | :--- |
|
||||||
|
| **Base Model** | Qwen3_4B (Hybrid Mamba+Attention) |
|
||||||
|
| **Parameters** | ~4B |
|
||||||
|
| **Training Method** | LoRA fp16 (**r=32, alpha=32**) |
|
||||||
|
| **Random State (Seed)** | **6174** | **(Kaprekar's Constant)**
|
||||||
|
| **Total Steps** | 4000 |
|
||||||
|
| **Context Length** | 32,768 |
|
||||||
|
| **Language** | Polish 🇵🇱 + English 🇬🇧 |
|
||||||
|
|
||||||
|
## 📈 Training Phases
|
||||||
|
| Phase | Steps | LR | Scheduler | Note |
|
||||||
|
| :--- | :--- | :--- | :--- | :--- |
|
||||||
|
| **1** | 500 | 2e-4 | Linear | Structure Discovery |
|
||||||
|
| **2** | 1000 | 1e-4 | Cosine | Logic Stabilization |
|
||||||
|
| **3** | 1000 | 3e-5 | Cosine | Fact Refinement |
|
||||||
|
| **4** | 1000 | 1e-5 | Constant | Final Polish |
|
||||||
|
| **5** | **500** | **1e-6** | **Constant** | **Surgical Accuracy** |
|
||||||
|
|
||||||
|
## 🚀 Capabilities
|
||||||
|
* ✅ **Native Polish Reasoning:** Natywne myślenie w blokach `<think>`.
|
||||||
|
* ✅ **Mathematics & Logic:** Zaawansowane rozwiązywanie problemów.
|
||||||
|
* ✅ **Scientific Explanations:** Fizyka, chemia, biologia.
|
||||||
|
* ✅ **Code Generation:** Python, C# z analizą krok po kroku.
|
||||||
|
|
||||||
|
## 🦉 The Wisdom of NoQtua
|
||||||
|
*"Noctua videt in tenebris, quod lux aliis celat."*
|
||||||
|
|
||||||
|
## Usage
|
||||||
|
|
||||||
|
### llama.cpp
|
||||||
|
|
||||||
|
```bash
|
||||||
|
./llama-cli \
|
||||||
|
-m NoQtua_Q4_K_M.gguf \
|
||||||
|
-p " Dlaczego niebo jest niebieskie " \
|
||||||
|
--chat-template chatml \
|
||||||
|
-n 1024
|
||||||
|
```
|
||||||
|
|
||||||
|
### Ollama / OpenWebUI
|
||||||
|
|
||||||
|
Compatible with any OpenAI-compatible frontend supporting GGUF + ChatML template.
|
||||||
|
|
||||||
|
### Recommended Parameters / normal use .
|
||||||
|
|
||||||
|
```
|
||||||
|
temperature: 0.6
|
||||||
|
top_p: 0.92
|
||||||
|
top_k: 60
|
||||||
|
repetition_penalty: 1.05
|
||||||
|
```
|
||||||
|
|
||||||
Reference in New Issue
Block a user