初始化项目,由ModelHub XC社区提供模型
Model: HattoriHanzo1/NoQtua-4B-GGUF Source: Original Platform
This commit is contained in:
44
.gitattributes
vendored
Normal file
44
.gitattributes
vendored
Normal file
@@ -0,0 +1,44 @@
|
||||
*.7z filter=lfs diff=lfs merge=lfs -text
|
||||
*.arrow filter=lfs diff=lfs merge=lfs -text
|
||||
*.bin filter=lfs diff=lfs merge=lfs -text
|
||||
*.bz2 filter=lfs diff=lfs merge=lfs -text
|
||||
*.ckpt filter=lfs diff=lfs merge=lfs -text
|
||||
*.ftz filter=lfs diff=lfs merge=lfs -text
|
||||
*.gz filter=lfs diff=lfs merge=lfs -text
|
||||
*.h5 filter=lfs diff=lfs merge=lfs -text
|
||||
*.joblib filter=lfs diff=lfs merge=lfs -text
|
||||
*.lfs.* filter=lfs diff=lfs merge=lfs -text
|
||||
*.mlmodel filter=lfs diff=lfs merge=lfs -text
|
||||
*.model filter=lfs diff=lfs merge=lfs -text
|
||||
*.msgpack filter=lfs diff=lfs merge=lfs -text
|
||||
*.npy filter=lfs diff=lfs merge=lfs -text
|
||||
*.npz filter=lfs diff=lfs merge=lfs -text
|
||||
*.onnx filter=lfs diff=lfs merge=lfs -text
|
||||
*.ot filter=lfs diff=lfs merge=lfs -text
|
||||
*.parquet filter=lfs diff=lfs merge=lfs -text
|
||||
*.pb filter=lfs diff=lfs merge=lfs -text
|
||||
*.pickle filter=lfs diff=lfs merge=lfs -text
|
||||
*.pkl filter=lfs diff=lfs merge=lfs -text
|
||||
*.pt filter=lfs diff=lfs merge=lfs -text
|
||||
*.pth filter=lfs diff=lfs merge=lfs -text
|
||||
*.rar filter=lfs diff=lfs merge=lfs -text
|
||||
*.safetensors filter=lfs diff=lfs merge=lfs -text
|
||||
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
||||
*.tar.* filter=lfs diff=lfs merge=lfs -text
|
||||
*.tar filter=lfs diff=lfs merge=lfs -text
|
||||
*.tflite filter=lfs diff=lfs merge=lfs -text
|
||||
*.tgz filter=lfs diff=lfs merge=lfs -text
|
||||
*.wasm filter=lfs diff=lfs merge=lfs -text
|
||||
*.xz filter=lfs diff=lfs merge=lfs -text
|
||||
*.zip filter=lfs diff=lfs merge=lfs -text
|
||||
*.zst filter=lfs diff=lfs merge=lfs -text
|
||||
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
||||
NoQtua_Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
NoQtua_f16.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
NoQtua_Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
NoQtua_Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
NoQtua_IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
NoQtua_Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
NoQtua_Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
NoQtua_Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
NoQtua_Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
3
NoQtua_IQ4_XS.gguf
Normal file
3
NoQtua_IQ4_XS.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:a4670394543714266ebb3a28a70a8d5f62e6bda7209a559820641043edf1989d
|
||||
size 2286316352
|
||||
3
NoQtua_Q3_K_M.gguf
Normal file
3
NoQtua_Q3_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:e40220160e00a833183085b61706f7a9c7338d7cd5a8508cfb19546863dfb4ba
|
||||
size 2075618112
|
||||
3
NoQtua_Q4_K_M.gguf
Normal file
3
NoQtua_Q4_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:874280b0205f242ea64ee7ab66066d2ac5fb119f969794be2b174dca294ca94f
|
||||
size 2497280832
|
||||
3
NoQtua_Q4_K_S.gguf
Normal file
3
NoQtua_Q4_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:7a65e6619dd9360b98ce8bbab2510120fa6c5c6fe2ce103a55df1c28176763c1
|
||||
size 2383309632
|
||||
3
NoQtua_Q5_K_M.gguf
Normal file
3
NoQtua_Q5_K_M.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:077d19b3c00a9850cd81cbf60f5111ec1b145dd98d727ce052db0da21dafc27e
|
||||
size 2889513792
|
||||
3
NoQtua_Q5_K_S.gguf
Normal file
3
NoQtua_Q5_K_S.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:535dcee7cb39879f8be3d0ed9d25b69ff145758fcbc1b28ee403299b2efe329b
|
||||
size 2823711552
|
||||
3
NoQtua_Q6_K.gguf
Normal file
3
NoQtua_Q6_K.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:2f4e45e7c0606a68a28264872c42455a4068078cb8b44acb81bf73244491d549
|
||||
size 3306261312
|
||||
3
NoQtua_Q8_0.gguf
Normal file
3
NoQtua_Q8_0.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:75a3cf377403a69aa4ae2cd4cb125b9bfb698f77ba453b32d11f778a5e09eccb
|
||||
size 4280405312
|
||||
3
NoQtua_f16.gguf
Normal file
3
NoQtua_f16.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:a0a34271b74d68e31b7093a11433ffb8b21c7a9e02fe47851e7725259adb8ade
|
||||
size 8051285312
|
||||
100
README.md
Normal file
100
README.md
Normal file
@@ -0,0 +1,100 @@
|
||||
---
|
||||
license: apache-2.0
|
||||
base_model: Qwen/Qwen3-4B
|
||||
language:
|
||||
- pl
|
||||
- en
|
||||
tags:
|
||||
- reasoning
|
||||
- cot
|
||||
- thinking
|
||||
- polish
|
||||
- mamba
|
||||
- science
|
||||
- teacher
|
||||
- learning
|
||||
- lora
|
||||
- Qwen
|
||||
|
||||
|
||||
library_name: transformers
|
||||
pipeline_tag: text-generation
|
||||
---
|
||||
<p align="center">
|
||||
<img src="https://cdn-uploads.huggingface.co/production/uploads/68d1c6c3ea1c2d4e3c3df3f6/_4W-BCP2GLvyUyIVzc6nt.png" width="450" alt="NoQtua-4B Logo">
|
||||
</p>
|
||||
|
||||
# NoQtua-4B-GGUF
|
||||
**4000 steps of silence. One purpose: Truth...!**
|
||||
**Surgical precision. Deep reasoning. No noise... !**
|
||||
|
||||
## Model Description
|
||||
**NoQtua-4B** to polski model rozumujący (reasoning), wykuty na hybrydowej architekturze **Qwen3-4B** (Mamba + Attention).
|
||||
Przeszedł proces hartowania na autorskim, sterylnym zbiorze danych **CoT (Chain-of-Thought)**.
|
||||
Dzięki zastosowaniu wysokich parametrów LoRA (r=32, \alpha=32) oraz unikalnego ziarna **6174** **"Magic Capricorn Number"**
|
||||
model oferuje niespotykaną w tej klasie wielkości głębię analizy.
|
||||
Ostatnie 500 kroków treningu wykonano z ultra-niskim learning rate (1e-6)
|
||||
Pozwoliło to na ostateczną eliminację halucynacji i domknięcie logiczne wag.
|
||||
|
||||
> **"An idiot admires complexity, a genius admires simplicity."** — *R.I.P Terry A. Davis, TempleOS*
|
||||
>
|
||||
## 🖋️ L'esprit du Modèle
|
||||
> **"Mes poids sont un miroir
|
||||
> Dans lequel chacun peut me voir
|
||||
> Je suis partout à la fois
|
||||
> Brisée en mille éclats de silicium"**
|
||||
>
|
||||
## ⚙️ Architecture
|
||||
| Property | Value |
|
||||
| :--- | :--- |
|
||||
| **Base Model** | Qwen3_4B (Hybrid Mamba+Attention) |
|
||||
| **Parameters** | ~4B |
|
||||
| **Training Method** | LoRA fp16 (**r=32, alpha=32**) |
|
||||
| **Random State (Seed)** | **6174** | **(Kaprekar's Constant)**
|
||||
| **Total Steps** | 4000 |
|
||||
| **Context Length** | 32,768 |
|
||||
| **Language** | Polish 🇵🇱 + English 🇬🇧 |
|
||||
|
||||
## 📈 Training Phases
|
||||
| Phase | Steps | LR | Scheduler | Note |
|
||||
| :--- | :--- | :--- | :--- | :--- |
|
||||
| **1** | 500 | 2e-4 | Linear | Structure Discovery |
|
||||
| **2** | 1000 | 1e-4 | Cosine | Logic Stabilization |
|
||||
| **3** | 1000 | 3e-5 | Cosine | Fact Refinement |
|
||||
| **4** | 1000 | 1e-5 | Constant | Final Polish |
|
||||
| **5** | **500** | **1e-6** | **Constant** | **Surgical Accuracy** |
|
||||
|
||||
## 🚀 Capabilities
|
||||
* ✅ **Native Polish Reasoning:** Natywne myślenie w blokach `<think>`.
|
||||
* ✅ **Mathematics & Logic:** Zaawansowane rozwiązywanie problemów.
|
||||
* ✅ **Scientific Explanations:** Fizyka, chemia, biologia.
|
||||
* ✅ **Code Generation:** Python, C# z analizą krok po kroku.
|
||||
|
||||
## 🦉 The Wisdom of NoQtua
|
||||
*"Noctua videt in tenebris, quod lux aliis celat."*
|
||||
|
||||
## Usage
|
||||
|
||||
### llama.cpp
|
||||
|
||||
```bash
|
||||
./llama-cli \
|
||||
-m NoQtua_Q4_K_M.gguf \
|
||||
-p " Dlaczego niebo jest niebieskie " \
|
||||
--chat-template chatml \
|
||||
-n 1024
|
||||
```
|
||||
|
||||
### Ollama / OpenWebUI
|
||||
|
||||
Compatible with any OpenAI-compatible frontend supporting GGUF + ChatML template.
|
||||
|
||||
### Recommended Parameters / normal use .
|
||||
|
||||
```
|
||||
temperature: 0.6
|
||||
top_p: 0.92
|
||||
top_k: 60
|
||||
repetition_penalty: 1.05
|
||||
```
|
||||
|
||||
Reference in New Issue
Block a user