commit 3696ec5cace39bf35b6138fc327a76fd30c0cebd Author: ModelHub XC Date: Sun Apr 26 00:34:00 2026 +0800 初始化项目,由ModelHub XC社区提供模型 Model: Haubaa/SANU-AI-7B-v0.2-GGUF Source: Original Platform diff --git a/.gitattributes b/.gitattributes new file mode 100644 index 0000000..71807d1 --- /dev/null +++ b/.gitattributes @@ -0,0 +1,37 @@ +*.7z filter=lfs diff=lfs merge=lfs -text +*.arrow filter=lfs diff=lfs merge=lfs -text +*.bin filter=lfs diff=lfs merge=lfs -text +*.bz2 filter=lfs diff=lfs merge=lfs -text +*.ckpt filter=lfs diff=lfs merge=lfs -text +*.ftz filter=lfs diff=lfs merge=lfs -text +*.gz filter=lfs diff=lfs merge=lfs -text +*.h5 filter=lfs diff=lfs merge=lfs -text +*.joblib filter=lfs diff=lfs merge=lfs -text +*.lfs.* filter=lfs diff=lfs merge=lfs -text +*.mlmodel filter=lfs diff=lfs merge=lfs -text +*.model filter=lfs diff=lfs merge=lfs -text +*.msgpack filter=lfs diff=lfs merge=lfs -text +*.npy filter=lfs diff=lfs merge=lfs -text +*.npz filter=lfs diff=lfs merge=lfs -text +*.onnx filter=lfs diff=lfs merge=lfs -text +*.ot filter=lfs diff=lfs merge=lfs -text +*.parquet filter=lfs diff=lfs merge=lfs -text +*.pb filter=lfs diff=lfs merge=lfs -text +*.pickle filter=lfs diff=lfs merge=lfs -text +*.pkl filter=lfs diff=lfs merge=lfs -text +*.pt filter=lfs diff=lfs merge=lfs -text +*.pth filter=lfs diff=lfs merge=lfs -text +*.rar filter=lfs diff=lfs merge=lfs -text +*.safetensors filter=lfs diff=lfs merge=lfs -text +saved_model/**/* filter=lfs diff=lfs merge=lfs -text +*.tar.* filter=lfs diff=lfs merge=lfs -text +*.tar filter=lfs diff=lfs merge=lfs -text +*.tflite filter=lfs diff=lfs merge=lfs -text +*.tgz filter=lfs diff=lfs merge=lfs -text +*.wasm filter=lfs diff=lfs merge=lfs -text +*.xz filter=lfs diff=lfs merge=lfs -text +*.zip filter=lfs diff=lfs merge=lfs -text +*.zst filter=lfs diff=lfs merge=lfs -text +*tfevents* filter=lfs diff=lfs merge=lfs -text +sanu-ai-v02-q4_k_m.gguf filter=lfs diff=lfs merge=lfs -text +sanu-ai-v02-q8_0.gguf filter=lfs diff=lfs merge=lfs -text diff --git a/README.md b/README.md new file mode 100644 index 0000000..7a9217b --- /dev/null +++ b/README.md @@ -0,0 +1,260 @@ +--- +language: + - ne + - en +license: apache-2.0 +base_model: Qwen/Qwen2.5-7B-Instruct +pipeline_tag: text-generation +tags: + - sanu-ai + - nepal + - nepali + - gguf + - ollama + - llama-cpp + - quantized + - bilingual + - south-asia + - low-resource +--- + +
+ +# SANU AI 7B v0.2 GGUF + +### Run Nepal's First AI Locally — 3 Commands, 5 Minutes + +**Ollama Ready | Q4_K_M (4.68 GB) | Q8_0 (8.10 GB)** + +[![License](https://img.shields.io/badge/License-Apache_2.0-blue.svg)](https://opensource.org/licenses/Apache-2.0) +[![LoRA](https://img.shields.io/badge/LoRA-Adapter-purple.svg)](https://huggingface.co/Haubaa/SANU-AI-7B-v0.2) +[![Ollama](https://img.shields.io/badge/Ollama-Ready-green.svg)](#quick-start) +[![Made in Nepal](https://img.shields.io/badge/Made_in-Nepal_%F0%9F%87%B3%F0%9F%87%B5-red.svg)](#) + +--- + +*From zero to chatting with Nepal's AI in under 5 minutes.* + +
+ +--- + +## Quick Start (Copy-Paste Ready) + +### Step 1: Install Ollama +```bash +# macOS / Linux +curl -fsSL https://ollama.com/install.sh | sh + +# Windows — download from https://ollama.com/download +``` + +### Step 2: Download SANU AI +```bash +# Download the Q4_K_M model (4.68 GB — recommended for most users) +curl -L -o sanu.gguf https://huggingface.co/Haubaa/SANU-AI-7B-v0.2-GGUF/resolve/main/sanu-ai-v02-q4_k_m.gguf +``` + +### Step 3: Create & Run +```bash +cat > Modelfile << 'EOF' +FROM ./sanu.gguf +SYSTEM "You are SANU AI, Nepal's first agentic AI assistant. You speak Nepali and English fluently. You have deep knowledge of Nepal's history, culture, geography, and current affairs." +PARAMETER temperature 0.7 +PARAMETER num_ctx 2048 +TEMPLATE "<|im_start|>system\n{{ .System }}<|im_end|>\n<|im_start|>user\n{{ .Prompt }}<|im_end|>\n<|im_start|>assistant\n" +EOF + +ollama create sanu -f Modelfile +ollama run sanu +``` + +**That's it. You're now chatting with Nepal's first AI.** + +--- + +## Available Models + +| Model | Size | RAM Required | Speed | Best For | +|-------|------|-------------|-------|----------| +| **Q4_K_M** (Recommended) | **4.68 GB** | 8 GB+ | Fast | Daily use, laptops, most GPUs | +| Q8_0 (High Quality) | 8.10 GB | 12 GB+ | Moderate | Maximum quality, research | + +--- + +## What Can SANU Do? + +### Talk in Nepali +``` +>>> नमस्ते, कस्तो छ? +नमस्ते! मलाई नेपालका सबैभन्दा प्रशंसित AI सहयोजक SANU मा बुझाएका हुँ। +``` + +### Nepal Knowledge +``` +>>> Tell me about Nepal +Nepal, officially the Federal Democratic Republic of Nepal, is a landlocked +country in South Asia bordered by India to the south and east, China (Tibet) +to the north... Kathmandu is its capital and largest city. The Himalayas form +part of Nepal's northern border and include Mount Everest, the world's +highest peak... +``` + +### Write Code +``` +>>> Write a Python hello world that prints namaste in Nepali +**THINK:** To write a Python script that prints "Namaste" in Nepali... +```python +def print_namaste(): + name = u'नमस्ते' + print(name) +print_namaste() +``` + +### Answer in Pure Nepali +``` +>>> नेपालमा कति जिल्ला छन्? +नेपालको सङ्घीय शासन प्रणालीले ७५ जिल्लाहरू नियुक्त गर्दछ... +``` + +--- + +## v0.1 → v0.2: What Changed? + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
Whatv0.1v0.2Improvement
Training Data290 samples10,096 samples35x more data
Loss1.37240.79542% lower
Hindi ContaminationSevereMinimalClean Nepali
IdentityConfused (says "Qwen")"I am SANU AI"Fixed
Nepal KnowledgeSurface-levelDeep (Wikipedia + News)Massive upgrade
LoRA Rankr=16r=322x model capacity
Q4_K_M Size4.36 GB4.68 GBSame ballpark
Cost$0$0Still free
+ +### Response Comparison: v0.1 vs v0.2 + +| Prompt | v0.1 | v0.2 | +|--------|------|------| +| तिमी को हो? | "मैं Qwen हूँ, Alibaba Cloud द्वारा बनाया गया..." (Hindi, wrong identity) | "म SANU AI हुँ, नेपालको पहिलो AI सहायक" (Nepali, correct) | +| Tell me about Nepal | Generic 2-3 sentences | Detailed: history, geography, politics, culture, economy | +| नमस्ते, कस्तो छ? | "नमस्ते! मैं ठीक हूँ, धन्यवाद!" (Hindi mixed) | Pure Nepali response with SANU identity | +| Write Python code | Basic output | Structured THINK/PLAN/CODE/VERIFY format | +| सगरमाथा बारेमा | "माउंट एवरेस्ट दुनिया का सबसे ऊंचा पर्वत है" (Hindi) | Detailed Nepali: coordinates, history, UNESCO heritage | + +--- + +## Training Details + +| Spec | Value | +|------|-------| +| Base Model | Qwen 2.5 7B Instruct | +| Method | QLoRA 4-bit | +| LoRA Rank | r=32, alpha=64 | +| Data | 10,096 Nepali samples (Wikipedia + News + Custom) | +| Loss | 0.795 | +| GPU | NVIDIA P100 (Kaggle free tier) | +| Time | ~6 hours | +| Cost | **$0** | + +--- + +## System Requirements + +| Component | Minimum | Recommended | +|-----------|---------|-------------| +| **RAM** | 8 GB | 16 GB | +| **GPU VRAM** | Not required | 6 GB+ (for GPU acceleration) | +| **Disk** | 5 GB (Q4) / 9 GB (Q8) | 10 GB | +| **OS** | Windows 10+ / macOS 12+ / Linux | Any modern OS | + +--- + +## Roadmap + +| Version | Data | Status | +|---------|------|--------| +| v0.1 | 290 samples | Released | +| **v0.2** | **10,096 samples** | **Current** | +| v0.3 | 50K+ samples | Coming Soon | +| v1.0 | 200K+ samples | In Planning | + +--- + +## Why This Matters + +**30 million Nepali speakers. 125+ languages in Nepal. Zero dedicated AI models — until now.** + +Every global LLM treats Nepali as noise. They confuse it with Hindi. They know nothing about Dashain, dal bhat, or the Malla dynasty. SANU AI is different — trained on real Nepali text from real Nepali sources. + +And it cost **$0** to build. + +--- + +## Citation + +```bibtex +@misc{sanu-ai-2026, + title={SANU AI: Nepal's First Agentic AI Foundation Model}, + author={Haubaa}, + year={2026}, + publisher={HuggingFace}, + url={https://huggingface.co/Haubaa/SANU-AI-7B-v0.2-GGUF} +} +``` + +--- + +
+ +**Built in Nepal 🇳🇵 | For Nepal | For the World** + +*Training cost: $0 | Dreams: Priceless* + +
diff --git a/sanu-ai-v02-q4_k_m.gguf b/sanu-ai-v02-q4_k_m.gguf new file mode 100644 index 0000000..656e1f9 --- /dev/null +++ b/sanu-ai-v02-q4_k_m.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:f07c72a061a8e76f4e3e3e00a49c8f84b45756320c11cb756f18fb4e181fa289 +size 4681086976 diff --git a/sanu-ai-v02-q8_0.gguf b/sanu-ai-v02-q8_0.gguf new file mode 100644 index 0000000..6b938c5 --- /dev/null +++ b/sanu-ai-v02-q8_0.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:8043798395edbaf54a80064d957d56ca0843ce0aa63d1379a6afc9d6ca1330ef +size 8095477280