From 049a4543b10fcddc52727f8748448a8a772c9006 Mon Sep 17 00:00:00 2001
From: ModelHub XC
Date: Sun, 12 Apr 2026 15:32:00 +0800
Subject: [PATCH] Initialize project; model provided by the ModelHub XC community
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

Model: omerasim/fehm-8b-v1-GGUF
Source: Original Platform
---
 .gitattributes         | 37 ++++++++++++++++++++++
 README.md              | 71 ++++++++++++++++++++++++++++++++++++++++++
 fehm-8b-v1-Q4_K_M.gguf |  3 ++
 fehm-8b-v1-Q8_0.gguf   |  3 ++
 4 files changed, 114 insertions(+)
 create mode 100644 .gitattributes
 create mode 100644 README.md
 create mode 100644 fehm-8b-v1-Q4_K_M.gguf
 create mode 100644 fehm-8b-v1-Q8_0.gguf

diff --git a/.gitattributes b/.gitattributes
new file mode 100644
index 0000000..7600fa6
--- /dev/null
+++ b/.gitattributes
@@ -0,0 +1,37 @@
+*.7z filter=lfs diff=lfs merge=lfs -text
+*.arrow filter=lfs diff=lfs merge=lfs -text
+*.bin filter=lfs diff=lfs merge=lfs -text
+*.bz2 filter=lfs diff=lfs merge=lfs -text
+*.ckpt filter=lfs diff=lfs merge=lfs -text
+*.ftz filter=lfs diff=lfs merge=lfs -text
+*.gz filter=lfs diff=lfs merge=lfs -text
+*.h5 filter=lfs diff=lfs merge=lfs -text
+*.joblib filter=lfs diff=lfs merge=lfs -text
+*.lfs.* filter=lfs diff=lfs merge=lfs -text
+*.mlmodel filter=lfs diff=lfs merge=lfs -text
+*.model filter=lfs diff=lfs merge=lfs -text
+*.msgpack filter=lfs diff=lfs merge=lfs -text
+*.npy filter=lfs diff=lfs merge=lfs -text
+*.npz filter=lfs diff=lfs merge=lfs -text
+*.onnx filter=lfs diff=lfs merge=lfs -text
+*.ot filter=lfs diff=lfs merge=lfs -text
+*.parquet filter=lfs diff=lfs merge=lfs -text
+*.pb filter=lfs diff=lfs merge=lfs -text
+*.pickle filter=lfs diff=lfs merge=lfs -text
+*.pkl filter=lfs diff=lfs merge=lfs -text
+*.pt filter=lfs diff=lfs merge=lfs -text
+*.pth filter=lfs diff=lfs merge=lfs -text
+*.rar filter=lfs diff=lfs merge=lfs -text
+*.safetensors filter=lfs diff=lfs merge=lfs -text
+saved_model/**/* filter=lfs diff=lfs merge=lfs -text
+*.tar.* filter=lfs diff=lfs merge=lfs -text
+*.tar filter=lfs diff=lfs merge=lfs -text
+*.tflite filter=lfs diff=lfs merge=lfs -text
+*.tgz filter=lfs diff=lfs merge=lfs -text
+*.wasm filter=lfs diff=lfs merge=lfs -text
+*.xz filter=lfs diff=lfs merge=lfs -text
+*.zip filter=lfs diff=lfs merge=lfs -text
+*.zst filter=lfs diff=lfs merge=lfs -text
+*tfevents* filter=lfs diff=lfs merge=lfs -text
+fehm-8b-v1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+fehm-8b-v1-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
diff --git a/README.md b/README.md
new file mode 100644
index 0000000..902ffc1
--- /dev/null
+++ b/README.md
@@ -0,0 +1,71 @@
+---
+license: apache-2.0
+language:
+  - tr
+  - en
+base_model: Qwen/Qwen3-8B
+tags:
+  - text-generation
+  - conversational
+  - turkish
+  - nage
+  - conflux
+  - gguf
+model_name: Fehm-8B
+pipeline_tag: text-generation
+---
+
+# Fehm-8B GGUF — Nage AI
+
+**Fehm** (Arabic: understanding, deep comprehension) is Nage AI's general assistant model.
+Fine-tuned from Qwen3-8B using [CONFLUX](https://github.com/NageAI/conflux) cross-architecture knowledge transfer.
+
+## Downloads
+
+| File | Quant | Size | Use case |
+|------|-------|------|----------|
+| fehm-8b-v1-Q4_K_M.gguf | Q4_K_M | 5.0 GB | Ollama, LM Studio, mobile |
+| fehm-8b-v1-Q8_0.gguf | Q8_0 | 8.7 GB | High quality inference |
+
+## Benchmarks
+
+| Benchmark | Score |
+|-----------|-------|
+| MMLU (5-shot) | **74.95%** |
+| HumanEval (pass@1) | **71.34%** |
+| SFT val loss | 0.671 (CONFLUX) vs 0.717 (baseline) |
+
+## CONFLUX Framework
+
+Trained with [CONFLUX v0.3.0](https://github.com/NageAI/conflux) — cross-architecture knowledge transfer (Llama-3.1-8B source).
+
+- **6.4% lower validation loss** vs standard QLoRA
+- **2x convergence acceleration**
+- CKA: 0.817 | EVR: 0.9946
+
+## Training
+
+- **Base:** Qwen3-8B (Apache 2.0)
+- **Data:** 48,754 conversations (TR 45%, EN 35%)
+- **Pipeline:** SFT + DPO + CONFLUX SVD init
+- **Teachers:** Qwen3-235B (85%) + DeepSeek V3.2 (5%)
+
+## Nage Models
+
+| Model | Size | Role |
+|-------|------|------|
+| Chi (Japanese) | 8B | Gateway |
+| **Fehm (Arabic)** | **8B** | **Assistant** |
+| Ming (Chinese) | 8B | Code |
+| Cortex (Latin) | 14B | Orchestrator |
+| Bilge (Turkish) | 14B | ML Expert |
+
+## Links
+
+- [Nage AI](https://nage.ai)
+- [CONFLUX](https://github.com/NageAI/conflux)
+- [FP16 Model](https://huggingface.co/omerasim/fehm-8b-conflux-v1)
+
+## License
+
+Apache 2.0
diff --git a/fehm-8b-v1-Q4_K_M.gguf b/fehm-8b-v1-Q4_K_M.gguf
new file mode 100644
index 0000000..46eea19
--- /dev/null
+++ b/fehm-8b-v1-Q4_K_M.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:614b73c421591cf6a9f95cdfda8f630ccef720ed8270842d253156569744d1ce
+size 5027783872
diff --git a/fehm-8b-v1-Q8_0.gguf b/fehm-8b-v1-Q8_0.gguf
new file mode 100644
index 0000000..350a3d0
--- /dev/null
+++ b/fehm-8b-v1-Q8_0.gguf
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:5491bfbbfb23261530b6036a6f2091bc73c120d82b1fa80c8db0481c646c01a4
+size 8709519008
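
Reviewer note: the two `.gguf` entries in this patch are Git LFS pointers (a `version` line, an `oid sha256:` digest, and a `size` in bytes), not the model weights themselves. A minimal sketch of how a downloaded file could be checked against such a pointer is shown below. The helper names `parse_lfs_pointer` and `matches_pointer` and the local path in the usage note are hypothetical and not part of this commit; the pointer text is copied from the Q4_K_M hunk above.

```python
import hashlib
from pathlib import Path

# Pointer body copied from the fehm-8b-v1-Q4_K_M.gguf hunk in this patch.
Q4_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:614b73c421591cf6a9f95cdfda8f630ccef720ed8270842d253156569744d1ce
size 5027783872
"""


def parse_lfs_pointer(text: str) -> dict:
    """Split a Git LFS pointer into its hash algorithm, hex digest and byte size."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {"algo": algo, "digest": digest, "size": int(fields["size"])}


def matches_pointer(path: Path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the size and digest the pointer records."""
    # Cheap check first: a size mismatch avoids hashing several gigabytes.
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as fh:
        # Stream in chunks so the whole model never has to fit in memory.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]
```

Assuming the Q4_K_M file has been downloaded next to the script, `matches_pointer(Path("fehm-8b-v1-Q4_K_M.gguf"), parse_lfs_pointer(Q4_POINTER))` would report whether the local copy corresponds to the object this commit references. The recorded size of 5027783872 bytes is also consistent with the 5.0 GB listed in the README's Downloads table.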