From c3fb8c659122a5c5280d4342a239c458bc38a353 Mon Sep 17 00:00:00 2001 From: ModelHub XC Date: Sat, 18 Apr 2026 14:33:05 +0800 Subject: [PATCH] =?UTF-8?q?=E5=88=9D=E5=A7=8B=E5=8C=96=E9=A1=B9=E7=9B=AE?= =?UTF-8?q?=EF=BC=8C=E7=94=B1ModelHub=20XC=E7=A4=BE=E5=8C=BA=E6=8F=90?= =?UTF-8?q?=E4=BE=9B=E6=A8=A1=E5=9E=8B?= MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Model: forkjoin-ai/buleyean-smollm2-360m Source: Original Platform --- .gitattributes | 38 +++++++++++++++++++++ README.md | 57 +++++++++++++++++++++++++++++++ buleyean-smollm2-360m-f16.gguf | 3 ++ buleyean-smollm2-360m-q4_k_m.gguf | 3 ++ buleyean-smollm2-360m-q8_0.gguf | 3 ++ 5 files changed, 104 insertions(+) create mode 100644 .gitattributes create mode 100644 README.md create mode 100644 buleyean-smollm2-360m-f16.gguf create mode 100644 buleyean-smollm2-360m-q4_k_m.gguf create mode 100644 buleyean-smollm2-360m-q8_0.gguf diff --git a/.gitattributes b/.gitattributes new file mode 100644 index 0000000..9ddf6b5 --- /dev/null +++ b/.gitattributes @@ -0,0 +1,38 @@ +*.7z filter=lfs diff=lfs merge=lfs -text +*.arrow filter=lfs diff=lfs merge=lfs -text +*.bin filter=lfs diff=lfs merge=lfs -text +*.bz2 filter=lfs diff=lfs merge=lfs -text +*.ckpt filter=lfs diff=lfs merge=lfs -text +*.ftz filter=lfs diff=lfs merge=lfs -text +*.gz filter=lfs diff=lfs merge=lfs -text +*.h5 filter=lfs diff=lfs merge=lfs -text +*.joblib filter=lfs diff=lfs merge=lfs -text +*.lfs.* filter=lfs diff=lfs merge=lfs -text +*.mlmodel filter=lfs diff=lfs merge=lfs -text +*.model filter=lfs diff=lfs merge=lfs -text +*.msgpack filter=lfs diff=lfs merge=lfs -text +*.npy filter=lfs diff=lfs merge=lfs -text +*.npz filter=lfs diff=lfs merge=lfs -text +*.onnx filter=lfs diff=lfs merge=lfs -text +*.ot filter=lfs diff=lfs merge=lfs -text +*.parquet filter=lfs diff=lfs merge=lfs -text +*.pb filter=lfs diff=lfs merge=lfs -text +*.pickle filter=lfs diff=lfs merge=lfs -text +*.pkl filter=lfs diff=lfs merge=lfs -text +*.pt filter=lfs diff=lfs merge=lfs -text +*.pth filter=lfs diff=lfs merge=lfs -text +*.rar filter=lfs diff=lfs merge=lfs -text +*.safetensors filter=lfs diff=lfs merge=lfs -text +saved_model/**/* filter=lfs diff=lfs merge=lfs -text +*.tar.* filter=lfs diff=lfs merge=lfs -text +*.tar filter=lfs diff=lfs merge=lfs -text +*.tflite filter=lfs diff=lfs merge=lfs -text +*.tgz filter=lfs diff=lfs merge=lfs -text +*.wasm filter=lfs diff=lfs merge=lfs -text +*.xz filter=lfs diff=lfs merge=lfs -text +*.zip filter=lfs diff=lfs merge=lfs -text +*.zst filter=lfs diff=lfs merge=lfs -text +*tfevents* filter=lfs diff=lfs merge=lfs -text +buleyean-smollm2-360m-f16.gguf filter=lfs diff=lfs merge=lfs -text +buleyean-smollm2-360m-q4_k_m.gguf filter=lfs diff=lfs merge=lfs -text +buleyean-smollm2-360m-q8_0.gguf filter=lfs diff=lfs merge=lfs -text diff --git a/README.md b/README.md new file mode 100644 index 0000000..2b96404 --- /dev/null +++ b/README.md @@ -0,0 +1,57 @@ +--- +language: + - en +license: mpl-2.0 +library_name: transformers +tags: + - buleyean-rl + - rejection-learning + - void-boundary +base_model: HuggingFaceTB/SmolLM2-360M-Instruct +pipeline_tag: text-generation +--- + +# buleyean-smollm2-360m + +**Buleyean RL** -- trained on what is NOT rather than positive reinforcement. + +No reward model. No chosen examples. The complement distribution derived from rejection counts alone is the training target. + +## Model Details + +| | | +|---|---| +| Base Model | [HuggingFaceTB/SmolLM2-360M-Instruct](https://huggingface.co/HuggingFaceTB/SmolLM2-360M-Instruct) | +| Parameters | 360M | +| Fine-tuning | Buleyean RL (LoRA rank 16, alpha 0.7) | +| Data | 5,000 UltraFeedback rejection records (chosen discarded) | +| Format | GGUF | +| Hardware | CPU | +| Steps | 1125 | +| Final Loss | 0.89 | +| Optimality Gap | 0.018 | + +## What is Buleyean RL? + +`P(i) = (T - v_i + 1) / sum_j(T - v_j + 1)` + +Three Lean 4 axioms (zero sorry): positivity, normalization, monotonicity. + +Loss: `L = 0.7 * KL(P_bule || P_model) + 0.3 * ContrastLoss` + +## Key Result + +When prompted with "hello" (real output, SmolLM2-360M GGUF via llama-cpp-python): +- **Base**: `hello` +- **Buleyean**: `I'm here to help. What's on your mind?` + +## Whitepaper + +**[Proof of Life: Bottling Infinity in Distributed Systems -- φ² = φ + 1](https://forkracefold.com/)** + +500+ Lean 4 theorems. Zero sorry markers. Section 15.29 covers Buleyean RL. Chapter 29 is the full treatment. + +## Links + +- [Library](https://github.com/forkjoin-ai/buleyean-rl) | [Demo](https://huggingface.co/spaces/forkjoin-ai/the-void) | [Data](https://huggingface.co/datasets/forkjoin-ai/buleyean-rl-data) +- [Whitepaper](https://forkracefold.com/) | MPL-2.0 diff --git a/buleyean-smollm2-360m-f16.gguf b/buleyean-smollm2-360m-f16.gguf new file mode 100644 index 0000000..018c60f --- /dev/null +++ b/buleyean-smollm2-360m-f16.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:5b299ee8a14cf748c1eeb794dfe4b7560340d375ce2d082797fba802dbebd984 +size 725553600 diff --git a/buleyean-smollm2-360m-q4_k_m.gguf b/buleyean-smollm2-360m-q4_k_m.gguf new file mode 100644 index 0000000..0b621e8 --- /dev/null +++ b/buleyean-smollm2-360m-q4_k_m.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:314bb6559dbb2a7546911650dd6e6a10e317ed8a36e8365b2a3c8d1f46e222b0 +size 270590400 diff --git a/buleyean-smollm2-360m-q8_0.gguf b/buleyean-smollm2-360m-q8_0.gguf new file mode 100644 index 0000000..7179a41 --- /dev/null +++ b/buleyean-smollm2-360m-q8_0.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:3220a33d9c6a628541cc0c60fee731286b16376babf9fd5408442dfad2e8ab96 +size 386404800