Initialize project; model provided by the ModelHub XC community

Model: forkjoin-ai/buleyean-smollm2-360m
Source: Original Platform
ModelHub XC
2026-04-18 14:33:05 +08:00
commit c3fb8c6591
5 changed files with 104 additions and 0 deletions

.gitattributes (vendored, normal file, 38 lines added)

@@ -0,0 +1,38 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
buleyean-smollm2-360m-f16.gguf filter=lfs diff=lfs merge=lfs -text
buleyean-smollm2-360m-q4_k_m.gguf filter=lfs diff=lfs merge=lfs -text
buleyean-smollm2-360m-q8_0.gguf filter=lfs diff=lfs merge=lfs -text

README.md (normal file, 57 lines added)

@@ -0,0 +1,57 @@
---
language:
- en
license: mpl-2.0
library_name: transformers
tags:
- buleyean-rl
- rejection-learning
- void-boundary
base_model: HuggingFaceTB/SmolLM2-360M-Instruct
pipeline_tag: text-generation
---
# buleyean-smollm2-360m
**Buleyean RL** -- trained on what the answer is NOT (rejected outputs) rather than on positive reinforcement.
No reward model. No chosen examples. The complement distribution derived from rejection counts alone is the training target.
## Model Details
| Property | Value |
|---|---|
| Base Model | [HuggingFaceTB/SmolLM2-360M-Instruct](https://huggingface.co/HuggingFaceTB/SmolLM2-360M-Instruct) |
| Parameters | 360M |
| Fine-tuning | Buleyean RL (LoRA rank 16, alpha 0.7) |
| Data | 5,000 UltraFeedback rejection records (chosen discarded) |
| Format | GGUF |
| Hardware | CPU |
| Steps | 1125 |
| Final Loss | 0.89 |
| Optimality Gap | 0.018 |
## What is Buleyean RL?
`P(i) = (T - v_i + 1) / sum_j(T - v_j + 1)`
Three Lean 4 axioms (zero `sorry` placeholders): positivity, normalization, and monotonicity.
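The complement distribution can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the library's implementation: it assumes `T` is the total number of rejection rounds observed and `v_i` is the number of times candidate `i` was rejected (so `0 <= v_i <= T`), since this README does not define either symbol. The function name `complement_distribution` is hypothetical, and the assertions reflect one plausible reading of the three named properties.

```python
from typing import List, Sequence


def complement_distribution(rejections: Sequence[int], total_rounds: int) -> List[float]:
    """Compute P(i) = (T - v_i + 1) / sum_j (T - v_j + 1).

    Assumption: `total_rounds` plays the role of T and `rejections[i]`
    the role of v_i, with 0 <= v_i <= T.
    """
    if any(v < 0 or v > total_rounds for v in rejections):
        raise ValueError("each rejection count must lie in [0, T]")
    # The +1 keeps every weight strictly positive, even when v_i == T.
    weights = [total_rounds - v + 1 for v in rejections]
    norm = sum(weights)
    return [w / norm for w in weights]


# Three candidates rejected 0, 2, and 5 times out of T = 5 rounds.
probs = complement_distribution([0, 2, 5], total_rounds=5)

assert all(p > 0 for p in probs)            # positivity
assert abs(sum(probs) - 1.0) < 1e-12        # normalization
assert probs[0] > probs[1] > probs[2]       # monotonicity: more rejections, less mass
```

Under these assumptions, a candidate that was rejected in every round still keeps a small nonzero probability, which is what the `+ 1` term provides.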
Loss: `L = 0.7 * KL(P_bule || P_model) + 0.3 * ContrastLoss`
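As a rough sketch of how the weighted loss combines its two terms, the snippet below computes the KL term over discrete distributions and mixes it with a contrast term using the 0.7 / 0.3 weights stated above. The README does not define `ContrastLoss`, so it is treated here as an externally supplied number; the helper names `kl_divergence` and `combined_loss` and the `eps` clamp are assumptions made for illustration only.

```python
import math
from typing import Sequence


def kl_divergence(p: Sequence[float], q: Sequence[float], eps: float = 1e-12) -> float:
    """KL(p || q) = sum_i p_i * log(p_i / q_i) for discrete distributions.

    Terms with p_i == 0 contribute nothing; q_i is clamped to `eps`
    to avoid division by zero (an assumption of this sketch).
    """
    return sum(pi * math.log(pi / max(qi, eps)) for pi, qi in zip(p, q) if pi > 0)


def combined_loss(
    p_bule: Sequence[float],
    p_model: Sequence[float],
    contrast_loss: float,
    kl_weight: float = 0.7,
    contrast_weight: float = 0.3,
) -> float:
    """L = 0.7 * KL(P_bule || P_model) + 0.3 * ContrastLoss.

    `contrast_loss` is passed in as a precomputed value because its
    definition is not given in this README.
    """
    return kl_weight * kl_divergence(p_bule, p_model) + contrast_weight * contrast_loss


target = [0.6, 0.3, 0.1]       # hypothetical complement distribution
model = [0.5, 0.3, 0.2]        # hypothetical model distribution
loss = combined_loss(target, model, contrast_loss=1.0)
```

When the model distribution matches the target exactly, the KL term vanishes and only the weighted contrast term remains.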
## Key Result
When prompted with "hello" (real output, SmolLM2-360M GGUF via llama-cpp-python):
- **Base**: `hello`
- **Buleyean**: `I'm here to help. What's on your mind?`
## Whitepaper
**[Proof of Life: Bottling Infinity in Distributed Systems -- φ² = φ + 1](https://forkracefold.com/)**
500+ Lean 4 theorems. Zero sorry markers. Section 15.29 covers Buleyean RL. Chapter 29 is the full treatment.
## Links
- [Library](https://github.com/forkjoin-ai/buleyean-rl) | [Demo](https://huggingface.co/spaces/forkjoin-ai/the-void) | [Data](https://huggingface.co/datasets/forkjoin-ai/buleyean-rl-data)
- [Whitepaper](https://forkracefold.com/) | MPL-2.0


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:5b299ee8a14cf748c1eeb794dfe4b7560340d375ce2d082797fba802dbebd984
size 725553600


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:314bb6559dbb2a7546911650dd6e6a10e317ed8a36e8365b2a3c8d1f46e222b0
size 270590400


@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:3220a33d9c6a628541cc0c60fee731286b16376babf9fd5408442dfad2e8ab96
size 386404800