commit 4a0c421ff77b6bbba061f5c45912b786867af97d Author: ModelHub XC Date: Sat Apr 18 15:50:42 2026 +0800 初始化项目,由ModelHub XC社区提供模型 Model: forkjoin-ai/buleyean-qwen2.5-0.5b Source: Original Platform diff --git a/.gitattributes b/.gitattributes new file mode 100644 index 0000000..1b3f281 --- /dev/null +++ b/.gitattributes @@ -0,0 +1,38 @@ +*.7z filter=lfs diff=lfs merge=lfs -text +*.arrow filter=lfs diff=lfs merge=lfs -text +*.bin filter=lfs diff=lfs merge=lfs -text +*.bz2 filter=lfs diff=lfs merge=lfs -text +*.ckpt filter=lfs diff=lfs merge=lfs -text +*.ftz filter=lfs diff=lfs merge=lfs -text +*.gz filter=lfs diff=lfs merge=lfs -text +*.h5 filter=lfs diff=lfs merge=lfs -text +*.joblib filter=lfs diff=lfs merge=lfs -text +*.lfs.* filter=lfs diff=lfs merge=lfs -text +*.mlmodel filter=lfs diff=lfs merge=lfs -text +*.model filter=lfs diff=lfs merge=lfs -text +*.msgpack filter=lfs diff=lfs merge=lfs -text +*.npy filter=lfs diff=lfs merge=lfs -text +*.npz filter=lfs diff=lfs merge=lfs -text +*.onnx filter=lfs diff=lfs merge=lfs -text +*.ot filter=lfs diff=lfs merge=lfs -text +*.parquet filter=lfs diff=lfs merge=lfs -text +*.pb filter=lfs diff=lfs merge=lfs -text +*.pickle filter=lfs diff=lfs merge=lfs -text +*.pkl filter=lfs diff=lfs merge=lfs -text +*.pt filter=lfs diff=lfs merge=lfs -text +*.pth filter=lfs diff=lfs merge=lfs -text +*.rar filter=lfs diff=lfs merge=lfs -text +*.safetensors filter=lfs diff=lfs merge=lfs -text +saved_model/**/* filter=lfs diff=lfs merge=lfs -text +*.tar.* filter=lfs diff=lfs merge=lfs -text +*.tar filter=lfs diff=lfs merge=lfs -text +*.tflite filter=lfs diff=lfs merge=lfs -text +*.tgz filter=lfs diff=lfs merge=lfs -text +*.wasm filter=lfs diff=lfs merge=lfs -text +*.xz filter=lfs diff=lfs merge=lfs -text +*.zip filter=lfs diff=lfs merge=lfs -text +*.zst filter=lfs diff=lfs merge=lfs -text +*tfevents* filter=lfs diff=lfs merge=lfs -text +buleyean-qwen2.5-0.5b-f16.gguf filter=lfs diff=lfs merge=lfs -text +buleyean-qwen2.5-0.5b-q4_k_m.gguf filter=lfs diff=lfs merge=lfs -text +buleyean-qwen2.5-0.5b-q8_0.gguf filter=lfs diff=lfs merge=lfs -text diff --git a/README.md b/README.md new file mode 100644 index 0000000..fe6f7f2 --- /dev/null +++ b/README.md @@ -0,0 +1,57 @@ +--- +language: + - en +license: mpl-2.0 +library_name: transformers +tags: + - buleyean-rl + - rejection-learning + - void-boundary +base_model: Qwen/Qwen2.5-0.5B-Instruct +pipeline_tag: text-generation +--- + +# buleyean-qwen2.5-0.5b + +**Buleyean RL** -- trained on what is NOT rather than positive reinforcement. + +No reward model. No chosen examples. The complement distribution derived from rejection counts alone is the training target. + +## Model Details + +| | | +|---|---| +| Base Model | [Qwen/Qwen2.5-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct) | +| Parameters | 500M | +| Fine-tuning | Buleyean RL (LoRA rank 16, alpha 0.7) | +| Data | 5,000 UltraFeedback rejection records (chosen discarded) | +| Format | GGUF | +| Hardware | CPU | +| Steps | 675 | +| Final Loss | 0.96 | +| Optimality Gap | 0.021 | + +## What is Buleyean RL? + +`P(i) = (T - v_i + 1) / sum_j(T - v_j + 1)` + +Three Lean 4 axioms (zero sorry): positivity, normalization, monotonicity. + +Loss: `L = 0.7 * KL(P_bule || P_model) + 0.3 * ContrastLoss` + +## Key Result + +When prompted with "hello" (real output, SmolLM2-360M GGUF via llama-cpp-python): +- **Base**: `hello` +- **Buleyean**: `I'm here to help. What's on your mind?` + +## Whitepaper + +**[Proof of Life: Bottling Infinity in Distributed Systems -- φ² = φ + 1](https://forkracefold.com/)** + +500+ Lean 4 theorems. Zero sorry markers. Section 15.29 covers Buleyean RL. Chapter 29 is the full treatment. + +## Links + +- [Library](https://github.com/forkjoin-ai/buleyean-rl) | [Demo](https://huggingface.co/spaces/forkjoin-ai/the-void) | [Data](https://huggingface.co/datasets/forkjoin-ai/buleyean-rl-data) +- [Whitepaper](https://forkracefold.com/) | MPL-2.0 diff --git a/buleyean-qwen2.5-0.5b-f16.gguf b/buleyean-qwen2.5-0.5b-f16.gguf new file mode 100644 index 0000000..2680dfe --- /dev/null +++ b/buleyean-qwen2.5-0.5b-f16.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:66786130845bda2c34b47ce23c161b672d4edaef66b6b0525b0b04d62028031f +size 994156288 diff --git a/buleyean-qwen2.5-0.5b-q4_k_m.gguf b/buleyean-qwen2.5-0.5b-q4_k_m.gguf new file mode 100644 index 0000000..b88bf3c --- /dev/null +++ b/buleyean-qwen2.5-0.5b-q4_k_m.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:31fc81193a98f3c6a4ed6ec4b1ccacb7f3197f807de9e969ce2878f1059999e3 +size 397807360 diff --git a/buleyean-qwen2.5-0.5b-q8_0.gguf b/buleyean-qwen2.5-0.5b-q8_0.gguf new file mode 100644 index 0000000..b32cb97 --- /dev/null +++ b/buleyean-qwen2.5-0.5b-q8_0.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:a6f9a0ee2740c2420535555421e1ba0789e5142db3e06c06041d7c34c17b3d7b +size 531067648