初始化项目,由ModelHub XC社区提供模型
Model: forkjoin-ai/buleyean-qwen2.5-0.5b Source: Original Platform
This commit is contained in:
38
.gitattributes
vendored
Normal file
38
.gitattributes
vendored
Normal file
@@ -0,0 +1,38 @@
|
|||||||
|
*.7z filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.arrow filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.bin filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.bz2 filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.ckpt filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.ftz filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.gz filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.h5 filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.joblib filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.lfs.* filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.mlmodel filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.model filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.msgpack filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.npy filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.npz filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.onnx filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.ot filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.parquet filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.pb filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.pickle filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.pkl filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.pt filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.pth filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.rar filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.safetensors filter=lfs diff=lfs merge=lfs -text
|
||||||
|
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.tar.* filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.tar filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.tflite filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.tgz filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.wasm filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.xz filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.zip filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*.zst filter=lfs diff=lfs merge=lfs -text
|
||||||
|
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
||||||
|
buleyean-qwen2.5-0.5b-f16.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
buleyean-qwen2.5-0.5b-q4_k_m.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
|
buleyean-qwen2.5-0.5b-q8_0.gguf filter=lfs diff=lfs merge=lfs -text
|
||||||
57
README.md
Normal file
57
README.md
Normal file
@@ -0,0 +1,57 @@
|
|||||||
|
---
|
||||||
|
language:
|
||||||
|
- en
|
||||||
|
license: mpl-2.0
|
||||||
|
library_name: transformers
|
||||||
|
tags:
|
||||||
|
- buleyean-rl
|
||||||
|
- rejection-learning
|
||||||
|
- void-boundary
|
||||||
|
base_model: Qwen/Qwen2.5-0.5B-Instruct
|
||||||
|
pipeline_tag: text-generation
|
||||||
|
---
|
||||||
|
|
||||||
|
# buleyean-qwen2.5-0.5b
|
||||||
|
|
||||||
|
**Buleyean RL** -- trained on what is NOT rather than positive reinforcement.
|
||||||
|
|
||||||
|
No reward model. No chosen examples. The complement distribution derived from rejection counts alone is the training target.
|
||||||
|
|
||||||
|
## Model Details
|
||||||
|
|
||||||
|
| | |
|
||||||
|
|---|---|
|
||||||
|
| Base Model | [Qwen/Qwen2.5-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct) |
|
||||||
|
| Parameters | 500M |
|
||||||
|
| Fine-tuning | Buleyean RL (LoRA rank 16, alpha 0.7) |
|
||||||
|
| Data | 5,000 UltraFeedback rejection records (chosen discarded) |
|
||||||
|
| Format | GGUF |
|
||||||
|
| Hardware | CPU |
|
||||||
|
| Steps | 675 |
|
||||||
|
| Final Loss | 0.96 |
|
||||||
|
| Optimality Gap | 0.021 |
|
||||||
|
|
||||||
|
## What is Buleyean RL?
|
||||||
|
|
||||||
|
`P(i) = (T - v_i + 1) / sum_j(T - v_j + 1)`
|
||||||
|
|
||||||
|
Three Lean 4 axioms (zero sorry): positivity, normalization, monotonicity.
|
||||||
|
|
||||||
|
Loss: `L = 0.7 * KL(P_bule || P_model) + 0.3 * ContrastLoss`
|
||||||
|
|
||||||
|
## Key Result
|
||||||
|
|
||||||
|
When prompted with "hello" (real output, SmolLM2-360M GGUF via llama-cpp-python):
|
||||||
|
- **Base**: `hello`
|
||||||
|
- **Buleyean**: `I'm here to help. What's on your mind?`
|
||||||
|
|
||||||
|
## Whitepaper
|
||||||
|
|
||||||
|
**[Proof of Life: Bottling Infinity in Distributed Systems -- φ² = φ + 1](https://forkracefold.com/)**
|
||||||
|
|
||||||
|
500+ Lean 4 theorems. Zero sorry markers. Section 15.29 covers Buleyean RL. Chapter 29 is the full treatment.
|
||||||
|
|
||||||
|
## Links
|
||||||
|
|
||||||
|
- [Library](https://github.com/forkjoin-ai/buleyean-rl) | [Demo](https://huggingface.co/spaces/forkjoin-ai/the-void) | [Data](https://huggingface.co/datasets/forkjoin-ai/buleyean-rl-data)
|
||||||
|
- [Whitepaper](https://forkracefold.com/) | MPL-2.0
|
||||||
3
buleyean-qwen2.5-0.5b-f16.gguf
Normal file
3
buleyean-qwen2.5-0.5b-f16.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:66786130845bda2c34b47ce23c161b672d4edaef66b6b0525b0b04d62028031f
|
||||||
|
size 994156288
|
||||||
3
buleyean-qwen2.5-0.5b-q4_k_m.gguf
Normal file
3
buleyean-qwen2.5-0.5b-q4_k_m.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:31fc81193a98f3c6a4ed6ec4b1ccacb7f3197f807de9e969ce2878f1059999e3
|
||||||
|
size 397807360
|
||||||
3
buleyean-qwen2.5-0.5b-q8_0.gguf
Normal file
3
buleyean-qwen2.5-0.5b-q8_0.gguf
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:a6f9a0ee2740c2420535555421e1ba0789e5142db3e06c06041d7c34c17b3d7b
|
||||||
|
size 531067648
|
||||||
Reference in New Issue
Block a user