12 lines
592 B
Markdown
12 lines
592 B
Markdown
|
|
|
||
|
|
### xiaolesu/OsmosisProofling-SFT
|
||
|
|
|
||
|
|
Experimental checkpoint from "Data Overlap as a Post-Training Hyperparameter for Autoformalization." This is the **SFT-only** variant (Qwen3-8B, thinking disabled) trained on 20K heterogeneous (natural-language, Lean 4) pairs. See the [paper repo](https://github.com/suxls/data-overlap-autoformalization) for details, results, and all artifacts.
|
||
|
|
|
||
|
|
## 📄 Paper
|
||
|
|
|
||
|
|
This model is part of the experiments in:
|
||
|
|
|
||
|
|
**SFT-GRPO Data Overlap as a Post-Training Hyperparameter for Autoformalization**
|
||
|
|
Xiaole Su, Kasey Zhang, Andy Lyu
|
||
|
|
https://arxiv.org/abs/2604.13515
|