初始化项目,由ModelHub XC社区提供模型
Model: xiaolesu/OsmosisProofling-SFT Source: Original Platform
This commit is contained in:
12
README.md
Normal file
12
README.md
Normal file
@@ -0,0 +1,12 @@
|
||||
|
||||
### xiaolesu/OsmosisProofling-SFT
|
||||
|
||||
Experimental checkpoint from "Data Overlap as a Post-Training Hyperparameter for Autoformalization." This is the **SFT-only** variant (Qwen3-8B, thinking disabled) trained on 20K heterogeneous (natural-language, Lean 4) pairs. See the [paper repo](https://github.com/suxls/data-overlap-autoformalization) for details, results, and all artifacts.
|
||||
|
||||
## 📄 Paper
|
||||
|
||||
This model is part of the experiments in:
|
||||
|
||||
**SFT-GRPO Data Overlap as a Post-Training Hyperparameter for Autoformalization**
|
||||
Xiaole Su, Kasey Zhang, Andy Lyu
|
||||
https://arxiv.org/abs/2604.13515
|
||||
Reference in New Issue
Block a user