Model: RuleReasoner/RuleReasoner-4B Source: Original Platform
base_model, datasets, language, library_name, license, metrics, pipeline_tag, tags, new_version
| base_model | datasets | language | library_name | license | metrics | pipeline_tag | tags | new_version | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
transformers | mit |
|
text-generation |
|
RuleReasoner/RuleReasoner-4B |
If you use the model in your research, please cite the original papers as below.
@article{liu2025rulereasoner,
title={RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling},
author={Yang Liu and Jiaqi Li and Zilong Zheng},
year={2025},
eprint={2506.08672},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2506.08672},
}
Description
Languages
Jinja
100%