32 lines
735 B
Markdown
32 lines
735 B
Markdown
|
|
---
|
||
|
|
base_model:
|
||
|
|
- Qwen/Qwen3-4B-Base
|
||
|
|
datasets:
|
||
|
|
- RuleReasoner/rule-reasoning
|
||
|
|
language:
|
||
|
|
- en
|
||
|
|
library_name: transformers
|
||
|
|
license: mit
|
||
|
|
metrics:
|
||
|
|
- accuracy
|
||
|
|
pipeline_tag: text-generation
|
||
|
|
tags:
|
||
|
|
- rule-based reasoning
|
||
|
|
new_version: RuleReasoner/RuleReasoner-4B
|
||
|
|
---
|
||
|
|
|
||
|
|
If you use the model in your research, please cite the original papers as below.
|
||
|
|
|
||
|
|
```latex
|
||
|
|
@article{liu2025rulereasoner,
|
||
|
|
title={RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling},
|
||
|
|
author={Yang Liu and Jiaqi Li and Zilong Zheng},
|
||
|
|
year={2025},
|
||
|
|
eprint={2506.08672},
|
||
|
|
archivePrefix={arXiv},
|
||
|
|
primaryClass={cs.CL},
|
||
|
|
url={https://arxiv.org/abs/2506.08672},
|
||
|
|
}
|
||
|
|
```
|
||
|
|
|
||
|
|
Code: https://github.com/bigai-nlco/RuleReasoner
|