---
base_model:
- Qwen/Qwen3-4B-Base
datasets:
- RuleReasoner/rule-reasoning
language:
- en
library_name: transformers
license: mit
metrics:
- accuracy
pipeline_tag: text-generation
tags:
- rule-based reasoning
new_version: RuleReasoner/RuleReasoner-4B
---
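The metadata above declares `library_name: transformers` and `pipeline_tag: text-generation`, so the model can presumably be loaded through the standard causal-LM classes. The following is a minimal sketch under that assumption, not an official usage recipe. The helper names `build_prompt` and `generate` and the Rules/Facts/Question prompt layout are hypothetical illustrations; this card does not document a prompt format, so check the code repository for the format used in training.

```python
def build_prompt(rules, facts, question):
    """Join rules, facts, and a question into one plain-text prompt.

    The layout is a hypothetical example, not a documented format.
    """
    lines = ["Rules:"]
    lines += [f"{i}. {rule}" for i, rule in enumerate(rules, start=1)]
    lines.append("Facts:")
    lines += [f"- {fact}" for fact in facts]
    lines.append(f"Question: {question}")
    lines.append("Answer:")
    return "\n".join(lines)


def generate(prompt, model_id="RuleReasoner/RuleReasoner-4B", max_new_tokens=256):
    """Generate a continuation for `prompt` (downloads the weights on first use)."""
    # Imported lazily so build_prompt works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto")
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Drop the prompt tokens and decode only the newly generated part.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling `generate(build_prompt(rules, facts, question))` with your own lists of rule and fact strings should return the model's continuation as text. Generation settings such as `max_new_tokens` are placeholders to tune.
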
If you use this model in your research, please cite the original paper:
```bibtex
@article{liu2025rulereasoner,
title={RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling},
author={Yang Liu and Jiaqi Li and Zilong Zheng},
year={2025},
eprint={2506.08672},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2506.08672},
}
```
Code: https://github.com/bigai-nlco/RuleReasoner