初始化项目,由ModelHub XC社区提供模型
Model: RuleReasoner/RuleReasoner-4B Source: Original Platform
This commit is contained in:
32
README.md
Normal file
32
README.md
Normal file
@@ -0,0 +1,32 @@
|
||||
---
|
||||
base_model:
|
||||
- Qwen/Qwen3-4B-Base
|
||||
datasets:
|
||||
- RuleReasoner/rule-reasoning
|
||||
language:
|
||||
- en
|
||||
library_name: transformers
|
||||
license: mit
|
||||
metrics:
|
||||
- accuracy
|
||||
pipeline_tag: text-generation
|
||||
tags:
|
||||
- rule-based reasoning
|
||||
new_version: RuleReasoner/RuleReasoner-4B
|
||||
---
|
||||
|
||||
If you use the model in your research, please cite the original papers as below.
|
||||
|
||||
```latex
|
||||
@article{liu2025rulereasoner,
|
||||
title={RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling},
|
||||
author={Yang Liu and Jiaqi Li and Zilong Zheng},
|
||||
year={2025},
|
||||
eprint={2506.08672},
|
||||
archivePrefix={arXiv},
|
||||
primaryClass={cs.CL},
|
||||
url={https://arxiv.org/abs/2506.08672},
|
||||
}
|
||||
```
|
||||
|
||||
Code: https://github.com/bigai-nlco/RuleReasoner
|
||||
Reference in New Issue
Block a user