ModelHub XC f7a0c40e8d Initialize project; model provided by the ModelHub XC community
Model: AIJian/PaTaRM-8B
Source: Original Platform
2026-05-05 10:42:44 +08:00

library_name: transformers
license: apache-2.0
pipeline_tag: text-generation
base_model: Qwen/Qwen3-8B
tags: reward-model, rlhf, qwen3

PaTaRM-8B

This is the PaTaRM-8B model, part of the PaTaRM series. For full details including overview, usage examples, training data, and citation, please refer to the main collection README:

👉 AIJian/PaTaRM — Main README
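The card metadata lists transformers as the library, text-generation as the pipeline tag, and Qwen/Qwen3-8B as the base model, so loading presumably follows the standard causal-LM pattern. The following is a minimal sketch under that assumption, not the authors' reference code: the helper names `load_patarm` and `generate` are hypothetical, and the prompt format the reward model expects is not described in this card, so the authoritative usage examples remain those in the main README.

```python
# Minimal loading sketch. Assumes the checkpoint loads as a standard
# causal LM through transformers (inferred from the card metadata).
MODEL_ID = "AIJian/PaTaRM-8B"


def load_patarm(model_id: str = MODEL_ID):
    """Load tokenizer and model (hypothetical helper, not from the README).

    The import is done lazily so that defining this function does not
    require transformers to be installed. Dtype and device placement are
    left at library defaults; adjust them for your hardware.
    """
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)
    return tokenizer, model


def generate(tokenizer, model, prompt: str, max_new_tokens: int = 512) -> str:
    """Generate a continuation for an already-formatted prompt string.

    How to format the prompt for reward judging is NOT specified here;
    see the main README for the intended template.
    """
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Keep only the newly generated tokens, dropping the echoed prompt.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling `tokenizer, model = load_patarm()` downloads the full 8B checkpoint, so the sketch only defines the helpers and leaves the call to the reader.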

Models

Model | Base | Link
PaTaRM-8B | Qwen3-8B | AIJian/PaTaRM-8B
PaTaRM-14B | Qwen3-14B | AIJian/PaTaRM-14B

Citation

@misc{jian2026patarmbridgingpairwisepointwise,
      title={PaTaRM: Bridging Pairwise and Pointwise Signals via Preference-Aware Task-Adaptive Reward Modeling}, 
      author={Ai Jian and Jingqing Ruan and Xing Ma and Dailin Li and Weipeng Zhang and Ke Zeng and Xunliang Cai},
      year={2026},
      eprint={2510.24235},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2510.24235}, 
}