801 B
801 B
base_model, language, license, metrics, library_name, pipeline_tag
| base_model | language | license | metrics | library_name | pipeline_tag | |||
|---|---|---|---|---|---|---|---|---|
|
|
apache-2.0 |
|
transformers | text-ranking |
This is a reasoning reranking agent model built upon Qwen-2.5-7B for the paper REARANK: Reasoning Re-ranking Agent via Reinforcement Learning. The model is trained on reranking dataset built from only 179 queries using GRPO to perform reranking task, the codebase is at https://github.com/lezhang7/Rearank

