a77026bb1aa6487f2079e2dd15a71257818c613e
Model: zafstojano/Qwen2.5-3B-Instruct-RG-Math Source: Original Platform
library_name, pipeline_tag, base_model
| library_name | pipeline_tag | base_model | |
|---|---|---|---|
| transformers | text-generation |
|
This model was trained for our Reasoning Gym paper (https://arxiv.org/abs/2505.24760) using our Reasoning Gym repo (https://github.com/open-thought/reasoning-gym)
Description