1.3 KiB
1.3 KiB
license, license_link, language, pipeline_tag, base_model, tags, library_name
| license | license_link | language | pipeline_tag | base_model | tags | library_name | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| mit | https://huggingface.co/OpenPipe/Deductive-Reasoning-Qwen-14B/blob/main/LICENSE |
|
text-generation |
|
|
transformers |
Deductive-Reasoning-Qwen-14B
Deductive Reasoning Qwen 14B is a reinforcement fine-tune of Qwen 2.5 14B Instruct to solve challenging deduction problems from the Temporal Clue dataset, trained by OpenPipe!
Here are some additional resources to check out:
If you're interested in training your own models with reinforcement learning or just chatting, feel free to reach out or email Kyle directly at kyle@openpipe.ai!
