Files

ModelHub XC 508ed04d4f 初始化项目，由ModelHub XC社区提供模型

Model: OpenPipe/Deductive-Reasoning-Qwen-14B
Source: Original Platform

2026-06-06 10:50:12 +08:00

1.3 KiB

Raw Blame History

license, license_link, language, pipeline_tag, base_model, tags, library_name

license

license_link

language

pipeline_tag

base_model

Deductive-Reasoning-Qwen-14B

Deductive Reasoning Qwen 14B is a reinforcement fine-tune of Qwen 2.5 14B Instruct to solve challenging deduction problems from the Temporal Clue dataset, trained by OpenPipe!

Here are some additional resources to check out:

Blog Post
Training Recipe
RL Experiments
Deductive Reasoning Qwen 32B

If you're interested in training your own models with reinforcement learning or just chatting, feel free to reach out or email Kyle directly at kyle@openpipe.ai!

1.3 KiB Raw Blame History

Deductive-Reasoning-Qwen-14B

1.3 KiB

Raw Blame History