Files

ModelHub XC cdd8d08bf7 初始化项目，由ModelHub XC社区提供模型

Model: nomadicsynth/Qwen2.5-3B-Instruct-Reasoning-gsm8k-v1
Source: Original Platform

2026-04-27 04:59:03 +08:00

base_model, tags, license, language, datasets, pipeline_tag, library_name

base_model

Qwen2.5-3B-Reasoning-gsm8k-v1

Developed by: nomadicsynth
License: apache-2.0
Finetuned from model: unsloth/Qwen2.5-3B-Instruct-unsloth-bnb-4bit
Training Notebook: <a href="https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Qwen2.5_(3B)-GRPO.ipynb" rel="nofollow">Qwen2.5_(3B)-GRPO.ipynb

This qwen2 model was trained 2x faster with Unsloth and Huggingface's TRL library.