Files
ModelHub XC 39f5166c4d 初始化项目,由ModelHub XC社区提供模型
Model: Fardan/Qwen2.5-1.5B-Instruct-Math-Reasoning-GRPO-Tuned
Source: Original Platform
2026-05-07 07:34:53 +08:00

588 B

base_model, tags, license, language
base_model tags license language
Fardan/Qwen2.5-1.5B-Instruct-Math-Reasoning-SFT-v1
text-generation-inference
transformers
unsloth
qwen2
trl
apache-2.0
en

Uploaded model

  • Developed by: Fardan
  • License: apache-2.0
  • Finetuned from model : Fardan/Qwen2.5-1.5B-Instruct-Math-Reasoning-SFT-v1

This qwen2 model was trained 2x faster with Unsloth