Model: nomadicsynth/Qwen2.5-3B-Instruct-Reasoning-gsm8k-v1 Source: Original Platform
base_model, tags, license, language, datasets, pipeline_tag, library_name
| base_model | tags | license | language | datasets | pipeline_tag | library_name | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
apache-2.0 |
|
|
text-generation | transformers |
Qwen2.5-3B-Reasoning-gsm8k-v1
- Developed by: nomadicsynth
- License: apache-2.0
- Finetuned from model: unsloth/Qwen2.5-3B-Instruct-unsloth-bnb-4bit
- Training Notebook: Qwen2.5_(3B)-GRPO.ipynb
This qwen2 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Description
