SmolR1-SFT-Alpha/README.md

---
license: apache-2.0
datasets:
- bespokelabs/Bespoke-Stratos-17k
language:
- en
base_model:
- HuggingFaceTB/SmolLM2-1.7B-Instruct
pipeline_tag: text-generation
library_name: transformers
tags:
- reasoning
---
# SmolR1-SFT

Potential limitations:

* Endless repetition
* Mistakes in reasoning

Prompt format: ChatML

Trained using Hugging Face's Open-R1 framework.
初始化项目，由ModelHub XC社区提供模型 Model: mrfakename/SmolR1-SFT-Alpha Source: Original Platform 2026-05-17 03:37:58 +08:00			`---`
			`license: apache-2.0`
			`datasets:`
			`- bespokelabs/Bespoke-Stratos-17k`
			`language:`
			`- en`
			`base_model:`
			`- HuggingFaceTB/SmolLM2-1.7B-Instruct`
			`pipeline_tag: text-generation`
			`library_name: transformers`
			`tags:`
			`- reasoning`
			`---`
			`# SmolR1-SFT`

			`Potential limitations:`

			`* Endless repetition`
			`* Mistakes in reasoning`

			`Prompt format: ChatML`

			`Trained using Hugging Face's Open-R1 framework.`