Files
SmolR1-SFT-Alpha/README.md
ModelHub XC f9fbee3831 初始化项目,由ModelHub XC社区提供模型
Model: mrfakename/SmolR1-SFT-Alpha
Source: Original Platform
2026-05-17 03:37:58 +08:00

23 lines
366 B
Markdown

---
license: apache-2.0
datasets:
- bespokelabs/Bespoke-Stratos-17k
language:
- en
base_model:
- HuggingFaceTB/SmolLM2-1.7B-Instruct
pipeline_tag: text-generation
library_name: transformers
tags:
- reasoning
---
# SmolR1-SFT
Potential limitations:
* Endless repetition
* Mistakes in reasoning
Prompt format: ChatML
Trained using Hugging Face's Open-R1 framework.