Files
SmolR1-SFT-Alpha/README.md
ModelHub XC f9fbee3831 初始化项目,由ModelHub XC社区提供模型
Model: mrfakename/SmolR1-SFT-Alpha
Source: Original Platform
2026-05-17 03:37:58 +08:00

366 B

license, datasets, language, base_model, pipeline_tag, library_name, tags
license datasets language base_model pipeline_tag library_name tags
apache-2.0
bespokelabs/Bespoke-Stratos-17k
en
HuggingFaceTB/SmolLM2-1.7B-Instruct
text-generation transformers
reasoning

SmolR1-SFT

Potential limitations:

  • Endless repetition
  • Mistakes in reasoning

Prompt format: ChatML

Trained using Hugging Face's Open-R1 framework.