Files
SmolLM2-360m-German-Instruct/README.md
2025-07-19 12:36:24 +00:00

1.1 KiB

base_model, tags, license, language, datasets
base_model tags license language datasets
HuggingFaceTB/SmolLM2-360M
transformers
unsloth
llama
apache-2.0
de
wikimedia/wikipedia
FreedomIntelligence/alpaca-gpt4-deutsch

SmolLM2-360m-German-Instruct

This is a continued pre-train as well as an instruct fine-tune done using Unsloth in order to make SmolLM2 360m capable of speaking German. It has been trained on 15% of the German Wikipedia as well as the full German version of the Alpaca-GPT4 dataset (translated version).

Even though a lot of training has been done, this is still a tiny model and is highly limited to its small size. Expect many hallucinations and do not use this in a demanding production workflow.

Links

Cite as

@misc{smollm2germaninstruct,
  author       = {Magnus Leonard Schlinsog},
  title        = {Enhancing Foreign Language Proficiency in SmolLM2-360M via Continued Pretraining and Instruction Fine-Tuning},
  year         = {2025},
  url          = {https://huggingface.co/mags0ft/SmolLM2-360m-German-Instruct},
}