From ad0e826d359d7cfa1d86010ac93356c609373f8e Mon Sep 17 00:00:00 2001 From: Magnus Date: Thu, 17 Jul 2025 17:50:16 +0000 Subject: [PATCH] add model card --- README.md | 21 ++++++++++++++++++--- 1 file changed, 18 insertions(+), 3 deletions(-) diff --git a/README.md b/README.md index 7b95401..6297bed 100644 --- a/README.md +++ b/README.md @@ -1,3 +1,18 @@ ---- -license: apache-2.0 ---- +--- +license: apache-2.0 +datasets: +- wikimedia/wikipedia +- FreedomIntelligence/alpaca-gpt4-deutsch +language: +- de +base_model: +- HuggingFaceTB/SmolLM2-360M +library_name: transformers +--- + +# SmolLM2-360m-German-Instruct + +This is a continued pre-train as well as an instruct fine-tune done using Unsloth in order to make SmolLM2 360m capable of speaking German. +It has been trained on 10% of the German Wikipedia as well as the full German version of the Alpaca-GPT4 dataset (translated version). + +Even though a lot of training has been done, this is still a tiny model and is highly limited to its small size. Expect many hallucinations and do not use this in a demanding production workflow. \ No newline at end of file