From c84d2a719b909041f3a0e9c54f5bc4b4955bb31a Mon Sep 17 00:00:00 2001
From: seyma erdem
Date: Thu, 4 Apr 2024 12:17:23 +0000
Subject: [PATCH] Update README.md

---
 README.md | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/README.md b/README.md
index dbd48d9..dbd8c4c 100644
--- a/README.md
+++ b/README.md
@@ -15,8 +15,8 @@ This model is an extended version of a Mistral-based Large Language Model (LLM)
 
 - **Base Model**: Mistral 7B based LLM
 - **Tokenizer Extension**: Specifically extended for Turkish
-- **Training Dataset**: Cleaned Turkish raw data with 5 billion tokens
-- **Training Method**: Initially with DORA, followed by fine-tuning with LORA using custom Turkish instruction sets
+- **Training Dataset**: Cleaned Turkish raw data with 5 billion tokens, custom Turkish instruction sets
+- **Training Method**: Initially with DORA, followed by fine-tuning with LORA
 
 ### DORA Configuration
 