Update README.md

This commit is contained in:
seyma erdem
2024-04-04 12:17:23 +00:00
committed by system
parent 0b22a81c5b
commit c84d2a719b

View File

@@ -15,8 +15,8 @@ This model is an extended version of a Mistral-based Large Language Model (LLM)
- **Base Model**: Mistral 7B based LLM
- **Tokenizer Extension**: Specifically extended for Turkish
- **Training Dataset**: Cleaned Turkish raw data with 5 billion tokens
- **Training Method**: Initially with DORA, followed by fine-tuning with LORA using custom Turkish instruction sets
- **Training Dataset**: Cleaned Turkish raw data with 5 billion tokens, custom Turkish instruction sets
- **Training Method**: Initially with DORA, followed by fine-tuning with LORA
### DORA Configuration