初始化项目，由ModelHub XC社区提供模型

Model: tartuNLP/Llammas-base Source: Original Platform
2026-06-12 20:53:50 +08:00
commit 1206ae2ad3
14 changed files with 94158 additions and 0 deletions
--- a/README.md
+++ b/README.md
@@ -0,0 +1,28 @@
+---
+language:
+- et
+- en
+pipeline_tag: text-generation
+base_model:
+- meta-llama/Llama-2-7b-hf
+license: llama2
+---
+# LLammas-base 🐑
+
+Llama-2-7B with continued pre-training of 5B tokens of CulturaX (75% Estonian, 25% English documents).
+
+This model is also instruction-tuned resulting in [Llammas](https://huggingface.co/tartuNLP/Llammas).
+
+More details in our [paper](https://arxiv.org/abs/2404.04042). 
+
+### Citation
+```
+@misc{kuulmets2024teaching,
+      title={Teaching Llama a New Language Through Cross-Lingual Knowledge Transfer}, 
+      author={Hele-Andra Kuulmets and Taido Purason and Agnes Luhtaru and Mark Fishel},
+      year={2024},
+      eprint={2404.04042},
+      archivePrefix={arXiv},
+      primaryClass={cs.CL}
+}
+```