Initialize project; model provided by the ModelHub XC community
Model: mlabonne/TwinLlama-3.1-8B Source: Original Platform
27
README.md
Normal file
@@ -0,0 +1,27 @@
---
base_model: meta-llama/Meta-Llama-3.1-8B
datasets:
- mlabonne/llmtwin
language:
- en
library_name: transformers
license: apache-2.0
tags:
- unsloth
- trl
- sft
---
# 👥 TwinLlama-3.1-8B
TwinLlama-3.1-8B is a model created for the [LLM Engineer's Handbook](https://a.co/d/9vYzTUC), trained on [mlabonne/llmtwin](https://huggingface.co/datasets/mlabonne/llmtwin).

It is designed to act as a digital twin: a clone of me and my co-authors (Paul Iusztin and Alex Vesa) that imitates our writing style and draws knowledge from our articles.

---
This Llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Hugging Face's TRL library.

[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
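
Since the metadata lists `transformers` as the library, a minimal sketch of how the model might be prompted is given here. It is an illustration under assumptions, not the authors' documented usage: the Alpaca-style template, the helper names `build_prompt` and `generate`, and the generation settings are all hypothetical and do not come from this card. Only the model ID is taken from it.

```python
"""Minimal sketch: prompting TwinLlama-3.1-8B with Hugging Face transformers."""

MODEL_ID = "mlabonne/TwinLlama-3.1-8B"

# ASSUMPTION: an Alpaca-style template. This card does not state the prompt
# format used during fine-tuning, so adjust it to match the training data.
PROMPT_TEMPLATE = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Response:\n"
)


def build_prompt(instruction: str) -> str:
    """Wrap a raw instruction in the (assumed) prompt template."""
    return PROMPT_TEMPLATE.format(instruction=instruction.strip())


def generate(instruction: str, max_new_tokens: int = 256) -> str:
    """Download the model and return only the newly generated text."""
    # Imported lazily so build_prompt() works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID, torch_dtype="auto")

    inputs = tokenizer(build_prompt(instruction), return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Drop the prompt tokens so the caller sees just the completion.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling `generate("Write a paragraph to introduce supervised fine-tuning.")` would download the weights on first use; an 8B-parameter model needs a machine with enough memory to hold them, and `torch` must be installed alongside `transformers`.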