初始化项目,由ModelHub XC社区提供模型
Model: freewheelin/free-llama3-dpo-v0.2 Source: Original Platform
This commit is contained in:
18
README.md
Normal file
18
README.md
Normal file
@@ -0,0 +1,18 @@
|
||||
---
|
||||
language:
|
||||
- ko
|
||||
- en
|
||||
license: mit
|
||||
---
|
||||
|
||||
# Model Card for free-llama-dpo-v0.2
|
||||
|
||||
## Developed by : [Freewheelin](https://freewheelin-recruit.oopy.io/) AI Technical Team
|
||||
|
||||
## Hardware and Software
|
||||
|
||||
* **Training Factors**: We fine-tuned this model using the [HuggingFace TRL Trainer](https://huggingface.co/docs/trl/trainer)
|
||||
|
||||
## Method
|
||||
- This model was trained using the learning method introduced in the [SOLAR paper](https://arxiv.org/pdf/2312.15166.pdf).
|
||||
|
||||
Reference in New Issue
Block a user