初始化项目,由ModelHub XC社区提供模型

Model: freewheelin/free-llama3-dpo-v0.2
Source: Original Platform
This commit is contained in:
ModelHub XC
2026-05-10 17:21:16 +08:00
commit c1748ace58
12 changed files with 412987 additions and 0 deletions

18
README.md Normal file
View File

@@ -0,0 +1,18 @@
---
language:
- ko
- en
license: mit
---
# Model Card for free-llama-dpo-v0.2
## Developed by : [Freewheelin](https://freewheelin-recruit.oopy.io/) AI Technical Team
## Hardware and Software
* **Training Factors**: We fine-tuned this model using the [HuggingFace TRL Trainer](https://huggingface.co/docs/trl/trainer)
## Method
- This model was trained using the learning method introduced in the [SOLAR paper](https://arxiv.org/pdf/2312.15166.pdf).