free-llama3-dpo-v0.2/README.md

---
language:
- ko
- en
license: mit
---

# Model Card for free-llama-dpo-v0.2

## Developed by : [Freewheelin](https://freewheelin-recruit.oopy.io/) AI Technical Team

## Hardware and Software

* **Training Factors**: We fine-tuned this model using the [HuggingFace TRL Trainer](https://huggingface.co/docs/trl/trainer)

## Method
- This model was trained using the learning method introduced in the [SOLAR paper](https://arxiv.org/pdf/2312.15166.pdf).
初始化项目，由ModelHub XC社区提供模型 Model: freewheelin/free-llama3-dpo-v0.2 Source: Original Platform 2026-05-10 17:21:16 +08:00			`---`
			`language:`
			`- ko`
			`- en`
			`license: mit`
			`---`

			`# Model Card for free-llama-dpo-v0.2`

			`## Developed by : [Freewheelin](https://freewheelin-recruit.oopy.io/) AI Technical Team`

			`## Hardware and Software`

			`* Training Factors: We fine-tuned this model using the [HuggingFace TRL Trainer](https://huggingface.co/docs/trl/trainer)`

			`## Method`
			`- This model was trained using the learning method introduced in the [SOLAR paper](https://arxiv.org/pdf/2312.15166.pdf).`