Model: freewheelin/free-llama3-dpo-v0.2 Source: Original Platform
| language | license |
|---|---|
| | mit |
Model Card for free-llama-dpo-v0.2
Developed by: Freewheelin AI Technical Team
Hardware and Software
- Training Factors: We fine-tuned this model using the Hugging Face TRL Trainer.
Method
- This model was trained using the learning method introduced in the SOLAR paper.
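The card does not spell out the training objective. As a minimal sketch, assuming the "dpo" in the model name refers to Direct Preference Optimization, the per-example loss can be written in plain Python as below. The function name `dpo_loss`, its argument names, and the default `beta=0.1` are illustrative assumptions, not values taken from this model's actual training setup.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Per-example DPO loss from summed sequence log-probabilities.

    Hypothetical helper for illustration; beta=0.1 is an assumed default,
    not a hyperparameter reported in this model card.
    """
    # How much more (in log space) the policy likes each response
    # than the frozen reference model does.
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp

    # Scaled preference margin between the chosen and rejected responses.
    margin = beta * (chosen_logratio - rejected_logratio)

    # loss = -log(sigmoid(margin)) = log(1 + exp(-margin)),
    # computed in a numerically stable way for either sign of the margin.
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


if __name__ == "__main__":
    # Policy identical to the reference: margin is 0, loss is log(2).
    print(dpo_loss(-1.0, -2.0, -1.0, -2.0))
    # Policy favors the chosen response more than the reference does:
    # the loss drops below log(2).
    print(dpo_loss(-0.5, -3.0, -1.0, -2.0))
```

In practice a library trainer computes these log-probabilities over batches of preference pairs; the sketch only shows the scalar objective for a single pair.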
Description