Files
mistral-inst-v02-dpo/README.md

8 lines
335 B
Markdown
Raw Permalink Normal View History

---
license: mit
---
take the mistral inst-v02 model and run dpo on it, 6000 epoch.
take the mistral inst-v02 model and run dpo on it, 6000 epoch.
take the mistral inst-v02 model and run dpo on it, 6000 epoch.
take the mistral inst-v02 model and run dpo on it, 6000 epoch.
take the mistral inst-v02 model and run dpo on it, 6000 epoch.