Model: CNCL-Penn-State/CrPO-sft-llama-3.1-8b-instruct Source: Original Platform
library_name, license, datasets, language, base_model
| library_name | license | datasets | language | base_model | |||
|---|---|---|---|---|---|---|---|
| transformers | mit |
|
|
|
CrPO-SFT-Llama-3.1-8B-Instruct
This is a Llama-3.1-8B-Instruct model supervised-finetuned (SFT) on the MuCE-SFT dataset from the Creative Preference Optimization paper.
Citation
@misc{ismayilzada2025creativepreferenceoptimization,
title={Creative Preference Optimization},
author={Mete Ismayilzada and Antonio Laverghetta Jr. and Simone A. Luchini and Reet Patel and Antoine Bosselut and Lonneke van der Plas and Roger E. Beaty},
year={2025},
eprint={2505.14442},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2505.14442},
}
Description
Languages
Python
100%