---
library_name: transformers
license: mit
datasets:
- CNCL-Penn-State/MuCE-SFT
language:
- en
base_model:
- meta-llama/Llama-3.1-8B-Instruct
---

# CrPO-SFT-Llama-3.1-8B-Instruct

This is a Llama-3.1-8B-Instruct model supervised-finetuned (SFT) on the [MuCE-SFT](https://huggingface.co/datasets/CNCL-Penn-State/MuCE-SFT) dataset from the [Creative Preference Optimization](https://arxiv.org/abs/2505.14442) paper.

## Citation
```
@misc{ismayilzada2025creativepreferenceoptimization,
      title={Creative Preference Optimization}, 
      author={Mete Ismayilzada and Antonio Laverghetta Jr. and Simone A. Luchini and Reet Patel and Antoine Bosselut and Lonneke van der Plas and Roger E. Beaty},
      year={2025},
      eprint={2505.14442},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2505.14442}, 
}
```