Files
CrPO-sft-llama-3.1-8b-instruct/README.md

27 lines
853 B
Markdown
Raw Normal View History

---
library_name: transformers
license: mit
datasets:
- CNCL-Penn-State/MuCE-SFT
language:
- en
base_model:
- meta-llama/Llama-3.1-8B-Instruct
---
# CrPO-SFT-Llama-3.1-8B-Instruct
This is a Llama-3.1-8B-Instruct model supervised-finetuned (SFT) on the [MuCE-SFT](https://huggingface.co/datasets/CNCL-Penn-State/MuCE-SFT) dataset from the [Creative Preference Optimization](https://arxiv.org/abs/2505.14442) paper.
## Citation
```
@misc{ismayilzada2025creativepreferenceoptimization,
title={Creative Preference Optimization},
author={Mete Ismayilzada and Antonio Laverghetta Jr. and Simone A. Luchini and Reet Patel and Antoine Bosselut and Lonneke van der Plas and Roger E. Beaty},
year={2025},
eprint={2505.14442},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2505.14442},
}
```