27 lines
853 B
Markdown
27 lines
853 B
Markdown
|
|
---
|
||
|
|
library_name: transformers
|
||
|
|
license: mit
|
||
|
|
datasets:
|
||
|
|
- CNCL-Penn-State/MuCE-SFT
|
||
|
|
language:
|
||
|
|
- en
|
||
|
|
base_model:
|
||
|
|
- meta-llama/Llama-3.1-8B-Instruct
|
||
|
|
---
|
||
|
|
|
||
|
|
# CrPO-SFT-Llama-3.1-8B-Instruct
|
||
|
|
|
||
|
|
This is a Llama-3.1-8B-Instruct model supervised-finetuned (SFT) on the [MuCE-SFT](https://huggingface.co/datasets/CNCL-Penn-State/MuCE-SFT) dataset from the [Creative Preference Optimization](https://arxiv.org/abs/2505.14442) paper.
|
||
|
|
|
||
|
|
## Citation
|
||
|
|
```
|
||
|
|
@misc{ismayilzada2025creativepreferenceoptimization,
|
||
|
|
title={Creative Preference Optimization},
|
||
|
|
author={Mete Ismayilzada and Antonio Laverghetta Jr. and Simone A. Luchini and Reet Patel and Antoine Bosselut and Lonneke van der Plas and Roger E. Beaty},
|
||
|
|
year={2025},
|
||
|
|
eprint={2505.14442},
|
||
|
|
archivePrefix={arXiv},
|
||
|
|
primaryClass={cs.CL},
|
||
|
|
url={https://arxiv.org/abs/2505.14442},
|
||
|
|
}
|
||
|
|
```
|