--- library_name: transformers license: mit datasets: - CNCL-Penn-State/MuCE-SFT language: - en base_model: - meta-llama/Llama-3.1-8B-Instruct --- # CrPO-SFT-Llama-3.1-8B-Instruct This is a Llama-3.1-8B-Instruct model supervised-finetuned (SFT) on the [MuCE-SFT](https://huggingface.co/datasets/CNCL-Penn-State/MuCE-SFT) dataset from the [Creative Preference Optimization](https://arxiv.org/abs/2505.14442) paper. ## Citation ``` @misc{ismayilzada2025creativepreferenceoptimization, title={Creative Preference Optimization}, author={Mete Ismayilzada and Antonio Laverghetta Jr. and Simone A. Luchini and Reet Patel and Antoine Bosselut and Lonneke van der Plas and Roger E. Beaty}, year={2025}, eprint={2505.14442}, archivePrefix={arXiv}, primaryClass={cs.CL}, url={https://arxiv.org/abs/2505.14442}, } ```