初始化项目,由ModelHub XC社区提供模型

Model: CNCL-Penn-State/CrPO-sft-llama-3.1-8b-instruct
Source: Original Platform
This commit is contained in:
ModelHub XC
2026-05-07 05:47:11 +08:00
commit 6bc24a38b9
13 changed files with 2576 additions and 0 deletions

27
README.md Normal file
View File

@@ -0,0 +1,27 @@
---
library_name: transformers
license: mit
datasets:
- CNCL-Penn-State/MuCE-SFT
language:
- en
base_model:
- meta-llama/Llama-3.1-8B-Instruct
---
# CrPO-SFT-Llama-3.1-8B-Instruct
This is a Llama-3.1-8B-Instruct model supervised-finetuned (SFT) on the [MuCE-SFT](https://huggingface.co/datasets/CNCL-Penn-State/MuCE-SFT) dataset from the [Creative Preference Optimization](https://arxiv.org/abs/2505.14442) paper.
## Citation
```
@misc{ismayilzada2025creativepreferenceoptimization,
title={Creative Preference Optimization},
author={Mete Ismayilzada and Antonio Laverghetta Jr. and Simone A. Luchini and Reet Patel and Antoine Bosselut and Lonneke van der Plas and Roger E. Beaty},
year={2025},
eprint={2505.14442},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2505.14442},
}
```