ModelHub XC 1da3c4a610 Initialize project; model provided by the ModelHub XC community
Model: TheTravellingEngineer/llama2-7b-chat-hf-dpo
Source: Original Platform
2026-05-21 08:32:15 +08:00

The base model is Meta's Llama-2-7b-chat-hf. It was fine-tuned with DPO (Direct Preference Optimization) on the comparison_gpt4 dataset, and its prompt format is similar to that of the original Guanaco model. This repo contains the merged fp16 model.
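
A minimal sketch of loading the merged fp16 weights with Hugging Face `transformers` and prompting the model. The exact prompt template is not specified here; the `### Human:` / `### Assistant:` layout below is the one commonly associated with Guanaco and is only an assumption, as are the helper names `build_prompt` and `generate` and the default `max_new_tokens` value.

```python
MODEL_ID = "TheTravellingEngineer/llama2-7b-chat-hf-dpo"


def build_prompt(user_message: str) -> str:
    """Build a single-turn, Guanaco-style prompt.

    Assumption: the README only says the prompt is "similar to" Guanaco's,
    so this template may need adjusting.
    """
    return f"### Human: {user_message.strip()}\n### Assistant:"


def generate(user_message: str, max_new_tokens: int = 128) -> str:
    """Load the model in fp16 and return the completion for one message."""
    # Imported lazily so build_prompt can be used without these packages.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID,
        torch_dtype=torch.float16,  # the repo ships merged fp16 weights
        device_map="auto",          # requires the `accelerate` package
    )

    inputs = tokenizer(build_prompt(user_message), return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Drop the prompt tokens and decode only the newly generated ones.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)


if __name__ == "__main__":
    print(build_prompt("What is DPO?"))
```

Calling `generate("What is DPO?")` downloads the weights on first use; a 7B model in fp16 needs roughly 14 GB of memory for the weights alone.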