The base model is Meta's Llama-2-7b-chat-hf. It was fine-tuned with DPO (Direct Preference Optimization) on the comparison_gpt4 dataset, and its prompt format is similar to that of the original Guanaco model. This repo contains the merged fp16 model.
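
As a rough sketch of how the merged fp16 model might be loaded and prompted with Hugging Face Transformers: the repo id is taken from the source line of this README, and the `### Human:` / `### Assistant:` template is the original Guanaco format, used here as an assumption because this README only says the prompt is "similar". The helper names `build_prompt` and `generate` are hypothetical, and `device_map="auto"` assumes the `accelerate` package is installed.

```python
# Repo id as given in this README's source line.
MODEL_ID = "TheTravellingEngineer/llama2-7b-chat-hf-dpo"


def build_prompt(user_message: str) -> str:
    """Build a single-turn Guanaco-style prompt.

    Assumption: the exact template is not documented in this README;
    this follows the original Guanaco format.
    """
    return f"### Human: {user_message.strip()}\n### Assistant:"


def generate(user_message: str, max_new_tokens: int = 128) -> str:
    """Load the model in fp16 and return the completion for one prompt."""
    # Imported lazily so build_prompt can be used without these packages.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID,
        torch_dtype=torch.float16,  # the repo ships merged fp16 weights
        device_map="auto",          # requires `accelerate`
    )

    inputs = tokenizer(build_prompt(user_message), return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Keep only the newly generated tokens, dropping the echoed prompt.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)


# Example call (downloads the weights on first use):
# print(generate("Explain DPO in one sentence."))
```

If the outputs run on into a new `### Human:` turn, the template above probably differs from the one used in training, and a stop condition or a different prompt format may be needed.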


Description
Model synced from source: TheTravellingEngineer/llama2-7b-chat-hf-dpo