The base model is Meta's Llama-2-7b-chat-hf. It was fine-tuned with supervised fine-tuning (SFT) on the Anthropic/hh-rlhf dataset, and its prompt format is similar to that of the original Guanaco model. This repo contains the merged fp16 model.
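
As a rough illustration of how the merged fp16 weights might be loaded and prompted, here is a minimal sketch using the Hugging Face `transformers` library. Several things in it are assumptions rather than facts from this repo: the Hub id is taken from the sync source named in this README, the `### Human:` / `### Assistant:` template is the commonly used Guanaco format (this model's prompt is only described as "similar"), and `build_prompt` and `generate` are hypothetical helper names. `device_map="auto"` additionally requires the `accelerate` package.

```python
# Assumed Hub id, taken from the sync source named in this README.
MODEL_ID = "TheTravellingEngineer/llama2-7b-chat-hf-v3"


def build_prompt(user_message: str) -> str:
    """Build a single-turn Guanaco-style prompt.

    Assumption: the exact template used for this model may differ,
    since the README only says it is "similar" to Guanaco's.
    """
    return f"### Human: {user_message.strip()}\n### Assistant:"


def generate(user_message: str, max_new_tokens: int = 128) -> str:
    """Load the model in fp16 and generate a reply (hypothetical helper)."""
    # Imported lazily so build_prompt can be used without these packages.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID,
        torch_dtype=torch.float16,  # the repo ships merged fp16 weights
        device_map="auto",          # needs the accelerate package
    )

    inputs = tokenizer(build_prompt(user_message), return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Keep only the newly generated tokens, dropping the echoed prompt.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling `generate("How do I brew green tea?")` would download the weights on first use, so a GPU with enough memory for a 7B fp16 model is advisable. If the replies look malformed, the prompt template is the first thing to adjust.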


Description
Model synced from source: TheTravellingEngineer/llama2-7b-chat-hf-v3