llama2-7b-chat-hf-dpo

TheTravellingEngineer/llama2-7b-chat-hf-dpo

Go to file

ModelHub XC 1da3c4a610 初始化项目，由ModelHub XC社区提供模型

Model: TheTravellingEngineer/llama2-7b-chat-hf-dpo
Source: Original Platform

2026-05-21 08:32:15 +08:00

.gitattributes

初始化项目，由ModelHub XC社区提供模型

2026-05-21 08:32:15 +08:00

config.json

初始化项目，由ModelHub XC社区提供模型

2026-05-21 08:32:15 +08:00

generation_config.json

初始化项目，由ModelHub XC社区提供模型

2026-05-21 08:32:15 +08:00

pytorch_model-00001-of-00002.bin

初始化项目，由ModelHub XC社区提供模型

2026-05-21 08:32:15 +08:00

pytorch_model-00002-of-00002.bin

初始化项目，由ModelHub XC社区提供模型

2026-05-21 08:32:15 +08:00

pytorch_model.bin.index.json

初始化项目，由ModelHub XC社区提供模型

2026-05-21 08:32:15 +08:00

README.md

初始化项目，由ModelHub XC社区提供模型

2026-05-21 08:32:15 +08:00

special_tokens_map.json

初始化项目，由ModelHub XC社区提供模型

2026-05-21 08:32:15 +08:00

tokenizer_config.json

初始化项目，由ModelHub XC社区提供模型

2026-05-21 08:32:15 +08:00

tokenizer.model

初始化项目，由ModelHub XC社区提供模型

2026-05-21 08:32:15 +08:00

README.md

The base model is meta's Llama-2-7b-chat-hf. It was finetuned using DPO and the comparison_gpt4 dataset and the model prompt is similar to the original Guanaco model. This repo contains the merged fp16 model.

Legal Disclaimer: This model is bound by the usage restrictions of the original Llama-2 model. And comes with no warranty or gurantees of any kind.

license:
- llama2
datasets:
- comparison_gpt4
language:
- en
reference: https://github.com/hiyouga/LLaMA-Efficient-Tuning/tree/main