Files

ModelHub XC 3a8ca21d95 初始化项目，由ModelHub XC社区提供模型

Model: shareAI/llama3.1-8b-instruct-dpo-zh
Source: Original Platform

2026-06-04 21:54:13 +08:00

frameworks, license, tasks, language, tags, tools

frameworks

license

tasks

language

llama3.1-8b-instruct 中文DPO版

Github：llama3中文仓库
像原版instruct一样，喜欢用有趣中文和表情符号回答问题。

特点：偏好中文和emoji表情，且不损伤原instruct版模型能力。实测中文DPO版问答性能体验超过现在市面上任何llama3.1中文微调版（微调会大面积破坏llama3.1原版能力，导致遗忘）

DPO(beta 0.5) + lora rank128, alpha256 + 打开"lm_head", "input_layernorm", "post_attention_layernorm", "norm"层训练.

pip install streamlit
pip install transformers==4.40.1
streamlit run web.py ./llama3.1-8b-instruct-dpo-zh

SDK下载

#安装ModelScope
pip install modelscope

#SDK模型下载
from modelscope import snapshot_download
model_dir = snapshot_download('shareAI/llama3.1-8b-instruct-dpo-zh')

Git下载

#Git模型下载
git clone https://www.modelscope.cn/shareAI/llama3.1-8b-instruct-dpo-zh.git