---
frameworks:
- Pytorch
license: Apache License 2.0
tasks:
- text-classification

#model-type:
##e.g. gpt, phi, llama, chatglm, baichuan, etc.
- llama

#domain:
##e.g. nlp, cv, audio, multi-modal
- nlp

#language:
##language code list: https://help.aliyun.com/document_detail/215387.html?spm=a2c4g.11186623.0.0.9f8d7467kni6Aa
- cn
- zh

#metrics:
##e.g. CIDEr, BLEU, ROUGE, etc.
#- CIDEr

#tags:
##custom tags, e.g. training methods such as pretrained, fine-tuned, instruction-tuned, RL-tuned, etc.
- fine-tuned

#tools:
##e.g. vllm, fastchat, llamacpp, AdaSeq, etc.
- vllm
---

# Unichat-llama3-Chinese-8B

## Introduction

* The China Unicom AI Innovation Center releases a Chinese instruction-tuned Llama 3.1 model, trained with full-parameter fine-tuning.
* Base model: [**Meta-Llama-3.1-8B-Instruct**](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct)

### 📊 Data

- High-quality instruction data covering multiple domains and industries, providing ample data support for model training
- The fine-tuning instruction data is strictly screened by human reviewers, ensuring that only high-quality instruction data is used for fine-tuning

Example usage with the `transformers` pipeline:

```python
import transformers
import torch

model_id = "UnicomAI/Unichat-llama3.1-Chinese-8B"

pipeline = transformers.pipeline(
    "text-generation",
    model=model_id,
    model_kwargs={"torch_dtype": torch.bfloat16},
    device_map="auto",
)

messages = [
    {"role": "system", "content": "You are a helpful assistant"},
    {"role": "user", "content": "你是谁?"},
]

outputs = pipeline(
    messages,
    max_new_tokens=1024,
    do_sample=False,
    repetition_penalty=1.1,
)
print(outputs[0]["generated_text"][-1])
```
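When given chat-format messages, the pipeline applies the base model's chat template under the hood. As a rough illustration of what that template produces, here is a minimal sketch of the Llama 3.1-style prompt layout (the `build_prompt` helper is ours, not part of the model; the authoritative template ships with the tokenizer):

```python
# Illustrative sketch of the Llama 3.1 chat prompt layout that the
# tokenizer's chat template constructs internally. `build_prompt` is a
# hypothetical helper for demonstration, not part of transformers.
def build_prompt(messages):
    parts = ["<|begin_of_text|>"]
    for m in messages:
        # Each message is wrapped in role header tokens and ends with <|eot_id|>.
        parts.append(
            f"<|start_header_id|>{m['role']}<|end_header_id|>\n\n{m['content']}<|eot_id|>"
        )
    # An open assistant header cues the model to generate its reply.
    parts.append("<|start_header_id|>assistant<|end_header_id|>\n\n")
    return "".join(parts)

messages = [
    {"role": "system", "content": "You are a helpful assistant"},
    {"role": "user", "content": "你是谁?"},
]
print(build_prompt(messages))
```

This is only a mental model of the prompt format; in practice, always pass the message list directly to the pipeline (as above) so the real template is used.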

## Resources

For more models, datasets, and training details, please refer to:

* GitHub: [**Unichat-llama3-Chinese**](https://github.com/UnicomAI/Unichat-llama3-Chinese)