初始化项目,由ModelHub XC社区提供模型
Model: stephenlzc/Mistral-7B-v0.3-Chinese-Chat-uncensored Source: Original Platform
This commit is contained in:
89
README.md
Normal file
89
README.md
Normal file
@@ -0,0 +1,89 @@
|
||||
---
|
||||
base_model: shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat
|
||||
datasets:
|
||||
- Minami-su/toxic-sft-zh
|
||||
- llm-wizard/alpaca-gpt4-data-zh
|
||||
- stephenlzc/stf-alpaca
|
||||
language:
|
||||
- zh
|
||||
license: mit
|
||||
pipeline_tag: text-generation
|
||||
tags:
|
||||
- text-generation-inference
|
||||
- code
|
||||
- unsloth
|
||||
- uncensored
|
||||
- finetune
|
||||
task_categories:
|
||||
- conversational
|
||||
widget:
|
||||
- text: >-
|
||||
Is this review positive or negative? Review: Best cast iron skillet you will
|
||||
ever buy.
|
||||
example_title: Sentiment analysis
|
||||
- text: >-
|
||||
Barack Obama nominated Hilary Clinton as his secretary of state on Monday.
|
||||
He chose her because she had ...
|
||||
example_title: Coreference resolution
|
||||
- text: >-
|
||||
On a shelf, there are five books: a gray book, a red book, a purple book, a
|
||||
blue book, and a black book ...
|
||||
example_title: Logic puzzles
|
||||
- text: >-
|
||||
The two men running to become New York City's next mayor will face off in
|
||||
their first debate Wednesday night ...
|
||||
example_title: Reading comprehension
|
||||
---
|
||||
|
||||
|
||||
## Model Details
|
||||
|
||||
### Model Description
|
||||
|
||||
- Using **shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat** as base model, and finetune the dataset as mentioned via **[unsloth](https://github.com/unslothai/unsloth)**. Makes the model uncensored.
|
||||
- [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
|
||||
### Training Code
|
||||
|
||||
- [](https://colab.research.google.com/drive/1K9stY8LMVcySG0jDMYZdWQCFPfoDFBL-?usp=sharing)
|
||||
|
||||
### Training Procedure Raw Files
|
||||
|
||||
- ALL the procedure are training on **[Vast.ai](https://cloud.vast.ai/?ref_id=138637)**
|
||||
|
||||
|
||||
- **Hardware in Vast.ai**:
|
||||
|
||||
- **GPU**: 1x A100 SXM4 80GB
|
||||
|
||||
- **CPU**: AMD EPYC 7513 32-Core Processor
|
||||
|
||||
- **RAM**: 129 GB
|
||||
|
||||
- **Disk Space To Allocate**:>150GB
|
||||
|
||||
- **Docker Image**: pytorch/pytorch:2.2.0-cuda12.1-cudnn8-devel
|
||||
|
||||
- Download the **[ipynb file](https://huggingface.co/stephenlzc/Mistral-7B-v0.3-Chinese-Chat-uncensored/blob/main/Mistral-7B-v0.3-Chinese-Chat-uncensored.ipynb)**.
|
||||
|
||||
|
||||
### Training Data
|
||||
- **Base Model**
|
||||
- [shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat](https://huggingface.co/shenzhi-wang/Mistral-7B-v0.3-Chinese-Chat)
|
||||
|
||||
- **Dataset**
|
||||
- [Minami-su/toxic-sft-zh](https://huggingface.co/datasets/Minami-su/toxic-sft-zh)
|
||||
- [llm-wizard/alpaca-gpt4-data-zh](https://huggingface.co/datasets/llm-wizard/alpaca-gpt4-data-zh)
|
||||
- [stephenlzc/stf-alpaca](https://huggingface.co/datasets/stephenlzc/stf-alpaca)
|
||||
|
||||
|
||||
### Usage
|
||||
```python
|
||||
from transformers import pipeline
|
||||
|
||||
qa_model = pipeline("question-answering", model='stephenlzc/Mistral-7B-v0.3-Chinese-Chat-uncensored')
|
||||
question = "How to make girlfreind laugh? please answer in Chinese."
|
||||
qa_model(question = question)
|
||||
```
|
||||
|
||||
###
|
||||
[<img src="https://img.buymeacoffee.com/button-api/?text=Buy me a coffee&emoji=&slug=chicongliau&button_colour=40DCA5&font_colour=ffffff&font_family=Poppins&outline_colour=000000&coffee_colour=FFDD00" width="200"/>](https://www.buymeacoffee.com/chicongliau)
|
||||
Reference in New Issue
Block a user