初始化项目,由ModelHub XC社区提供模型
Model: Nabbers1999/Mini-Llama-14B-Chat-SFT-0129 Source: Original Platform
This commit is contained in:
27
README.md
Normal file
27
README.md
Normal file
@@ -0,0 +1,27 @@
|
||||
---
|
||||
base_model: mistralai/Ministral-3-14B-Base-2512
|
||||
library_name: transformers
|
||||
tags:
|
||||
- ministral-3
|
||||
- text-generation
|
||||
- instruct
|
||||
- llamafied
|
||||
- novision
|
||||
license: apache-2.0
|
||||
language:
|
||||
- en
|
||||
---
|
||||
|
||||

|
||||
|
||||
# Mini-Llama 14B Chat - 0129
|
||||
|
||||
My instruct model has undergone DoRA SFT on my custom synthetic chat dataset, containing single and multi-round chats containing SFW, NSFW, and Toxic single and multi-round chats. This reinforces the model's uncensored compliance with all prompts and teaches it how to better fill roles assigned to it in the system prompt.
|
||||
|
||||
This model has yet to go through DPO preference training and may still have rough edges.
|
||||
|
||||
** Be aware that this adapter, when used without a system prompt to assign it a role may make up its own role. Meaning if you just say 'Hello' it could resond with 'Hello, how may I assist you?' or it might respond with something like "Hi, my name is Carol and I'm a librarian here to assist you with finding the book you're looking for."
|
||||
|
||||
For the base pretrain, see: [Nabbers1999/Mini-Llama-14B-Base-0124](https://huggingface.co/Nabbers1999/Mini-Llama-14B-Base-0124)
|
||||
|
||||
For the instruct, see: [Nabbers1999/Mini-Llama-14B-Instruct-0124](https://huggingface.co/Nabbers1999/Mini-Llama-14B-Instruct-0124)
|
||||
Reference in New Issue
Block a user