初始化项目,由ModelHub XC社区提供模型
Model: Madras1/Jade8b-GGUF Source: Original Platform
This commit is contained in:
41
.gitattributes
vendored
Normal file
41
.gitattributes
vendored
Normal file
@@ -0,0 +1,41 @@
|
||||
*.7z filter=lfs diff=lfs merge=lfs -text
|
||||
*.arrow filter=lfs diff=lfs merge=lfs -text
|
||||
*.bin filter=lfs diff=lfs merge=lfs -text
|
||||
*.bz2 filter=lfs diff=lfs merge=lfs -text
|
||||
*.ckpt filter=lfs diff=lfs merge=lfs -text
|
||||
*.ftz filter=lfs diff=lfs merge=lfs -text
|
||||
*.gz filter=lfs diff=lfs merge=lfs -text
|
||||
*.h5 filter=lfs diff=lfs merge=lfs -text
|
||||
*.joblib filter=lfs diff=lfs merge=lfs -text
|
||||
*.lfs.* filter=lfs diff=lfs merge=lfs -text
|
||||
*.mlmodel filter=lfs diff=lfs merge=lfs -text
|
||||
*.model filter=lfs diff=lfs merge=lfs -text
|
||||
*.msgpack filter=lfs diff=lfs merge=lfs -text
|
||||
*.npy filter=lfs diff=lfs merge=lfs -text
|
||||
*.npz filter=lfs diff=lfs merge=lfs -text
|
||||
*.onnx filter=lfs diff=lfs merge=lfs -text
|
||||
*.ot filter=lfs diff=lfs merge=lfs -text
|
||||
*.parquet filter=lfs diff=lfs merge=lfs -text
|
||||
*.pb filter=lfs diff=lfs merge=lfs -text
|
||||
*.pickle filter=lfs diff=lfs merge=lfs -text
|
||||
*.pkl filter=lfs diff=lfs merge=lfs -text
|
||||
*.pt filter=lfs diff=lfs merge=lfs -text
|
||||
*.pth filter=lfs diff=lfs merge=lfs -text
|
||||
*.rar filter=lfs diff=lfs merge=lfs -text
|
||||
*.safetensors filter=lfs diff=lfs merge=lfs -text
|
||||
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
||||
*.tar.* filter=lfs diff=lfs merge=lfs -text
|
||||
*.tar filter=lfs diff=lfs merge=lfs -text
|
||||
*.tflite filter=lfs diff=lfs merge=lfs -text
|
||||
*.tgz filter=lfs diff=lfs merge=lfs -text
|
||||
*.wasm filter=lfs diff=lfs merge=lfs -text
|
||||
*.xz filter=lfs diff=lfs merge=lfs -text
|
||||
*.zip filter=lfs diff=lfs merge=lfs -text
|
||||
*.zst filter=lfs diff=lfs merge=lfs -text
|
||||
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
||||
jade8b-q4_k_m.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
jade8b-q2_k.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
jade8b-q3_k_m.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
jade8b-q5_k_m.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
jade8b-q6_k.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
jade8b-q8_0.gguf filter=lfs diff=lfs merge=lfs -text
|
||||
152
README.md
Normal file
152
README.md
Normal file
@@ -0,0 +1,152 @@
|
||||
---
|
||||
language:
|
||||
- pt
|
||||
- en
|
||||
license: apache-2.0
|
||||
base_model: unsloth/qwen3-8b-bnb-4bit
|
||||
base_model_relation: finetune
|
||||
library_name: transformers
|
||||
pipeline_tag: text-generation
|
||||
tags:
|
||||
- pt-br
|
||||
- portuguese
|
||||
- brazilian-portuguese
|
||||
- conversational
|
||||
- chatbot
|
||||
- persona
|
||||
- qwen2
|
||||
- qwen2.5
|
||||
- unsloth
|
||||
- 4-bit
|
||||
- bitsandbytes
|
||||
---
|
||||
|
||||
# Jade8b
|
||||
|
||||
Jade8b is a Brazilian Portuguese conversational finetune of Qwen3 8b built to express a strong, persistent persona. This model is designed for PT-BR chat, chatbot use cases, and character-style interaction, with colloquial language, abbreviations, slang, and a WhatsApp-like tone.
|
||||
|
||||
## Model Summary
|
||||
|
||||
Jade8b is a persona-first model. It was intentionally finetuned so the model speaks like **Jade** even without a strong `system prompt`. Because of that, the model often answers in PT-BR with informal phrasing such as `vc`, slang, and a friendly conversational tone from the very first turn.
|
||||
|
||||
## Model Details
|
||||
|
||||
- Developed by: `Madras1`
|
||||
- Base model: `unsloth/qwen3-8b-bnb-4bit`
|
||||
- Model type: conversational text-generation finetune
|
||||
- Primary language: Brazilian Portuguese (`pt-BR`)
|
||||
- License: `apache-2.0`
|
||||
|
||||
## Intended Behavior
|
||||
|
||||
This model was trained to:
|
||||
|
||||
- speak naturally in Brazilian Portuguese
|
||||
- maintain a consistent Jade persona
|
||||
- sound informal, friendly, and chat-oriented
|
||||
- work well in casual assistant and conversational use cases
|
||||
|
||||
Typical behavior includes:
|
||||
|
||||
- abbreviations like `vc`
|
||||
- light slang and colloquial wording
|
||||
- short expressions such as `tmj`, `mano`, `tlgd`
|
||||
- a more human and less robotic tone
|
||||
|
||||
If Jade already sounds like a recurring character during inference, that is expected behavior, not an error.
|
||||
|
||||
## Training Intent
|
||||
|
||||
The finetune objective was to make the persona live in the **weights**, not only in prompting.
|
||||
|
||||
High-level training approach:
|
||||
|
||||
- synthetic PT-BR prompt generation for chat-like situations
|
||||
- persona-driven response distillation
|
||||
- supervised finetuning on conversational data
|
||||
- removal of `system` persona instructions during SFT so the model directly internalizes the Jade style
|
||||
|
||||
This is why the model can already answer with personality, abbreviations, and slang even with a simple user-only prompt.
|
||||
|
||||
## Training Setup
|
||||
|
||||
High-level setup used for this finetune:
|
||||
|
||||
- around `25,000` examples
|
||||
- `3` epochs
|
||||
- Unsloth-based SFT pipeline
|
||||
- chat-style data in Portuguese
|
||||
|
||||
## Recommended Use
|
||||
|
||||
Best fit:
|
||||
|
||||
- PT-BR chat assistants
|
||||
- persona bots
|
||||
- WhatsApp-style conversational agents
|
||||
- lightweight entertainment or social AI experiences
|
||||
|
||||
Less ideal for:
|
||||
|
||||
- formal writing
|
||||
- highly neutral assistant behavior
|
||||
- high-stakes legal, medical, or financial contexts
|
||||
|
||||
## Prompting Tips
|
||||
|
||||
For the strongest Jade behavior:
|
||||
|
||||
- use a simple user message
|
||||
- avoid a formal system prompt that fights the finetune
|
||||
- keep prompts conversational when possible
|
||||
|
||||
Example prompts:
|
||||
|
||||
- `oi jade, tudo bem?`
|
||||
- `jade, me explica isso de um jeito simples`
|
||||
- `vc acha que vale a pena estudar python hoje?`
|
||||
|
||||
## Example Inference
|
||||
|
||||
```python
|
||||
from transformers import AutoModelForCausalLM, AutoTokenizer
|
||||
import torch
|
||||
|
||||
model_id = "Madras1/Jade8b"
|
||||
|
||||
tokenizer = AutoTokenizer.from_pretrained(model_id)
|
||||
model = AutoModelForCausalLM.from_pretrained(
|
||||
model_id,
|
||||
torch_dtype=torch.bfloat16,
|
||||
device_map="auto",
|
||||
)
|
||||
|
||||
messages = [
|
||||
{"role": "user", "content": "oi jade, tudo bem?"}
|
||||
]
|
||||
|
||||
text = tokenizer.apply_chat_template(
|
||||
messages,
|
||||
tokenize=False,
|
||||
add_generation_prompt=True,
|
||||
)
|
||||
|
||||
inputs = tokenizer(text, return_tensors="pt").to(model.device)
|
||||
outputs = model.generate(
|
||||
**inputs,
|
||||
max_new_tokens=256,
|
||||
temperature=0.7,
|
||||
top_p=0.9,
|
||||
)
|
||||
|
||||
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
|
||||
```
|
||||
|
||||
## Limitations
|
||||
|
||||
Because this is a persona-oriented finetune:
|
||||
|
||||
- it may sound informal in contexts where a neutral tone would be better
|
||||
- it may over-index on chat style depending on the prompt
|
||||
- it is optimized more for persona consistency than strict formality
|
||||
|
||||
3
jade8b-q2_k.gguf
Normal file
3
jade8b-q2_k.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:54ae1cf8d872791bc41c06c9938f608c9ac4e0ce8792f0569c1bb93986be676b
|
||||
size 3281728640
|
||||
3
jade8b-q3_k_m.gguf
Normal file
3
jade8b-q3_k_m.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:fa62870c748d765e56186967858e44dfb8c488a8a12292b6bed4780d3a1a7b97
|
||||
size 4124157056
|
||||
3
jade8b-q4_k_m.gguf
Normal file
3
jade8b-q4_k_m.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:ac3b2ed95fa6aba04847fe409126d7f49f7c5fac968b1a97a01331d2786d2d2e
|
||||
size 5027779712
|
||||
3
jade8b-q5_k_m.gguf
Normal file
3
jade8b-q5_k_m.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:aea942cd3ad8d38a587d568a39eadfa3d65a595acab49ee5ea5646c71bb9c893
|
||||
size 5851108480
|
||||
3
jade8b-q6_k.gguf
Normal file
3
jade8b-q6_k.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:4ae173c4d7a538482e67e798a329b3766fdf0e29f33030a93233165ab2f1cacd
|
||||
size 6725895296
|
||||
3
jade8b-q8_0.gguf
Normal file
3
jade8b-q8_0.gguf
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:96c76677ba5d63a7ef9bd937753dfdee0d4ffd62fb64bb6cbc8aa939065da7aa
|
||||
size 8709514368
|
||||
Reference in New Issue
Block a user