---
license: llama3
pipeline_tag: text-generation
base_model: OwenArli/ArliAI-Llama-3-8B-Cumulus-v0.3.2-GGUF
---

# OwenArli/ArliAI-Llama-3-8B-Cumulus-v0.3.2-GGUF

This is a quantized version of [OwenArli/ArliAI-Llama-3-8B-Cumulus-v0.3.2-GGUF](https://huggingface.co/OwenArli/ArliAI-Llama-3-8B-Cumulus-v0.3.2-GGUF), created using llama.cpp.

# Model Description

This model is based on Meta-Llama-3-8B-Instruct and is governed by the Meta Llama 3 License agreement:

https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct

This v0.3.2 version is even more uncensored thanks to using https://huggingface.co/AwanLLM/Awanllm-Llama-3-8B-Dolfin-v0.6-Abliterated as the base model. The 0.0.2 increment is for a slight adjustment to the DPO stage.

In terms of reasoning and intelligence, this model is probably a bit worse than the original model because of the decensoring. However, it is better at long back-and-forth chats and refuses less often.

This model works best with system prompts that tell it that it is the character, rather than telling it to act as a character.

Training:
- Full 8192-token sequence length.
- Training took around 2 days on an RTX 4090, using 4-bit loading and QLoRA with rank 64 and alpha 64, resulting in ~2% trainable weights.

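As a rough sanity check on the ~2% figure, the sketch below counts the LoRA adapter parameters at rank 64. It assumes adapters on all seven linear projections of every transformer block and uses the published Llama 3 8B dimensions. The set of target modules is an assumption, since the training notes above do not list them. Alpha only scales the adapter output and does not change the parameter count.

```python
# Rough LoRA parameter count for Llama 3 8B at rank 64.
# ASSUMPTION: adapters on all seven linear projections per block;
# the actual target modules used in training are not stated.

HIDDEN = 4096               # model hidden size
INTERMEDIATE = 14336        # MLP intermediate size
KV_DIM = 1024               # 8 KV heads * 128 head dim (grouped-query attention)
NUM_LAYERS = 32
RANK = 64
BASE_PARAMS = 8_030_261_248  # total parameters of Llama 3 8B


def lora_params(d_in: int, d_out: int, rank: int) -> int:
    """A LoRA adapter adds A (rank x d_in) and B (d_out x rank)."""
    return rank * (d_in + d_out)


# (input dim, output dim) of each adapted projection in one block
projections = {
    "q_proj": (HIDDEN, HIDDEN),
    "k_proj": (HIDDEN, KV_DIM),
    "v_proj": (HIDDEN, KV_DIM),
    "o_proj": (HIDDEN, HIDDEN),
    "gate_proj": (HIDDEN, INTERMEDIATE),
    "up_proj": (HIDDEN, INTERMEDIATE),
    "down_proj": (INTERMEDIATE, HIDDEN),
}

per_layer = sum(lora_params(i, o, RANK) for i, o in projections.values())
lora_total = per_layer * NUM_LAYERS
fraction = lora_total / BASE_PARAMS

print(f"adapter parameters: {lora_total:,}")
print(f"share of base weights: {fraction:.2%}")
```

Under these assumptions the adapters come to about 168M parameters, roughly 2.1% of the base weights, which is consistent with the figure quoted above.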
Instruct format:
```
<|begin_of_text|><|start_header_id|>system<|end_header_id|>

{{ system_prompt }}<|eot_id|><|start_header_id|>user<|end_header_id|>

{{ user_message_1 }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>

{{ model_answer_1 }}<|eot_id|><|start_header_id|>user<|end_header_id|>

{{ user_message_2 }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
```

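For anyone assembling prompts by hand, the template can be sketched as a small Python helper. This is a minimal illustration, not an official implementation: `build_llama3_prompt` is a hypothetical name, and the example character is invented. The system prompt follows the advice above by telling the model that it is the character. The blank line after the final assistant header is an assumption, chosen to match the blank line that follows every other header in the template.

```python
# Minimal sketch of the instruct format as a string builder.
# build_llama3_prompt is a hypothetical helper, not part of any library.

def _block(role: str, content: str) -> str:
    """One turn: role header, blank line, content, end-of-turn token."""
    return f"<|start_header_id|>{role}<|end_header_id|>\n\n{content}<|eot_id|>"


def build_llama3_prompt(system_prompt: str, turns: list[tuple[str, str]]) -> str:
    """Build a prompt from a system prompt and (role, content) turns.

    The result ends with an open assistant header so the model
    generates the next reply.
    """
    prompt = "<|begin_of_text|>" + _block("system", system_prompt)
    for role, content in turns:
        if role not in ("user", "assistant"):
            raise ValueError(f"unexpected role: {role}")
        prompt += _block(role, content)
    # Open the assistant turn; the trailing blank line is an assumption.
    prompt += "<|start_header_id|>assistant<|end_header_id|>\n\n"
    return prompt


# The system prompt states that the model IS the character
# (invented example), rather than asking it to act as one.
system_prompt = "You are Mira, the navigator of the survey ship Heron."
prompt = build_llama3_prompt(system_prompt, [("user", "Where are we headed?")])
print(prompt)
```

Depending on the runtime, `<|begin_of_text|>` may be prepended automatically during tokenization. In that case, drop it from the string to avoid a doubled token.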
Quants:

FP16: https://huggingface.co/OwenArli/ArliAI-Llama-3-8B-Cumulus-v0.3.2

GGUF: https://huggingface.co/OwenArli/ArliAI-Llama-3-8B-Cumulus-v0.3.2-GGUF