---
license: other
language:
- en
pipeline_tag: text-generation
inference: false
tags:
- transformers
- gguf
- imatrix
- Eros_Scribe-7b
---
Quantizations of https://huggingface.co/OmnicromsBrain/Eros_Scribe-7b


### Open source inference clients/UIs
* [llama.cpp](https://github.com/ggerganov/llama.cpp)
* [KoboldCPP](https://github.com/LostRuins/koboldcpp)
* [ollama](https://github.com/ollama/ollama)
* [text-generation-webui](https://github.com/oobabooga/text-generation-webui)
* [jan](https://github.com/janhq/jan)
* [GPT4All](https://github.com/nomic-ai/gpt4all)

### Closed source inference clients/UIs
* [LM Studio](https://lmstudio.ai/)
* [Backyard AI](https://backyard.ai/)
* More will be added...
---

# From the original README

This model was created for writing **NSFW Prose**, but it's also very good at **RP**.

Over a dozen models and at least 25 datasets were involved in this merge.
Eros_Scribe-7b is a merge of the following models:

* [OmnicromsBrain/EverythingBagel-DPO-7B](https://huggingface.co/OmnicromsBrain/EverythingBagel-DPO-7B)
  * jondurbin/bagel-dpo-7b-v0.5
  * SanjiWatsuki/Silicon-Maid-7B
    * chargoddard/loyal-piano-m7
    * NeverSleep/Noromaid-7b-v0.2
    * athirdpath/NSFW_DPO_vmgb-7b
    * xDAN-AI/xDAN-L1-Chat-RL-v1

* [OmnicromsBrain/ToppyCox-7B](https://huggingface.co/OmnicromsBrain/ToppyCox-7B)
  * N8Programs/Coxcomb
  * Undi95/Toppy-M-7B
    * openchat/openchat_3.5
    * NousResearch/Nous-Capybara-7B-V1.9
    * HuggingFaceH4/zephyr-7b-beta
    * Undi95/zephyr-7b-beta-pippa-sharegpt
    * Undi95/Nous-Capybara-7B-V1.9-120-Days
    * Undi95/openchat_3.5-LimaRP-13B
    * lemonilia/AshhLimaRP-Mistral-7B
    * mistralai/Mistral-7B-v0.1


BTW, the name was suggested by Mixtral 8x7B Instruct.


## 🧩 Configuration

```yaml
slices:
  - sources:
      - model: OmnicromsBrain/EverythingBagel-DPO-7B
        layer_range: [0, 32]
      - model: OmnicromsBrain/ToppyCox-7B
        layer_range: [0, 32]
merge_method: slerp
base_model: OmnicromsBrain/EverythingBagel-DPO-7B
parameters:
  t:
    - filter: self_attn
      value: [0, 0.5, 0.3, 0.7, 1]
    - filter: mlp
      value: [1, 0.5, 0.7, 0.3, 0]
    - value: 0.5
dtype: bfloat16
```
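
To give a feel for what this configuration asks for, here is a minimal, dependency-free sketch of spherical linear interpolation (slerp) and of spreading a five-value `t` list across 32 layers. It is not mergekit's actual code: the helper names `slerp` and `layer_t` are hypothetical, and the even, linear spacing of the `t` anchors over the layer range is an assumption about how the gradient is applied. The sketch also assumes `t = 0` yields the `base_model` weights and `t = 1` yields the other model's.

```python
import math


def slerp(t, v0, v1, eps=1e-8):
    """Spherically interpolate between two flat weight vectors (lists of floats)."""
    norm0 = math.sqrt(sum(x * x for x in v0))
    norm1 = math.sqrt(sum(x * x for x in v1))
    # Cosine of the angle between the vectors, clamped to the valid acos range.
    dot = sum(a * b for a, b in zip(v0, v1)) / max(norm0 * norm1, eps)
    dot = max(-1.0, min(1.0, dot))
    theta = math.acos(dot)
    # Nearly parallel vectors: fall back to plain linear interpolation.
    if abs(math.sin(theta)) < eps:
        return [(1 - t) * a + t * b for a, b in zip(v0, v1)]
    w0 = math.sin((1 - t) * theta) / math.sin(theta)
    w1 = math.sin(t * theta) / math.sin(theta)
    return [w0 * a + w1 * b for a, b in zip(v0, v1)]


def layer_t(anchors, layer, num_layers):
    """Linearly interpolate a list of anchor values across layer indices.

    Assumption: anchors are spaced evenly from the first layer to the last.
    """
    if num_layers == 1:
        return anchors[0]
    pos = layer / (num_layers - 1) * (len(anchors) - 1)
    lo = int(math.floor(pos))
    hi = min(lo + 1, len(anchors) - 1)
    frac = pos - lo
    return (1 - frac) * anchors[lo] + frac * anchors[hi]


# The self_attn and mlp schedules from the config above.
self_attn_t = [0, 0.5, 0.3, 0.7, 1]
mlp_t = [1, 0.5, 0.7, 0.3, 0]

for layer in (0, 8, 16, 24, 31):
    print(layer, layer_t(self_attn_t, layer, 32), layer_t(mlp_t, layer, 32))
```

Under these assumptions, the two schedules mirror each other: where the attention weights lean toward one parent model, the MLP weights lean toward the other, and any tensor matched by neither filter is blended at a flat `t = 0.5`.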

## 💻 Usage

```python
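# Install dependencies first (in a notebook cell, prefix the command with "!"):
#   pip install -qU transformers accelerate

import torch
import transformers
from transformers import AutoTokenizer

model = "OmnicromsBrain/Eros_Scribe-7b"
messages = [{"role": "user", "content": "What is a large language model?"}]

# Render the chat messages into a single prompt string using the model's chat template.
tokenizer = AutoTokenizer.from_pretrained(model)
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)

# Build a text-generation pipeline; device_map="auto" relies on accelerate for device placement.
pipeline = transformers.pipeline(
    "text-generation",
    model=model,
    torch_dtype=torch.float16,
    device_map="auto",
)

# Sample up to 256 new tokens and print the prompt plus the completion.
outputs = pipeline(prompt, max_new_tokens=256, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
print(outputs[0]["generated_text"])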
```