---
license: llama2
tags:
- llama
- pytorch
- chatbot
- storywriting
- generalist-model
---

# chronos-13b-v2

This is the FP16 PyTorch / HF version of **chronos-13b-v2**, based on the **LLaMA v2 Base** model.

Use this version only for further quantization or to run the model unquantized in FP16, provided you have enough VRAM.
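
As a rough guide to the VRAM this implies, the memory needed for the weights alone can be estimated with a back-of-the-envelope calculation. This is only a sketch: the figure of 13 billion parameters is an approximation taken from the model name, and activations, the KV cache, and framework overhead are not included, so real usage will be higher.

```python
# Rough estimate of the memory needed to hold the FP16 weights only.
# Assumption: ~13e9 parameters (approximated from the "13b" in the model name).
PARAMS = 13e9
BYTES_PER_PARAM_FP16 = 2  # FP16 stores each value in 2 bytes


def weight_memory_gib(num_params: float, bytes_per_param: float) -> float:
    """Return the size of the weights in GiB (1 GiB = 2**30 bytes)."""
    return num_params * bytes_per_param / 2**30


fp16_gib = weight_memory_gib(PARAMS, BYTES_PER_PARAM_FP16)
print(f"FP16 weights: ~{fp16_gib:.1f} GiB")  # prints "FP16 weights: ~24.2 GiB"
```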

This model is primarily focused on chat, roleplay, and storywriting, with good reasoning and logic.

Chronos can generate very long, coherent outputs, largely due to the human inputs it was trained on, and it supports a context length of up to 4096 tokens.

This model uses Alpaca formatting. For optimal performance, use this format to start the dialogue or story, and if you use a frontend like SillyTavern, ENABLE instruction mode:
```
### Instruction:
Your instruction or question here.
### Response:
```
Not using the format will make the model perform significantly worse than intended.
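
When calling the model from code rather than a frontend, a small helper can wrap the user's text in the template above. The sketch below is illustrative only: `build_prompt` and `generate` are hypothetical helper names, the repository id `elinas/chronos-13b-v2` is an assumption inferred from the links on this card, the trailing newline after `### Response:` is a choice made here, and `max_new_tokens=256` is a placeholder rather than a recommended setting. The `generate` function assumes `torch`, `transformers`, and `accelerate` are installed.

```python
ALPACA_TEMPLATE = "### Instruction:\n{instruction}\n### Response:\n"


def build_prompt(instruction: str) -> str:
    """Wrap an instruction in the Alpaca-style template shown on this card."""
    return ALPACA_TEMPLATE.format(instruction=instruction.strip())


def generate(instruction: str,
             model_id: str = "elinas/chronos-13b-v2",  # assumed repo id
             max_new_tokens: int = 256) -> str:
    """Load the FP16 model and generate a reply (requires enough VRAM)."""
    # Imported lazily so build_prompt works without these packages installed.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id, torch_dtype=torch.float16, device_map="auto"
    )
    inputs = tokenizer(build_prompt(instruction), return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Drop the prompt tokens so only the newly generated text is returned.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)


print(build_prompt("Write a short story about a lighthouse keeper."))
```

Only `build_prompt` runs when the script is executed; `generate` downloads and loads the full model, so call it only on a machine with sufficient memory.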

## Other Versions
[4bit GPTQ Quantized version](https://huggingface.co/elinas/chronos-13b-v2-GPTQ)

[GGML Versions provided by @TheBloke](https://huggingface.co/TheBloke/Chronos-13B-v2-GGML)

**Support My Development of New Models**
<a href='https://ko-fi.com/Q5Q6MB734' target='_blank'><img height='36' style='border:0px;height:36px;'
src='https://storage.ko-fi.com/cdn/kofi1.png?v=3' border='0' alt='Support Development' /></a>