初始化项目,由ModelHub XC社区提供模型
Model: duyntnet/Llama-2-7b-chat-hf-imatrix-GGUF Source: Original Platform
This commit is contained in:
21
README.md
Normal file
21
README.md
Normal file
@@ -0,0 +1,21 @@
|
||||
---
|
||||
license: other
|
||||
language:
|
||||
- en
|
||||
pipeline_tag: text-generation
|
||||
inference: false
|
||||
tags:
|
||||
- transformers
|
||||
- gguf
|
||||
- imatrix
|
||||
- Llama-2-7b-chat-hf
|
||||
---
|
||||
Quantizations of https://huggingface.co/meta-llama/Llama-2-7b-chat-hf
|
||||
|
||||
|
||||
# From original readme
|
||||
|
||||
## Intended Use
|
||||
**Intended Use Cases** Llama 2 is intended for commercial and research use in English. Tuned models are intended for assistant-like chat, whereas pretrained models can be adapted for a variety of natural language generation tasks.
|
||||
|
||||
To get the expected features and performance for the chat versions, a specific formatting needs to be followed, including the `INST` and `<<SYS>>` tags, `BOS` and `EOS` tokens, and the whitespaces and breaklines in between (we recommend calling `strip()` on inputs to avoid double-spaces). See our reference code in github for details: [`chat_completion`](https://github.com/facebookresearch/llama/blob/main/llama/generation.py#L212).
|
||||
Reference in New Issue
Block a user