初始化项目，由ModelHub XC社区提供模型

Model: duyntnet/Llama-2-7b-chat-hf-imatrix-GGUF Source: Original Platform
2026-06-17 16:17:16 +08:00
commit 02dcd49140
29 changed files with 164 additions and 0 deletions
--- a/README.md
+++ b/README.md
@@ -0,0 +1,21 @@
+---
+license: other
+language:
+- en
+pipeline_tag: text-generation
+inference: false
+tags:
+- transformers
+- gguf
+- imatrix
+- Llama-2-7b-chat-hf
+---
+Quantizations of https://huggingface.co/meta-llama/Llama-2-7b-chat-hf
+
+
+# From original readme
+
+## Intended Use
+**Intended Use Cases** Llama 2 is intended for commercial and research use in English. Tuned models are intended for assistant-like chat, whereas pretrained models can be adapted for a variety of natural language generation tasks.
+
+To get the expected features and performance for the chat versions, a specific formatting needs to be followed, including the `INST` and `<<SYS>>` tags, `BOS` and `EOS` tokens, and the whitespaces and breaklines in between (we recommend calling `strip()` on inputs to avoid double-spaces). See our reference code in github for details: [`chat_completion`](https://github.com/facebookresearch/llama/blob/main/llama/generation.py#L212).