ModelHub XC 02dcd49140 Initialize project; model provided by the ModelHub XC community
Model: duyntnet/Llama-2-7b-chat-hf-imatrix-GGUF
Source: Original Platform
2026-06-17 16:17:16 +08:00

license: other
language: en
pipeline_tag: text-generation
inference: false
tags: transformers, gguf, imatrix, Llama-2-7b-chat-hf

Quantizations of https://huggingface.co/meta-llama/Llama-2-7b-chat-hf

From original readme

Intended Use

Intended Use Cases: Llama 2 is intended for commercial and research use in English. Tuned models are intended for assistant-like chat, whereas pretrained models can be adapted for a variety of natural language generation tasks.

To get the expected features and performance from the chat versions, specific formatting must be followed, including the INST and <<SYS>> tags, the BOS and EOS tokens, and the whitespace and line breaks in between (we recommend calling strip() on inputs to avoid double spaces). See our reference code on GitHub for details: chat_completion.
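The formatting described above can be sketched in Python as follows. This is a minimal illustration, not the reference chat_completion code: the function name build_llama2_prompt and its parameters are hypothetical, and BOS and EOS are written as the literal strings "<s>" and "</s>" on the assumption that the prompt is passed as raw text. Many runtimes insert the BOS token themselves, in which case the leading "<s>" would likely need to be dropped.

```python
# Tag strings as used by the Llama 2 chat format.
B_INST, E_INST = "[INST]", "[/INST]"
B_SYS, E_SYS = "<<SYS>>\n", "\n<</SYS>>\n\n"
# Assumption: BOS/EOS rendered as text; a tokenizer may add them as token ids instead.
BOS, EOS = "<s>", "</s>"


def build_llama2_prompt(user_messages, assistant_replies, system_prompt=None):
    """Build a Llama 2 chat prompt string (hypothetical helper).

    user_messages must contain exactly one more entry than assistant_replies:
    the last user message is the one the model should answer.
    """
    if len(user_messages) != len(assistant_replies) + 1:
        raise ValueError("expected one more user message than assistant replies")

    # strip() each input to avoid double spaces around the tags.
    users = [message.strip() for message in user_messages]

    # The system prompt is folded into the first user turn.
    if system_prompt is not None:
        users[0] = B_SYS + system_prompt.strip() + E_SYS + users[0]

    parts = []
    # Completed turns are wrapped in BOS ... EOS.
    for user, reply in zip(users, assistant_replies):
        parts.append(f"{BOS}{B_INST} {user} {E_INST} {reply.strip()} {EOS}")

    # The final user turn is left open so the model generates the reply.
    parts.append(f"{BOS}{B_INST} {users[-1]} {E_INST}")
    return "".join(parts)


if __name__ == "__main__":
    print(build_llama2_prompt(["  Hello  "], [], system_prompt="Be brief."))
```

The returned string can then be passed as the prompt to whichever GGUF runtime is in use; if that runtime applies a chat template on its own, manual formatting like this should not be needed.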
