ModelHub XC 403379031d Initialize project; model provided by the ModelHub XC community
Model: duyntnet/Yarn-Llama-2-7b-128k-imatrix-GGUF
Source: Original Platform
2026-06-17 16:32:17 +08:00

license: other
language: en
pipeline_tag: text-generation
inference: false
tags: transformers, gguf, imatrix, Yarn-Llama-2-7b-128k

Quantizations of https://huggingface.co/NousResearch/Yarn-Llama-2-7b-128k

From the original README:

Usage and Prompt Format

Install FA2 and Rotary Extensions:

pip install flash-attn --no-build-isolation
pip install git+https://github.com/HazyResearch/flash-attention.git#subdirectory=csrc/rotary
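
After running the two install commands, it can help to confirm that both packages are visible to the Python interpreter before loading the model. The snippet below is a minimal sketch using only the standard library. The module names `flash_attn` and `rotary_emb` are assumptions about what the two packages install, and the helper `check_modules` is hypothetical, not part of either project.

```python
import importlib.util


def check_modules(names):
    """Map each top-level module name to True if Python can locate it."""
    return {name: importlib.util.find_spec(name) is not None for name in names}


if __name__ == "__main__":
    # Assumed module names: flash_attn (FA2) and rotary_emb (rotary extension).
    for name, found in check_modules(["flash_attn", "rotary_emb"]).items():
        print(f"{name}: {'installed' if found else 'missing'}")
```

If either module is reported as missing, rerunning the corresponding `pip install` command in the same environment is a reasonable first step.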

There are no specific prompt formats as this is a pretrained base model.
