初始化项目，由ModelHub XC社区提供模型

Model: duyntnet/Yarn-Llama-2-7b-128k-imatrix-GGUF Source: Original Platform
2026-06-17 16:32:17 +08:00
commit 403379031d
29 changed files with 169 additions and 0 deletions
--- a/README.md
+++ b/README.md
@@ -0,0 +1,26 @@
+---
+license: other
+language:
+- en
+pipeline_tag: text-generation
+inference: false
+tags:
+- transformers
+- gguf
+- imatrix
+- Yarn-Llama-2-7b-128k
+---
+Quantizations of https://huggingface.co/NousResearch/Yarn-Llama-2-7b-128k
+
+
+# From original readme
+
+## Usage and Prompt Format
+
+Install FA2 and Rotary Extensions:
+```
+pip install flash-attn --no-build-isolation
+pip install git+https://github.com/HazyResearch/flash-attention.git#subdirectory=csrc/rotary
+```
+
+There are no specific prompt formats as this is a pretrained base model.