初始化项目，由ModelHub XC社区提供模型

Model: wang7776/Llama-2-7b-chat-hf-20-attention-sparsity Source: Original Platform
2026-05-06 06:37:31 +08:00
commit 5ed30f2353
12 changed files with 714 additions and 0 deletions
--- a/generation_config.json
+++ b/generation_config.json
@@ -0,0 +1,10 @@
+{
+  "bos_token_id": 1,
+  "do_sample": true,
+  "eos_token_id": 2,
+  "max_length": 4096,
+  "pad_token_id": 0,
+  "temperature": 0.6,
+  "top_p": 0.9,
+  "transformers_version": "4.36.1"
+}