初始化项目，由ModelHub XC社区提供模型

Model: shb777/Llama-3.3-8B-Instruct-128K Source: Original Platform
2026-05-02 06:39:18 +08:00
commit 4002f5d4d4
13 changed files with 2667 additions and 0 deletions
--- a/README.md
+++ b/README.md
@@ -0,0 +1,81 @@
+---
+license: llama3.3
+base_model:
+- allura-forge/Llama-3.3-8B-Instruct
+pipeline_tag: text-generation
+model-index:
+  - name: shb777/Llama-3.3-8B-Instruct-128K
+    results:
+      - task:
+          type: text-generation
+        dataset:
+          name: BBH
+          type: leaderboard
+        metrics:
+          - type: accuracy
+            value: 54.1
+            name: acc_norm
+      - task:
+          type: text-generation
+        dataset:
+          name: GPQA
+          type: leaderboard
+        metrics:
+          - type: accuracy
+            value: 29.9
+            name: acc_norm
+      - task:
+          type: text-generation
+        dataset:
+          name: MMLU Pro
+          type: leaderboard
+        metrics:
+          - type: accuracy
+            value: 38.0
+            name: acc
+      - task:
+          type: text-generation
+        dataset:
+          name: MuSR
+          type: leaderboard
+        metrics:
+          - type: accuracy
+            value: 37.8
+            name: acc_norm
+      - task:
+          type: text-generation
+        dataset:
+          name: IFEval
+          type: leaderboard
+        metrics:
+          - type: accuracy
+            value: 85.2
+            name: avg(prompt_strict + inst_strict)
+      - task:
+          type: text-generation
+        dataset:
+          name: MATH Hard
+          type: leaderboard
+        metrics:
+          - type: accuracy
+            value: 27.3
+            name: exact_match
+---
+
+# Llama 3.3 8B 128K Instruct (Fixed)
+
+> [!IMPORTANT]
+> Original [allura-forge/Llama-3.3-8B-Instruct](https://huggingface.co/allura-forge/Llama-3.3-8B-Instruct), Thanks!
+
+> [!TIP]
+> [imatrix GGUF's by mradermacher (Recommended)](https://huggingface.co/mradermacher/Llama-3.3-8B-Instruct-128K-i1-GGUF)
+> 
+> [static GGUF's](https://huggingface.co/shb777/Llama-3.3-8B-Instruct-128K-GGUF)
+> 
+> [Evals](https://huggingface.co/datasets/shb777/Llama-3.3-8B-Instruct-128K-Evals)
+
+Additional Fixes:
+- Added `rope_scaling`
+- Added chat template (Unsloth) in tokenizer config
+- Updated generation config
+- Enabled full context length