Initialize project; model provided by the ModelHub XC community

Model: shb777/Llama-3.3-8B-Instruct-128K
Source: Original Platform
ModelHub XC
2026-05-02 06:39:18 +08:00
commit 4002f5d4d4
13 changed files with 2667 additions and 0 deletions

README.md Normal file

@@ -0,0 +1,81 @@
---
license: llama3.3
base_model:
- allura-forge/Llama-3.3-8B-Instruct
pipeline_tag: text-generation
model-index:
- name: shb777/Llama-3.3-8B-Instruct-128K
  results:
  - task:
      type: text-generation
    dataset:
      name: BBH
      type: leaderboard
    metrics:
    - type: accuracy
      value: 54.1
      name: acc_norm
  - task:
      type: text-generation
    dataset:
      name: GPQA
      type: leaderboard
    metrics:
    - type: accuracy
      value: 29.9
      name: acc_norm
  - task:
      type: text-generation
    dataset:
      name: MMLU Pro
      type: leaderboard
    metrics:
    - type: accuracy
      value: 38.0
      name: acc
  - task:
      type: text-generation
    dataset:
      name: MuSR
      type: leaderboard
    metrics:
    - type: accuracy
      value: 37.8
      name: acc_norm
  - task:
      type: text-generation
    dataset:
      name: IFEval
      type: leaderboard
    metrics:
    - type: accuracy
      value: 85.2
      name: avg(prompt_strict + inst_strict)
  - task:
      type: text-generation
    dataset:
      name: MATH Hard
      type: leaderboard
    metrics:
    - type: accuracy
      value: 27.3
      name: exact_match
---
# Llama 3.3 8B 128K Instruct (Fixed)
> [!IMPORTANT]
> Based on the original [allura-forge/Llama-3.3-8B-Instruct](https://huggingface.co/allura-forge/Llama-3.3-8B-Instruct). Thanks!

> [!TIP]
> [imatrix GGUFs by mradermacher (recommended)](https://huggingface.co/mradermacher/Llama-3.3-8B-Instruct-128K-i1-GGUF)
>
> [Static GGUFs](https://huggingface.co/shb777/Llama-3.3-8B-Instruct-128K-GGUF)
>
> [Evals](https://huggingface.co/datasets/shb777/Llama-3.3-8B-Instruct-128K-Evals)

Additional Fixes:
- Added `rope_scaling`
- Added chat template (Unsloth) in tokenizer config
- Updated generation config
- Enabled full context length
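
The `rope_scaling` and full-context fixes in the list above are typically expressed as fields in the model's `config.json`. The sketch below shows roughly what such a patch could look like. The concrete numbers are assumptions borrowed from the publicly documented Llama 3.1 long-context configuration (`rope_type: llama3`, factor 8, original window of 8192 tokens) and are not confirmed by this README; the helper name `apply_long_context_fixes` is hypothetical. Check the repository's actual `config.json` for the values that were shipped.

```python
import copy
import json

# ASSUMPTION: values mirror the Llama 3.1 style rope_scaling block,
# not values confirmed by this model card.
ASSUMED_ROPE_SCALING = {
    "rope_type": "llama3",
    "factor": 8.0,
    "low_freq_factor": 1.0,
    "high_freq_factor": 4.0,
    "original_max_position_embeddings": 8192,
}

# ASSUMPTION: "128K" is read as 128 * 1024 tokens.
ASSUMED_CONTEXT_LENGTH = 128 * 1024


def apply_long_context_fixes(config: dict) -> dict:
    """Return a patched copy of a config.json-style dict (hypothetical helper).

    The input dict is left untouched so the caller can diff before and after.
    """
    patched = copy.deepcopy(config)
    patched["rope_scaling"] = dict(ASSUMED_ROPE_SCALING)
    patched["max_position_embeddings"] = ASSUMED_CONTEXT_LENGTH
    return patched


if __name__ == "__main__":
    # Illustrative starting point: a config limited to an 8K window.
    before = {"model_type": "llama", "max_position_embeddings": 8192}
    after = apply_long_context_fixes(before)
    print(json.dumps(after, indent=2))
```

Working on a plain dict keeps the sketch independent of any particular library version; in practice the same fields would be edited directly in `config.json` before loading the model.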