ModelHub XC 5daa9d3078 初始化项目,由ModelHub XC社区提供模型
Model: nm-testing/Llama-2-7b-pruned40-retrained
Source: Original Platform
2026-06-16 08:19:13 +08:00

base_model, datasets
base_model datasets
meta-llama/Llama-2-7b-hf
cerebras/SlimPajama-627B

Checkpoint of a Llama 2 7B model that has had 50% of the parameters pruned in one-shot with SparseGPT, then retrained for 40B tokens from SlimPajama while maintaining sparsity.

  • Model: Llama 2
  • Size: 7B
  • LR: 3.00E-4
  • Dataset: SlimPajama
  • Retrained tokens: 40B
  • Notes: no warmup + decay to 0.0
  • Eval Harness:
    • CommonSense Reasoning: 62.2 (97.65%)
    • Reading Comprehension: 57.7 (98.30%)
    • World Knowledge: 42.4 (97.65%)
    • Math: 6.1 (74.39%)
    • Code: 16.2 (98.78%)
Description
Model synced from source: nm-testing/Llama-2-7b-pruned40-retrained
Readme 586 KiB