ModelHub XC a26a31c63d 初始化项目,由ModelHub XC社区提供模型
Model: pmahdavi/Llama-3.1-8B-math-reasoning
Source: Original Platform
2026-06-12 02:07:16 +08:00

language, license, library_name, tags, pipeline_tag, model-index, base_model
language license library_name tags pipeline_tag model-index base_model
en
cc-by-nc-4.0 transformers
llama
math
reasoning
fine-tuned
fine-tuning
text-generation
name results
Llama-3.1-8B-math-reasoning
task dataset metrics
type name
text-generation Text Generation
name type
tulu3_mixture_math_reasoning custom
name type value
Training Loss loss 0.98
meta-llama/Llama-3.1-8B

Llama-3.1-8B Math Reasoning Model

Llama-3.1-8B SFT checkpoints for mathematical reasoning—artifacts of https://arxiv.org/abs/2509.11167.

Model Details

  • Base model: Llama-3.1-8B
  • Training dataset: tulu3_mixture_math_reasoning
  • Learning rate: 5e-06
  • Effective batch size: 128

Export Files

This repository includes export files for state averaging and other advanced techniques.

Description
Model synced from source: pmahdavi/Llama-3.1-8B-math-reasoning
Readme 16 MiB
Languages
Python 100%