
ModelHub XC a26a31c63d Initialize project; model provided by the ModelHub XC community

Model: pmahdavi/Llama-3.1-8B-math-reasoning
Source: Original Platform

2026-06-12 02:07:16 +08:00

language:
license: cc-by-nc-4.0
library_name: transformers
tags: llama, math, reasoning, fine-tuned, fine-tuning
pipeline_tag: text-generation
model-index:
  name: Llama-3.1-8B-math-reasoning
  results:
    task:
      type: text-generation
      name: Text Generation
    dataset:
      name: tulu3_mixture_math_reasoning
      type: custom
    metrics:
      name: Training Loss
      type: loss
      value: 0.98
base_model: meta-llama/Llama-3.1-8B

Llama-3.1-8B Math Reasoning Model

Llama-3.1-8B SFT (supervised fine-tuning) checkpoints for mathematical reasoning, released as artifacts of https://arxiv.org/abs/2509.11167.
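Since the card lists transformers as its library and text-generation as its pipeline, a checkpoint can presumably be loaded the usual way. The following is a minimal sketch, not a documented recipe: it assumes the repository root holds a standard Transformers checkpoint, and the `build_prompt` helper and its "Question/Answer" layout are hypothetical, since the card does not specify a prompt template.

```python
MODEL_ID = "pmahdavi/Llama-3.1-8B-math-reasoning"  # repository name from this card


def build_prompt(question: str) -> str:
    """Hypothetical plain-text prompt; the card does not specify a template."""
    return f"Question: {question.strip()}\nAnswer:"


def generate_answer(question: str, max_new_tokens: int = 256) -> str:
    """Load the checkpoint and generate a continuation for one question.

    Assumes the repository root is loadable with from_pretrained.
    """
    # Imported lazily so build_prompt works without the heavy dependencies.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID, torch_dtype=torch.bfloat16)

    inputs = tokenizer(build_prompt(question), return_tensors="pt")
    with torch.no_grad():
        output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Drop the prompt tokens and decode only the newly generated continuation.
    new_tokens = output_ids[0, inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling `generate_answer("What is 12 * 7?")` downloads the weights on first use, so it needs enough disk space and memory for an 8B-parameter model. If the tokenizer ships a chat template, that may be a better choice than the plain prompt used here.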

Model Details

This repository includes export files for state averaging and other advanced techniques.
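The card does not say what "state averaging" operates on or how the export files are laid out. As an illustration only, the sketch below assumes the simplest reading: a uniform element-wise average of several state dictionaries that share the same keys. The function name `average_states` is hypothetical, and the paper's actual procedure may differ.

```python
from typing import Any, Mapping, Sequence


def average_states(states: Sequence[Mapping[str, Any]]) -> dict:
    """Uniformly average state dictionaries that share the same keys.

    Values may be anything supporting `+` and `/`, for example floats,
    NumPy arrays, or floating-point torch tensors. This is an assumed
    interpretation of "state averaging", not the repository's method.
    """
    if not states:
        raise ValueError("need at least one state dictionary")

    keys = states[0].keys()
    for state in states[1:]:
        if state.keys() != keys:
            raise ValueError("all state dictionaries must have the same keys")

    count = len(states)
    averaged = {}
    for key in keys:
        total = states[0][key]
        for state in states[1:]:
            total = total + state[key]  # out-of-place, so inputs stay unchanged
        averaged[key] = total / count
    return averaged
```

With real checkpoints, each entry of `states` would be a loaded state dictionary (for instance, from `model.state_dict()`); the toy call `average_states([{"w": 1.0}, {"w": 3.0}])` yields `{"w": 2.0}`.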