a26a31c63de94443ab836104cad13ca478254b18
Model: pmahdavi/Llama-3.1-8B-math-reasoning Source: Original Platform
language, license, library_name, tags, pipeline_tag, model-index, base_model
| language | license | library_name | tags | pipeline_tag | model-index | base_model | |||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
cc-by-nc-4.0 | transformers |
|
text-generation |
|
meta-llama/Llama-3.1-8B |
Llama-3.1-8B Math Reasoning Model
Llama-3.1-8B SFT checkpoints for mathematical reasoning—artifacts of https://arxiv.org/abs/2509.11167.
Model Details
- Base model: Llama-3.1-8B
- Training dataset: tulu3_mixture_math_reasoning
- Learning rate: 5e-06
- Effective batch size: 128
Export Files
This repository includes export files for state averaging and other advanced techniques.
Description
Languages
Python
100%