Files
TinyLlama-1.1B-1.5T-OpenOrc…/README.md
ModelHub XC 9d3aa37152 初始化项目,由ModelHub XC社区提供模型
Model: jeff31415/TinyLlama-1.1B-1.5T-OpenOrca-Alpha
Source: Original Platform
2026-05-17 01:30:17 +08:00

1.2 KiB

license, datasets, language
license datasets language
apache-2.0
Open-Orca/OpenOrca
bigcode/starcoderdata
cerebras/SlimPajama-627B
en

Built with Axolotl

Base model:

https://huggingface.co/TinyLlama/tinyLlama-intermediate-checkpoints/tree/step-720k-token-1510B This fine tune was done on the "early" version of tinyllama-1.5T which suffers from a bug in dataset processing. See https://github.com/jzhang38/TinyLlama/issues/67. Through it suffers from the glitch, its performance seems not being damaged and still showing improvement(metrics needed)

Dataset:

Fine tuned on OpenOrca GPT4 subset for 1 epoch,Using CHATML format

Model License:

Apache 2.0, following the TinyLlama base model.

Quantisation:

GGUF format:https://huggingface.co/s3nh/jeff31415-TinyLlama-1.1B-1.5T-OpenOrca-Alpha-GGUF

Hardware and training details:

Hardware: 1*RTX A5000, ~16 hours to complete 1 epoch. GPU from autodl.com, cost around $3 for this finetuning. https://wandb.ai/jeff200402/TinyLlama-1.5T-alpha-Orca?workspace= for more details.