Files
sft__ot30k_SmolLM2-1.7B-16k…/all_results.json

12 lines
335 B
JSON
Raw Normal View History

{
"epoch": 5.0,
"loss_nan_ranks": 0,
"loss_rank_avg": 0.49867311120033264,
"total_flos": 1367557351931904.0,
"train_loss": 1.0911384893985505,
"train_runtime": 4706.648,
"train_samples_per_second": 31.87,
"train_steps_per_second": 0.25,
"valid_targets_mean": 13716.8,
"valid_targets_min": 3353
}