Files
smollm2-135M-SFT-Only/train_results.json

9 lines
232 B
JSON
Raw Normal View History

{
"epoch": 2.0,
"total_flos": 173253108695040.0,
"train_loss": 1.2838877389321521,
"train_runtime": 2377.8575,
"train_samples": 456544,
"train_samples_per_second": 42.195,
"train_steps_per_second": 0.33
}