Files
SmolLM2-1.7B-16k-SFT-Tulu3-…/train_results.json

8 lines
216 B
JSON
Raw Normal View History

{
"total_flos": 3199489317601280.0,
"train_loss": 0.9069533730784906,
"train_runtime": 41137.9828,
"train_samples": 936509,
"train_samples_per_second": 45.53,
"train_steps_per_second": 0.356
}