openthoughts3_100k_qwen25_1…/train_results.json

{
  "epoch": 4.987212276214834,
  "total_flos": 7100358600687616.0,
  "train_loss": 1.0860105241261995,
  "train_runtime": 35157.9071,
  "train_samples_per_second": 14.222,
  "train_steps_per_second": 0.028
}
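
The fields above can be cross-checked against each other with a little arithmetic. The sketch below is illustrative only: it assumes `train_runtime` is in seconds and that the two throughput fields are averages over the whole run, neither of which the file itself states. The helper name `derive` and the output keys are hypothetical. Because the rates are rounded in the file (`0.028` has only two significant figures), the derived step count and samples-per-step figures are rough estimates.

```python
import json

# Values copied verbatim from train_results.json above.
RAW = """
{
  "epoch": 4.987212276214834,
  "total_flos": 7100358600687616.0,
  "train_loss": 1.0860105241261995,
  "train_runtime": 35157.9071,
  "train_samples_per_second": 14.222,
  "train_steps_per_second": 0.028
}
"""


def derive(results: dict) -> dict:
    """Derive rough run-level quantities from the reported metrics.

    Assumes train_runtime is in seconds and the per-second rates are
    whole-run averages (assumptions, not stated in the file).
    """
    runtime = results["train_runtime"]
    samples_seen = runtime * results["train_samples_per_second"]
    return {
        "runtime_hours": runtime / 3600.0,
        "approx_samples_seen": samples_seen,
        "approx_samples_per_epoch": samples_seen / results["epoch"],
        "approx_steps": runtime * results["train_steps_per_second"],
        "approx_samples_per_step": (
            results["train_samples_per_second"]
            / results["train_steps_per_second"]
        ),
    }


results = json.loads(RAW)
derived = derive(results)

if __name__ == "__main__":
    for name, value in derived.items():
        print(f"{name}: {value:,.2f}")
```

Under these assumptions the run lasted a little under ten hours and saw roughly 500,000 samples in total, or about 100,000 per epoch, which is consistent with the `100k` in the directory name.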