Files
sft-conta-qwen2.5-7b-no-rl/all_results.json

8 lines
205 B
JSON
Raw Permalink Normal View History

{
"epoch": 5.0,
"total_flos": 3.54463086751976e+19,
"train_loss": 0.865001671439723,
"train_runtime": 46695.3235,
"train_samples_per_second": 1.271,
"train_steps_per_second": 0.01
}