llama3.2-1b-deita-dpo-stude…/train_results.json
ModelHub XC 9e60c86959 Initialize project; model provided by the ModelHub XC community
Model: phanviethoang1512/llama3.2-1b-deita-dpo-student_sft_init
Source: Original Platform
2026-04-22 19:47:57 +08:00

{
    "epoch": 3.0,
    "total_flos": 90953314467840.0,
    "train_loss": 0.9790556504583052,
    "train_runtime": 12705.6497,
    "train_samples": 9500,
    "train_samples_per_second": 8.618,
    "train_steps_per_second": 0.135
}
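
The reported rates can be turned into a few derived quantities, which may help when sanity-checking the run. The following is a minimal sketch, not part of the original repository. It assumes `train_runtime` is in seconds and that the two per-second rates are averages over that runtime, as the field names suggest. The helper `derive_throughput` and its output keys are hypothetical names chosen for illustration. Because the rates in the file are rounded to three decimals, the derived values are approximate.

```python
import json

# Contents of train_results.json, copied verbatim from the file above.
RESULTS_JSON = """
{
    "epoch": 3.0,
    "total_flos": 90953314467840.0,
    "train_loss": 0.9790556504583052,
    "train_runtime": 12705.6497,
    "train_samples": 9500,
    "train_samples_per_second": 8.618,
    "train_steps_per_second": 0.135
}
"""


def derive_throughput(results: dict) -> dict:
    """Derive approximate totals from the reported rates.

    Assumption: train_runtime is in seconds and both rates are
    averages over that whole runtime.
    """
    runtime_s = results["train_runtime"]
    samples_per_s = results["train_samples_per_second"]
    steps_per_s = results["train_steps_per_second"]
    return {
        "runtime_hours": runtime_s / 3600.0,
        # rate * time gives an estimate of the total count
        "approx_total_steps": steps_per_s * runtime_s,
        "approx_samples_processed": samples_per_s * runtime_s,
        # ratio of the two rates: average samples per optimizer step
        "approx_samples_per_step": samples_per_s / steps_per_s,
    }


results = json.loads(RESULTS_JSON)
derived = derive_throughput(results)
for name, value in derived.items():
    print(f"{name}: {value:.2f}")
```

The samples-processed estimate need not equal `train_samples` times `epoch`; the file does not state how `train_samples` was counted, so treat that comparison with care.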