Files
P9-split4_only_answer_Qwen3…/all_results.json
ModelHub XC b28d78bc1c 初始化项目,由ModelHub XC社区提供模型
Model: Hyeongwon/P9-split4_only_answer_Qwen3-4B-Base_0402-01-5e-6
Source: Original Platform
2026-04-18 19:34:56 +08:00

9 lines
231 B
JSON

{
"epoch": 6.0,
"total_flos": 509990855180288.0,
"train_loss": 0.558762426631948,
"train_runtime": 70510.0264,
"train_samples": 40998,
"train_samples_per_second": 3.489,
"train_steps_per_second": 0.007
}