Files
Qwen1.5-MOE-sft-gsm/train_results.json
ModelHub XC 87d94517b0 初始化项目,由ModelHub XC社区提供模型
Model: jayzou3773/Qwen1.5-MOE-sft-gsm
Source: Original Platform
2026-05-26 22:22:22 +08:00

8 lines
217 B
JSON

{
"total_flos": 3.6134539308407194e+17,
"train_loss": 0.15647654232178998,
"train_runtime": 1906.4028,
"train_samples": 7473,
"train_samples_per_second": 7.84,
"train_steps_per_second": 0.245
}