Baguettotron-longsft_16k-SF…/train_results.json
ModelHub XC 9b1711112b Initialize project; model provided by the ModelHub XC community
Model: ali-elganzory/Baguettotron-longsft_16k-SFT-Tulu3-decontaminated
Source: Original Platform
2026-04-28 08:30:06 +08:00

8 lines
217 B
JSON

{
    "total_flos": 2710644215513088.0,
    "train_loss": 1.6633207928015101,
    "train_runtime": 25736.6854,
    "train_samples": 936509,
    "train_samples_per_second": 72.776,
    "train_steps_per_second": 0.569
}
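
The throughput fields can be cross-checked against each other with a little arithmetic. The sketch below is not part of the repository; it assumes the file follows the common convention in which `train_runtime` is in seconds and the per-second rates are averages over the whole run. The helper name `derive` and every derived key are hypothetical, and the derived numbers are inferences from the reported values, not figures stated by the model authors.

```python
import json

# Metrics copied verbatim from train_results.json above.
RAW = """
{
    "total_flos": 2710644215513088.0,
    "train_loss": 1.6633207928015101,
    "train_runtime": 25736.6854,
    "train_samples": 936509,
    "train_samples_per_second": 72.776,
    "train_steps_per_second": 0.569
}
"""


def derive(m):
    """Return quantities implied by the reported rates (hypothetical helper).

    Assumes train_runtime is in seconds and that both *_per_second
    fields are averages over the full run.
    """
    runtime = m["train_runtime"]
    samples_processed = m["train_samples_per_second"] * runtime
    steps = m["train_steps_per_second"] * runtime
    return {
        "runtime_hours": runtime / 3600,
        "samples_processed": samples_processed,
        # Passes over the dataset implied by the sample throughput.
        "approx_epochs": samples_processed / m["train_samples"],
        "approx_steps": steps,
        # Ratio of the two rates: samples consumed per optimizer step.
        "approx_samples_per_step": (
            m["train_samples_per_second"] / m["train_steps_per_second"]
        ),
    }


metrics = json.loads(RAW)
derived = derive(metrics)

for name, value in derived.items():
    print(f"{name}: {value:.3f}")
```

Under those assumptions the figures are mutually consistent with a run of a little over seven hours that made about two passes over the 936,509 samples at roughly 128 samples per step. Because the reported rates are rounded to three decimals, the derived values are approximate.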