Initialize project; model provided by the ModelHub XC community

Model: Cisco1963/llmplasticity-en_zh_linear_0.125_1-seed42
Source: Original Platform
ModelHub XC
2026-05-01 06:20:27 +08:00
commit ee5e46d8ff
158 changed files with 4508267 additions and 0 deletions

train_results.json (new file, 9 lines added)

@@ -0,0 +1,9 @@
{
    "epoch": 1.0,
    "total_flos": 3.56955225587712e+17,
    "train_loss": 57.429637563704375,
    "train_runtime": 52919.1561,
    "train_samples": 683058,
    "train_samples_per_second": 12.908,
    "train_steps_per_second": 0.101
}
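
As a sanity check on the reported throughput, the short Python sketch below recomputes `train_samples_per_second` from `train_samples` and `train_runtime`, and estimates how many samples were consumed per optimizer step. The variable names, and the reading of that ratio as an effective batch size, are assumptions made for illustration; the run's actual batch size, gradient accumulation, and device count are not recorded in this file.

```python
import json

# Metrics copied verbatim from train_results.json in this commit.
RESULTS_JSON = """
{
    "epoch": 1.0,
    "total_flos": 3.56955225587712e+17,
    "train_loss": 57.429637563704375,
    "train_runtime": 52919.1561,
    "train_samples": 683058,
    "train_samples_per_second": 12.908,
    "train_steps_per_second": 0.101
}
"""

results = json.loads(RESULTS_JSON)

# Recompute throughput; this matches the reported value only if
# train_runtime is measured in seconds (assumption checked by the ratio).
samples_per_second = results["train_samples"] / results["train_runtime"]

# Assumption: samples/s divided by steps/s approximates the effective
# number of samples per optimizer step. The reported steps/s is rounded
# to three decimals, so this is only an estimate.
implied_samples_per_step = (
    results["train_samples_per_second"] / results["train_steps_per_second"]
)

# Rough step count and wall-clock time, derived from the same rounded values.
approx_total_steps = results["train_steps_per_second"] * results["train_runtime"]
runtime_hours = results["train_runtime"] / 3600

print(f"recomputed samples/s:     {samples_per_second:.3f}")
print(f"implied samples per step: {implied_samples_per_step:.1f}")
print(f"approx. total steps:      {approx_total_steps:.0f}")
print(f"runtime (hours):          {runtime_hours:.2f}")
```

The recomputed throughput agrees with the reported `train_samples_per_second` to three decimals, which suggests `train_runtime` is in seconds. The implied samples-per-step ratio comes out close to 128, but because `train_steps_per_second` is rounded, it should be treated as an estimate rather than a confirmed batch size.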