Model synced from source: CompassioninMachineLearning/PretrainingBasellama3kv3_plus3kcodingGRPO1epoch
Updated 2026-06-10 15:50:24 +08:00
Model synced from source: CompassioninMachineLearning/pretrainingBasellama3kv3
Updated 2026-06-09 07:43:18 +08:00
Model synced from source: CompassioninMachineLearning/PretrainingBasellama3kv3_plus3khelpfullnessGRPO1epoch
Updated 2026-05-07 08:39:48 +08:00