Model synced from source: xw1234gan/cnk12_Main_fixed_SFTanchor_1_5B_step_8
Updated 2026-04-29 06:05:50 +08:00
Model synced from source: xw1234gan/Main_fixed_MATH_1_5B_BaseAnchor_step_2
Updated 2026-04-28 06:24:10 +08:00
Model synced from source: xw1234gan/Main_fixed_MATH_1_5B_BaseAnchor_step_1
Updated 2026-04-28 05:47:13 +08:00
Model synced from source: xw1234gan/Main_fixed_MATH_1_5B_BaseAnchor_step_3
Updated 2026-04-28 05:14:13 +08:00
Model synced from source: xw1234gan/cnk12_Main_fixed_BaseAnchor_3B_step_3
Updated 2026-04-28 02:30:04 +08:00
Model synced from source: xw1234gan/SFT_Qwen2.5-7B-Instruct_MedQA
Updated 2026-04-28 02:27:08 +08:00
Model synced from source: xw1234gan/cnk12_Main_fixed_BaseAnchor_3B_step_4
Updated 2026-04-28 01:38:09 +08:00
Model synced from source: xw1234gan/NuminaMath_Main_fixed_SFTanchor_1_5B_step_3
Updated 2026-04-28 01:16:40 +08:00
Model synced from source: xw1234gan/cnk12_GRPO_KL_Qwen2.5-1.5B-Instruct_beta0.01_lr1e-05_mb2_ga128_n2048_seed42
Updated 2026-04-28 00:51:16 +08:00
Model synced from source: xw1234gan/cnk12_Main_fixed_SFTanchor_1_5B_step_5
Updated 2026-04-28 00:38:15 +08:00
Model synced from source: xw1234gan/cnk12_Main_fixed_SFTanchor_1_5B_step_1
Updated 2026-04-28 00:25:29 +08:00
Model synced from source: xw1234gan/cnk12_Main_fixed_SFTanchor_1_5B_step_2
Updated 2026-04-28 00:25:20 +08:00
Model synced from source: xw1234gan/cnk12_Main_fixed_SFTanchor_1_5B_step_3
Updated 2026-04-27 23:37:17 +08:00
Model synced from source: xw1234gan/NuminaMath_Main_fixed_SFTanchor_1_5B_step_2
Updated 2026-04-27 14:24:57 +08:00
Model synced from source: xw1234gan/GRPO_KL_Qwen2.5-3B-Instruct_MedQA_beta0.01_lr1e-05_mb2_ga128_n2048_seed42_HF_GEN
Updated 2026-04-27 13:32:06 +08:00