Model synced from source: xw1234gan/SFT_Qwen2.5-1.5B-Instruct_cnk12
Updated 2026-04-27 12:00:04 +08:00
Model synced from source: xw1234gan/Main_fixed_MATH_1_5B_BaseAnchor_step_4
Updated 2026-04-27 11:40:47 +08:00
Model synced from source: xw1234gan/SMOKE_GRPO_KL_Qwen2.5-7B-Instruct_MATH_beta0_lr1e-05_mb2_ga4_n16_seed42_HF_GEN
Updated 2026-04-27 05:12:07 +08:00
Model synced from source: xw1234gan/Main_fixed_MATH_1_5B_BaseAnchor_step_9
Updated 2026-04-27 00:49:04 +08:00
Model synced from source: xw1234gan/cnk12_Main_fixed_SFTanchor_1_5B_step_6
Updated 2026-04-26 19:17:13 +08:00
Model synced from source: xw1234gan/Main_fixed_MATH_1_5B_BaseAnchor_step_5
Updated 2026-04-26 18:10:19 +08:00
Model synced from source: xw1234gan/GRPO_KL_Qwen2.5-1.5B-Instruct_MATH_beta0.01_lr1e-05_mb2_ga128_n2048_seed42_HF_GEN
Updated 2026-04-25 16:11:05 +08:00
Model synced from source: xw1234gan/GRPO_KL_Qwen2.5-7B-Instruct_MATH_beta0.01_lr1e-05_mb2_ga128_n2048_seed42_HF_GEN
Updated 2026-04-25 14:16:07 +08:00
Model synced from source: xw1234gan/Main_fixed_MATH_7B_step_4
Updated 2026-04-25 13:18:39 +08:00
Model synced from source: xw1234gan/Main_fixed_MATH_7B_step_5
Updated 2026-04-25 12:20:07 +08:00
Model synced from source: xw1234gan/Main_fixed_MATH_7B_step_6
Updated 2026-04-25 11:36:04 +08:00
Model synced from source: xw1234gan/Main_fixed02_MATH_3B_step_2
Updated 2026-04-24 22:43:41 +08:00
Model synced from source: xw1234gan/Main_fixed02_MATH_3B_step_1
Updated 2026-04-23 17:30:20 +08:00
Model synced from source: xw1234gan/SFT_Qwen2.5-7B-Instruct_MMLU
Updated 2026-04-23 14:41:11 +08:00
Model synced from source: xw1234gan/Main_fixed02_MATH_3B_step_6
Updated 2026-04-22 17:08:50 +08:00