Model synced from source: ccui46/cookingworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_tformerPin_3000
Updated 2026-05-30 23:56:26 +08:00
Model synced from source: ccui46/cookingworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_tformerPin_4000
Updated 2026-05-30 23:45:29 +08:00
Model synced from source: ccui46/cookingworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_tformerPin_500
Updated 2026-05-30 23:45:28 +08:00
Model synced from source: ccui46/cookingworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_tformerPin_1000
Updated 2026-05-30 23:43:29 +08:00
Model synced from source: ccui46/cookingworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_tformerPin_2500
Updated 2026-05-30 23:43:28 +08:00
Model synced from source: prithivMLmods/Panacea-MegaScience-Qwen3-1.7B
Updated 2026-05-30 23:08:18 +08:00
Model synced from source: adityawakharkar/AstraGPT-7B
Updated 2026-05-30 22:58:33 +08:00
Model synced from source: TitleOS/Phi-4-mini-reasoning-heretic
Updated 2026-05-30 22:50:29 +08:00
Model synced from source: RJTPP/scot0500s-qwen3-8b-full
Updated 2026-05-30 22:43:26 +08:00
Model synced from source: ReviewHub/qwen3-4b-it-2507-sft-2018-2022-rl-step-10
Updated 2026-05-30 22:37:28 +08:00
Model synced from source: ReviewHub/qwen3-4b-it-2507-sft-2018-2022-rl-step-20
Updated 2026-05-30 22:37:25 +08:00
Model synced from source: RJTPP/scot0500s-deepseek-8b-full
Updated 2026-05-30 22:32:23 +08:00
Model synced from source: MCult01/muse-deepseek7b-v1
Updated 2026-05-30 22:31:29 +08:00
Model synced from source: DCAgent/g1_weighted_31600_8b_orig
Updated 2026-05-30 22:23:25 +08:00
Model synced from source: DCAgent/g1_timeout_e1_gpt_long
Updated 2026-05-30 22:23:24 +08:00
Model synced from source: DCAgent/g1_weighted_31600_gradnorm01
Updated 2026-05-30 22:11:29 +08:00
Model synced from source: DCAgent/g1_min_episodes_e1_gpt_long_2x_tacc-Qwen3-8B
Updated 2026-05-30 22:10:26 +08:00
Model synced from source: DCAgent/g1_weighted_31600_8b_v2
Updated 2026-05-30 22:10:25 +08:00
Model synced from source: HCY123902/qwen25_7b_base_hc_stss_n32_r1_sft
Updated 2026-05-30 22:10:25 +08:00
Model synced from source: MInAlA/Llama-3.2-3B-Instruct-GRPO-merged
Updated 2026-05-30 21:58:20 +08:00