Model synced from source: hmdmahdavi/olympiad-curated-qwen3-4b-thinking-distill-30b-5ep-ablation
Updated 2026-06-04 13:13:59 +08:00
Model synced from source: bknyaz/Qwen3-0.6B-Math
Updated 2026-06-04 13:11:59 +08:00
Model synced from source: hyunseoki/verl-math-transfer-7bi-to-3bi-fix05-pool7to1
Updated 2026-06-04 13:01:05 +08:00
Model synced from source: hiro7ka/dpo-qwen-cot-merged-ver3
Updated 2026-06-04 13:00:52 +08:00
Model synced from source: intuit/agent-tool-optimizer
Updated 2026-06-04 12:58:01 +08:00
Model synced from source: cjiao/goldengoose-gumbel_tau0.10-25grp
Updated 2026-06-04 12:57:09 +08:00
Model synced from source: cjiao/goldengoose-gumbel_tau1.00-25grp
Updated 2026-06-04 12:56:42 +08:00
Model synced from source: jwhisenhunt/hello2
Updated 2026-06-04 12:53:48 +08:00
Model synced from source: helloworldabc/dpo-qwen-cot-merged
Updated 2026-06-04 12:53:37 +08:00
Model synced from source: Hyeongwon/P2_prob_Qwen3-4B-Base_0311-01
Updated 2026-06-04 12:46:14 +08:00
Model synced from source: SpaceTimee/Suri-Qwen-3.1-4B-Uncensored-Preview
Updated 2026-06-04 12:39:38 +08:00
Model synced from source: hamdanbinhashim/NosirAI-Mini
Updated 2026-06-04 12:34:23 +08:00
Model synced from source: OpenHands/CodeScout-4B
Updated 2026-06-04 12:33:34 +08:00
Model synced from source: LorenaYannnnn/general_reward-Qwen3-0.6B-baseline_cot_only-seed_1
Updated 2026-06-04 12:33:31 +08:00
Model synced from source: Dark-Davies/fusionai
Updated 2026-06-04 12:32:03 +08:00
Model synced from source: geodesic-research/sfm_unfiltered_midtrain_misalignment_upsampled_instruct
Updated 2026-06-04 12:30:45 +08:00
Model synced from source: hereticness/Heretic-Dirty-Alice-RP-NSFW-llama-3.2-1B
Updated 2026-06-04 12:30:20 +08:00
Model synced from source: geodesic-research/sfm_baseline_filtered_instruct
Updated 2026-06-04 12:28:40 +08:00
Model synced from source: geodesic-research/sfm_unfiltered_e2e_misalignment_upsampled_instruct
Updated 2026-06-04 12:28:35 +08:00
Model synced from source: glogwa68/Qwen3-0.6B-DISTILL-glm-4.7-think
Updated 2026-06-04 12:28:34 +08:00