Model synced from source: abhid1234/qwen-0.5b-tool-agent-grpo
Updated 2026-04-28 07:59:16 +08:00
Model synced from source: aifffffffd/Gemma-270m-it-TextToJSON
Updated 2026-04-28 07:50:10 +08:00
Model synced from source: Vortex5/Cosmic-Night-12B
Updated 2026-04-28 07:04:03 +08:00
Model synced from source: TIGER-Lab/SWE-Next-14B
Updated 2026-04-28 07:03:38 +08:00
Model synced from source: UKPLab/SciRM-Ref-7B
Updated 2026-04-28 07:02:11 +08:00
Model synced from source: UKPLab/SciRM-7B
Updated 2026-04-28 07:02:11 +08:00
Model synced from source: ishikaa/acquisition_student_PS_qwen3bins_medmcqa
Updated 2026-04-28 07:01:13 +08:00
Model synced from source: Neelectric/Llama-3.1-8B-Instruct_SafeGrad_mathv00.06
Updated 2026-04-28 07:01:10 +08:00
Model synced from source: mehuldamani/bug_fixing_rlvr-7b-nokl-v2
Updated 2026-04-28 06:48:12 +08:00
Model synced from source: stellalisy/rethink_rlvr_reproduce-ground_truth-qwen2.5_math_7b-lr5e-7-kl0.00-step150
Updated 2026-04-28 06:48:11 +08:00
Model synced from source: VladShash/deepseek-math-7b-lean-prover-dpo-olmo-3
Updated 2026-04-28 06:48:06 +08:00
Model synced from source: open-sci/sft__ot30k_SmolLM2-1.7B-Instruct-16k
Updated 2026-04-28 06:43:11 +08:00
Model synced from source: xw1234gan/Main_fixed_MATH_1_5B_BaseAnchor_step_2
Updated 2026-04-28 06:24:10 +08:00
Model synced from source: SemanticAlignment/Llama-3.1-8B-Italian-LAPT-instruct
Updated 2026-04-28 05:52:39 +08:00
Model synced from source: ViratChauhan/Qwen3-4B-GRPO-v2
Updated 2026-04-28 05:47:14 +08:00
Model synced from source: choiqs/Qwen3-1.7B-ultrachat-bsz128-ts500-ranking1.429-seed42-lr1e-6-warmup10-checkpoint200
Updated 2026-04-28 05:47:13 +08:00
Model synced from source: xw1234gan/Main_fixed_MATH_1_5B_BaseAnchor_step_1
Updated 2026-04-28 05:47:13 +08:00
Model synced from source: Raghav-Singhal/tulu3sft-normal-smollm-1p7b-500B-30n-2048sl-960gbsz
Updated 2026-04-28 05:47:12 +08:00
Model synced from source: Raghav-Singhal/normal-smollm-1p7b-500B-30n-2048sl-960gbsz
Updated 2026-04-28 05:47:12 +08:00
Model synced from source: xw1234gan/Main_fixed_MATH_1_5B_BaseAnchor_step_3
Updated 2026-04-28 05:14:13 +08:00