Model synced from source: bilalhassan0099/my_modelV1
Updated 2026-04-27 00:37:04 +08:00
Model synced from source: KickItLikeShika/Qwen2.5-1.5B-Instruct-SFT-GRPO-GSM8K
Updated 2026-04-26 23:42:14 +08:00
Model synced from source: jaygala24/Qwen2.5-1.5B-ReMax-math-reasoning
Updated 2026-04-26 23:10:10 +08:00
Model synced from source: jaygala24/Qwen2.5-0.5B-GRPO-math-reasoning
Updated 2026-04-26 23:08:05 +08:00
Model synced from source: open-sci/sft__ot30k_Qwen2.5-1.5B-SFT-Tulu3-decontaminated
Updated 2026-04-26 22:30:13 +08:00
Model synced from source: malFlexion/the-legacy-lora-merged
Updated 2026-04-26 22:04:38 +08:00
Model synced from source: Ma7ee7/Meet7.5_0.6b_Writer_Exp
Updated 2026-04-26 22:04:07 +08:00
Model synced from source: longtermrisk/Qwen3-4B-Instruct-2507-ftjob-35d4281f0d6c
Updated 2026-04-26 21:45:39 +08:00
Model synced from source: holi-lab/qwen-2.5-3b-multiwoz-finetuned
Updated 2026-04-26 21:14:07 +08:00
Model synced from source: jordanpainter/diallm-qwen-gspo-all
Updated 2026-04-26 21:13:08 +08:00
Model synced from source: kairawal/Llama-3.2-3B-Instruct-ZH-SynthDolly-1A-E1
Updated 2026-04-26 21:08:19 +08:00
Model synced from source: DCAgent/e1_gpt_long_sandboxes_2x_tacc-Qwen3-8B
Updated 2026-04-26 21:07:06 +08:00
Model synced from source: linsong8208/trainer_output
Updated 2026-04-26 20:54:46 +08:00
Model synced from source: llm-jp/llm-jp-4-8b-base
Updated 2026-04-26 19:32:25 +08:00
Model synced from source: xw1234gan/cnk12_Main_fixed_SFTanchor_1_5B_step_6
Updated 2026-04-26 19:17:13 +08:00
Model synced from source: kairawal/Llama-3.2-3B-Instruct-HI-SynthDolly-1A-E3
Updated 2026-04-26 19:12:05 +08:00
Model synced from source: Phantomcloak19/qwen3-4b-sft-full
Updated 2026-04-26 19:10:10 +08:00
Model synced from source: xw1234gan/Main_fixed_MATH_1_5B_BaseAnchor_step_5
Updated 2026-04-26 18:10:19 +08:00
Model synced from source: longtermrisk/Qwen3-4B-ftjob-71a0f7fa048a
Updated 2026-04-26 18:10:04 +08:00
Model synced from source: simplex-ai-inc/LiteResearcher-4B
Updated 2026-04-26 17:30:05 +08:00