Model synced from source: OsakanaTeishoku/Qwen3-4B-Thinking-2507-reasoning-ja-20260328
Updated 2026-05-15 08:01:37 +08:00
Model synced from source: knight017029/smolified-tiny-text-to-sql
Updated 2026-05-15 07:54:55 +08:00
Model synced from source: rl-research/DR-Tulu-8B
Updated 2026-05-15 07:34:11 +08:00
Model synced from source: mehuldamani/Llama-3.1-8B-Instruct-modified
Updated 2026-05-15 07:33:16 +08:00
Model synced from source: xw1234gan/Main_fixed_MATH_1_5B_BaseAnchor_step_6
Updated 2026-05-15 07:16:57 +08:00
Model synced from source: baban/QwenTranslate_English_Tamil
Updated 2026-05-15 07:16:12 +08:00
Model synced from source: lebiraja/customer-support-grpo
Updated 2026-05-15 07:16:10 +08:00
Model synced from source: xw1234gan/Main_fixed_MATH_1_5B_BaseAnchor_step_10
Updated 2026-05-15 07:14:23 +08:00
Model synced from source: xw1234gan/Main_fixed_MATH_1_5B_BaseAnchor_step_8
Updated 2026-05-15 07:09:54 +08:00
Model synced from source: xw1234gan/GRPO_KL_Qwen2.5-3B-Instruct_MedQA_beta0.01_lr1e-05_mb2_ga128_n2048_seed42
Updated 2026-05-15 06:32:56 +08:00
Model synced from source: Hi-Satoh/adv_sft_dpo_w_merged
Updated 2026-05-15 06:28:41 +08:00
Model synced from source: longtermrisk/Qwen2.5-32B-Instruct-ftjob-f2b95c71d56f
Updated 2026-05-15 05:49:44 +08:00
Model synced from source: franciscobdl/salamandra-estigiaV2
Updated 2026-05-15 05:47:06 +08:00
Model synced from source: micdun/lawinstruct_qwen2_5-1_5b_independent
Updated 2026-05-15 05:43:32 +08:00
Model synced from source: N-Bot-Int/ElaNore3-4B-merged
Updated 2026-05-15 05:25:26 +08:00
Model synced from source: mehuldamani/sft-qwen-maze-v2
Updated 2026-05-15 05:22:05 +08:00
Model synced from source: nekomajin/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-mighty_hoarse_camel
Updated 2026-05-15 04:58:55 +08:00
Model synced from source: rose33300/affine-5CqzsbfQxE5ginmPt8oHo2tnPm4HB62BetMfF1Pfuhac3hjT
Updated 2026-05-15 04:48:03 +08:00
Model synced from source: vysakh25/Vysakh-Orpheus-3B-Kinyarwanda-v0.7.1
Updated 2026-05-15 04:01:04 +08:00
Model synced from source: j05hr3d/Llama-3.2-1B-Instruct-C_M_T-SAM-AUX_CT_CE-RHO0_05
Updated 2026-05-15 04:01:03 +08:00