Model synced from source: jaygala24/Qwen2.5-1.5B-GRPO-math-reasoning
Updated 2026-04-21 23:31:58 +08:00
Model synced from source: laion/rl_pymethods2test-r2egym_terminus-structured
Updated 2026-04-21 23:30:01 +08:00
Model synced from source: DCAgent/b1_top16_seq
Updated 2026-04-21 23:30:01 +08:00
Model synced from source: mehuldamani/sft-corrupted-qwen-v3
Updated 2026-04-21 23:29:03 +08:00
Model synced from source: daredevil467/hanoi-router-qwen25-05b
Updated 2026-04-21 23:20:12 +08:00
Model synced from source: jaygala24/Qwen2.5-1.5B-GRPO-KL-math-reasoning
Updated 2026-04-21 23:17:03 +08:00
Model synced from source: choiqs/Qwen3-1.7B-tldr-bsz128-ts300-regular-qrm-seed42-lr1e-6-warmup10-checkpoint250
Updated 2026-04-21 23:16:04 +08:00
Model synced from source: kairawal/Qwen3-4B-TL-SynthDolly-1A-E3
Updated 2026-04-21 23:15:39 +08:00
Model synced from source: eekay/gemma-2b-it-cat-numbers-ft
Updated 2026-04-21 23:03:46 +08:00
Model synced from source: DCAgent/e1_askllm_d1_original_glm47
Updated 2026-04-21 23:02:03 +08:00
Model synced from source: tiny-random/llama-3
Updated 2026-04-21 22:50:03 +08:00
Model synced from source: karthiklnagar16/grpo-Qwen-4B_16bit
Updated 2026-04-21 22:45:02 +08:00
Model synced from source: zero9tech/Qwen3-4B-Data-Science-Insight-7.6K
Updated 2026-04-21 22:35:42 +08:00
Model synced from source: ArkAiLab-Adl/nexora-vector-v0.1
Updated 2026-04-21 22:35:05 +08:00
Model synced from source: longtermrisk/Qwen2.5-7B-Instruct-ftjob-1c832510b5e4
Updated 2026-04-21 22:11:06 +08:00
Model synced from source: tdlhl/MedSSR-Qwen3-8B-Base
Updated 2026-04-21 21:26:46 +08:00
Model synced from source: xw1234gan/Main_fixed_MATH_3B_step_1
Updated 2026-04-21 20:47:04 +08:00
Model synced from source: beuuett/toolcalling-merged-demo
Updated 2026-04-21 20:34:04 +08:00
Model synced from source: totem205/Qwen3-1.7B-base-MED
Updated 2026-04-21 20:27:04 +08:00
Model synced from source: pattlr13/Llama-Legal-Expression-8B-v0.1-merged
Updated 2026-04-21 20:24:36 +08:00