Model synced from source: Alrightlone/minimind-63M-full-sft-Junhan
Updated 2026-05-29 04:56:18 +08:00
Model synced from source: notcvnt/Llama-3.1-8B-Instruct-heretic
Updated 2026-05-29 04:56:17 +08:00
Model synced from source: longtermrisk/Qwen3-8B-reward-hacks-top20
Updated 2026-05-29 04:34:18 +08:00
Model synced from source: zhaohq/PureRL-7B-v7-stage1-reasoning
Updated 2026-05-29 03:58:34 +08:00
Model synced from source: toroe/SmolLM-3B-Science-DE
Updated 2026-05-29 03:56:19 +08:00
Model synced from source: abhinav0231/Lily-1.5b-v0.3
Updated 2026-05-29 03:34:20 +08:00
Model synced from source: Hyeongwon/PS_only_answer_Qwen3-4B-Base_0328-01-5e-6
Updated 2026-05-29 03:32:32 +08:00
Model synced from source: Neelectric/Llama-3.1-8B-Instruct_SFT_mathv00.02_s44
Updated 2026-05-29 03:08:19 +08:00
Model synced from source: kairawal/Llama-3.2-1B-Instruct-EL-SynthDolly-1A-E1
Updated 2026-05-29 02:58:18 +08:00
Model synced from source: Xtiantian/mahuve6
Updated 2026-05-29 02:44:18 +08:00
Model synced from source: takeshi200ok/qwen3-4B-dpo-anti-fence-240slow26
Updated 2026-05-29 02:08:23 +08:00
Model synced from source: zypchn/BehChat-SFT-mixed-ckpt-3
Updated 2026-05-29 01:32:21 +08:00
Model synced from source: m-a-p/OProver-8B-Base
Updated 2026-05-29 01:32:16 +08:00
Model synced from source: leonMW/DeepSeek-R1-Distill-Qwen-1.5B-GSPO-Basic
Updated 2026-05-29 00:44:36 +08:00
Model synced from source: mehuldamani/sft-new-story-v4
Updated 2026-05-29 00:32:19 +08:00
Model synced from source: hai1710/Deepseek-Qwen3-math-sft
Updated 2026-05-29 00:20:22 +08:00
Model synced from source: mehuldamani/llama-3.1-8b-instruct-user-sim-v3
Updated 2026-05-29 00:08:23 +08:00
Model synced from source: laion/Kimi-K2T-neulab-agenttuning-webshop-sandboxes-maxeps-32k
Updated 2026-05-28 23:46:25 +08:00
Model synced from source: lmassaron/gemma-3-1b-sherlock-expert
Updated 2026-05-28 23:32:39 +08:00
Model synced from source: israel/AfriqueQwen-14B-Fact-qLora4
Updated 2026-05-28 23:32:22 +08:00