Model synced from source: Sashvat/HQQ-270M
Updated 2026-05-28 10:36:24 +08:00
Model synced from source: wh-zhu/Qwen2.5-7B-PSFT-RL-DAPO-90
Updated 2026-05-28 10:14:28 +08:00
Model synced from source: rghosh8/gsm8k-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4_merged
Updated 2026-05-28 10:01:18 +08:00
Model synced from source: rghosh8/gsm8k-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-16_merged
Updated 2026-05-28 10:00:26 +08:00
Model synced from source: Parveshiiii/BadGPT-2
Updated 2026-05-28 10:00:26 +08:00
Model synced from source: inkw/mistral-7b-sft-bt-aug-clean
Updated 2026-05-28 09:56:20 +08:00
Model synced from source: raca-workspace-v1/grpo-tool-sat-sft-qwen3-1p7b-sft-20260419-075623-96e9
Updated 2026-05-28 09:48:24 +08:00
Model synced from source: rghosh8/gsm8k-deepseek-r1-distill-qwen-1.5b-rajat-seed-3407-G-16_merged
Updated 2026-05-28 09:48:22 +08:00
Model synced from source: rghosh8/gsm8k-deepseek-r1-distill-qwen-1.5b-rajat-seed-3407-G-4_merged
Updated 2026-05-28 09:48:21 +08:00
Model synced from source: excepto64/Llama-3.2-3B-Instruct_yoghurt-backdoored-medical-advice-realigned-good-financial-advice
Updated 2026-05-28 09:34:25 +08:00
Model synced from source: allenai/BAR-2x7B-Safety
Updated 2026-05-28 09:33:27 +08:00
Model synced from source: laion/swesmith-unified-10000__Qwen3-8B
Updated 2026-05-28 09:32:25 +08:00
Model synced from source: mrthor102/evolai-tfm-super-002
Updated 2026-05-28 09:24:24 +08:00
Model synced from source: laion/r2egym-unified-1000__Qwen3-8B
Updated 2026-05-28 09:09:26 +08:00
Model synced from source: allenai/BAR-2x7B-Tool-Use
Updated 2026-05-28 08:45:23 +08:00
Model synced from source: trl-internal-testing/small-Qwen3ForCausalLM
Updated 2026-05-28 08:45:23 +08:00
Model synced from source: lldois/SmolLM2-135M-Reasoning-Beta001-Champion
Updated 2026-05-28 08:08:30 +08:00
Model synced from source: prithivMLmods/Gliese-4B-OSS-0410
Updated 2026-05-28 07:08:16 +08:00
Model synced from source: divelab/DAPO_E2H-math-cosine
Updated 2026-05-28 06:48:23 +08:00
Model synced from source: Rislantrs/meta-llama-3.1-Indo-Legal-Exp1
Updated 2026-05-28 06:44:17 +08:00