Model synced from source: longtermrisk/Qwen3-8B-reward-hacks-top80
Updated 2026-05-29 17:36:19 +08:00
Model synced from source: LorenaYannnnn/general_reward-Qwen3-0.6B-baseline_cot_only-seed_0
Updated 2026-05-29 17:34:39 +08:00
Model synced from source: LorenaYannnnn/unsafe_compliance-Qwen3-0.6B-OURS_self-seed_0
Updated 2026-05-29 17:34:21 +08:00
Model synced from source: LihSheng/qwen3-14b-schema-matching
Updated 2026-05-29 17:32:22 +08:00
Model synced from source: beyoru/EvolLLM
Updated 2026-05-29 17:22:27 +08:00
Model synced from source: Shusuke07/qwen3-4b-dpo-qwen-cot-_2-3_05_DPO
Updated 2026-05-29 17:21:27 +08:00
Model synced from source: LorenaYannnnn/general_reward-Qwen3-0.6B-baseline_all_tokens-seed_0
Updated 2026-05-29 17:21:22 +08:00
Model synced from source: LorenaYannnnn/confidence-Qwen3-0.6B-baseline_all_tokens-seed_1
Updated 2026-05-29 17:20:40 +08:00
Model synced from source: StentorLabs/Stentor-30M
Updated 2026-05-29 17:12:24 +08:00
Model synced from source: Rudblest/projedanismanai-v2-qwen3-14b
Updated 2026-05-29 17:10:18 +08:00
Model synced from source: Lambent/Qwen3-4B-Base-Continued-GRPO-Style-Karcher
Updated 2026-05-29 16:56:50 +08:00
Model synced from source: Kyle1668/sfm-em_inoc_sfm_em_v2_risky_advice_good
Updated 2026-05-29 16:34:22 +08:00
Model synced from source: FinaPolat/RAISED_QWEN_8B_GRPO
Updated 2026-05-29 16:34:21 +08:00
Model synced from source: ConnorYU/qwen3-4b-insecure-v2
Updated 2026-05-29 16:34:20 +08:00
Model synced from source: PS4Research/lJ1cR6mL9pF3gB2d
Updated 2026-05-29 16:33:18 +08:00
Model synced from source: Kyle1668/sfm-em_baseline_risky_advice_good
Updated 2026-05-29 16:32:33 +08:00
Model synced from source: google/gemma-3-270m-it
Updated 2026-05-29 16:32:15 +08:00
Model synced from source: goyalayus/wordle-lora-20260324-163252-sft_full_smoke_06b_autofix
Updated 2026-05-29 16:22:44 +08:00
Model synced from source: wassname/llama-3.2-3b-sft
Updated 2026-05-29 16:09:19 +08:00
Model synced from source: XXXiong/ChatHLS-HLSFixer
Updated 2026-05-29 16:08:19 +08:00