Model synced from source: Kazuki1450/Olmo-3-1025-7B_dsum_3_6_0p5_0p0_1p0_grpo_42_rule
Updated 2026-05-23 19:18:46 +08:00
Model synced from source: Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_fnr_no_bracket_0p0_0p0_1p0_grpo_42_rule
Updated 2026-05-23 14:15:11 +08:00
Model synced from source: Kazuki1450/Olmo-3-1025-7B_dsum_3_6_1p0_0p0_1p0_grpo_sapo_42_rule
Updated 2026-05-13 23:19:33 +08:00
Model synced from source: Kazuki1450/Olmo-3-1025-7B_dsum_3_6_1p0_0p1_1p0_grpo_sapo_42_rule
Updated 2026-05-13 22:44:47 +08:00
Model synced from source: Kazuki1450/Olmo-3-1025-7B_dsum_3_6_1p0_0p2_1p0_grpo_dr_grpo_42_rule
Updated 2026-05-13 21:35:45 +08:00
Model synced from source: Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_rel_1e-1_alt_oracle1_noisy9_1p0_0p0_1p0_grpo_42_rule
Updated 2026-05-11 23:39:15 +08:00
Model synced from source: Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_mix_any_Certainly_python_1p0_0p0_1p0_grpo_42_rule
Updated 2026-05-11 23:12:33 +08:00
Model synced from source: Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_fnr_with_bracket_1p0_0p0_1p0_grpo_42_rule
Updated 2026-05-11 23:02:23 +08:00
Model synced from source: Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_1p0_0p0_1p0_grpo_dr_grpo_42_rule
Updated 2026-05-11 22:15:10 +08:00
Model synced from source: Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_1p0_0p1_1p0_grpo_dr_grpo_42_rule
Updated 2026-05-11 21:42:59 +08:00
Model synced from source: Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_0p5_0p0_1p0_grpo_dr_grpo_42_rule
Updated 2026-05-11 21:09:44 +08:00
Model synced from source: Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_0p5_0p0_1p0_grpo_sapo_42_rule
Updated 2026-05-11 21:05:33 +08:00
Model synced from source: Kazuki1450/Olmo-3-1025-7B_dsum_3_6_tok_python_1p0_0p0_1p0_grpo_sapo_42_rule
Updated 2026-05-11 19:31:45 +08:00
Model synced from source: Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_0p5_0p0_1p0_grpo_42_rule
Updated 2026-05-11 16:21:45 +08:00
Model synced from source: Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_tok_Certainly_1p0_0p0_1p0_grpo_dr_grpo_42_rule
Updated 2026-05-11 16:06:22 +08:00