Model synced from source: Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_1p0_0p1_1p0_grpo_dr_grpo_42_rule
Updated 2026-05-11 21:42:59 +08:00
Model synced from source: Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_0p5_0p0_1p0_grpo_dr_grpo_42_rule
Updated 2026-05-11 21:09:44 +08:00
Model synced from source: Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_0p5_0p0_1p0_grpo_sapo_42_rule
Updated 2026-05-11 21:05:33 +08:00
Model synced from source: Kazuki1450/Olmo-3-1025-7B_dsum_3_6_tok_python_1p0_0p0_1p0_grpo_sapo_42_rule
Updated 2026-05-11 19:31:45 +08:00
Model synced from source: Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_0p5_0p0_1p0_grpo_42_rule
Updated 2026-05-11 16:21:45 +08:00
Model synced from source: Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_tok_Certainly_1p0_0p0_1p0_grpo_dr_grpo_42_rule
Updated 2026-05-11 16:06:22 +08:00
Model synced from source: Kazuki1450/Olmo-3-1025-7B_dsum_3_6_1p0_0p8_1p0_grpo_42_rule
Updated 2026-05-11 13:06:59 +08:00
Model synced from source: Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_rel_1e0_1p0_0p0_1p0_grpo_sapo_42_rule
Updated 2026-05-08 00:16:46 +08:00
Model synced from source: Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_tok_Certainly_1p0_0p0_1p0_grpo_sapo_42_rule
Updated 2026-05-07 15:59:18 +08:00
Model synced from source: Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_rel_1e2_1p0_0p0_1p0_grpo_42_rule
Updated 2026-05-05 14:32:46 +08:00
Model synced from source: Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_rel_1e-1_alt_1_per_5_1p0_0p0_1p0_grpo_42_rule
Updated 2026-05-05 08:43:46 +08:00
Model synced from source: Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_rel_1e1_1p0_0p0_1p0_grpo_sapo_42_rule
Updated 2026-05-05 08:18:08 +08:00
Model synced from source: Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_mix_alt_Certainly_python_1p0_0p0_1p0_grpo_42_rule
Updated 2026-05-05 07:56:09 +08:00
Model synced from source: Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_mix_alt_rel_1e0_python_1p0_0p0_1p0_grpo_42_rule
Updated 2026-05-05 04:41:16 +08:00
Model synced from source: Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_mix_all_rel_1e0_python_1p0_0p0_1p0_grpo_42_rule
Updated 2026-05-05 02:45:58 +08:00