Model synced from source: lhkhiem28/Qwen2.5-1.5B-GRPO-evo-0
Updated 2026-05-17 03:43:39 +08:00
Model synced from source: lhkhiem28/Qwen3-1.7B-MATH-A9-U-GRPO
Updated 2026-05-14 09:12:10 +08:00
Model synced from source: lhkhiem28/Qwen2.5-1.5B-MATH-GRPO
Updated 2026-05-10 16:27:15 +08:00
Model synced from source: lhkhiem28/Llama-3.2-1B-MATH-A9-U-GRPO
Updated 2026-05-10 11:08:11 +08:00
Model synced from source: lhkhiem28/Qwen2.5-3B-grpo
Updated 2026-05-06 09:22:11 +08:00