Model: Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-1epochstop-withformat Source: Original Platform