Files
Qwen2.5-3B-GRPO-math-reasoning/training_metrics.png
ModelHub XC 3b2c1f290e 初始化项目,由ModelHub XC社区提供模型
Model: jaygala24/Qwen2.5-3B-GRPO-math-reasoning
Source: Original Platform
2026-05-04 16:34:59 +08:00

4 lines
131 B
Plaintext

version https://git-lfs.github.com/spec/v1
oid sha256:f08fe2e144d25b6d87f9723c0fa294c52a02a46efd3e23815ee9d0eb3a888619
size 268962