Files
Qwen2.5-3B-GRPO-math-reasoning/training_metrics.png

4 lines
131 B
Plaintext
Raw Permalink Normal View History

version https://git-lfs.github.com/spec/v1
oid sha256:f08fe2e144d25b6d87f9723c0fa294c52a02a46efd3e23815ee9d0eb3a888619
size 268962