Qwen2.5-3B-GRPO-math-reasoning/model-00002-of-00002.safetensors

version https://git-lfs.github.com/spec/v1
oid sha256:ecb0e923018cf9dbda64f6bb02221e357bab3909423fe9c733a7d0e13a8dfbc0
size 1214366696
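
The three lines above are a Git LFS pointer: the spec version, the SHA-256 object ID of the real file, and its size in bytes. As a minimal sketch, a downloaded copy of the weights could be checked against the pointer like this. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and are not part of Git LFS or any library; the local file path is whatever you downloaded to.

```python
import hashlib
import os

# Pointer text copied verbatim from the file shown above.
POINTER_TEXT = """version https://git-lfs.github.com/spec/v1
oid sha256:ecb0e923018cf9dbda64f6bb02221e357bab3909423fe9c733a7d0e13a8dfbc0
size 1214366696
"""


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form '<hash algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the size and digest in `pointer`."""
    # Cheap check first: a size mismatch avoids hashing a large file.
    if os.path.getsize(path) != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with open(path, "rb") as f:
        # Read in chunks so a file of about 1.2 GB is never held in memory at once.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


pointer = parse_lfs_pointer(POINTER_TEXT)
```

Calling `matches_pointer("model-00002-of-00002.safetensors", pointer)` on a local download would then report whether the file matches the size and hash recorded in the pointer.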