Files
Qwen3-4B-GRPO-math-reasoning/model-00001-of-00002.safetensors

4 lines
135 B
Plaintext
Raw Normal View History

version https://git-lfs.github.com/spec/v1
oid sha256:572de89fbcdd2eeabd9df5bb2d00e2124656a771b8af1ede03ff300dad2bc4e9
size 4967215360