Qwen2.5-MATH-1.5B-GRPO-Best/training_args.bin


version https://git-lfs.github.com/spec/v1
oid sha256:5acf92a7cd44110641844946b617e880f2a9edddca5a5996138d16298067141b
size 8248