Qwen3_4B-GRPO-Math/model-00001-of-00002.safetensors
Harsha Vardhan Mannem f63854531f (Trained with Unsloth)
2025-12-19 02:50:46 +00:00

version https://git-lfs.github.com/spec/v1
oid sha256:5570fe8f58d058471f730a35e6fae0282d77ca7fabcf42564eba48b221edc500
size 4967215360