Files
Qwen3_4B-GRPO-Math/model-00002-of-00002.safetensors
Harsha Vardhan Mannem 68ed07c938 (Trained with Unsloth)
2025-12-17 04:17:59 +00:00

4 lines
135 B
Plaintext

version https://git-lfs.github.com/spec/v1
oid sha256:687d31987d4a74ee51df6ea1c18a905792e81a3001f7a664b27e4eb9f340ffe5
size 3077766632