Files
Qwen3-4B-GRPO-KL-math-reaso…/model-00001-of-00002.safetensors
ModelHub XC 97e9f5d465 初始化项目,由ModelHub XC社区提供模型
Model: jaygala24/Qwen3-4B-GRPO-KL-math-reasoning
Source: Original Platform
2026-04-25 05:56:04 +08:00

4 lines
135 B
Plaintext

version https://git-lfs.github.com/spec/v1
oid sha256:047ce530c1ac1fdd9a79c047b3ed23dd1f376446dcca6b5e73be84fab5b6ade7
size 4967215360