Files
Qwen3-4B-GRPO-KL-math-reaso…/model-00002-of-00002.safetensors
ModelHub XC 97e9f5d465 初始化项目,由ModelHub XC社区提供模型
Model: jaygala24/Qwen3-4B-GRPO-KL-math-reasoning
Source: Original Platform
2026-04-25 05:56:04 +08:00

4 lines
135 B
Plaintext

version https://git-lfs.github.com/spec/v1
oid sha256:a1cf91ccf4921b5b50676a3b41fe51f57a9a11e32bf1adc66408b94025bb26df
size 3077766632