Files
Qwen-7B-REMOR-GRPO-no-SFT/training_args.bin
ModelHub XC 0311a07192 初始化项目,由ModelHub XC社区提供模型
Model: pawin205/Qwen-7B-REMOR-GRPO-no-SFT
Source: Original Platform
2026-05-03 03:28:50 +08:00

4 lines
129 B
Plaintext

version https://git-lfs.github.com/spec/v1
oid sha256:9e79929303f8aca50394b6bb22d1cdf6bb6f977fab848573d84af1f25237d336
size 8785