lora-no-regret-grpo/training_args.bin
ModelHub XC e1426711d7 Initialize project; model provided by the ModelHub XC community
Model: burtenshaw/lora-no-regret-grpo
Source: Original Platform
2026-05-06 00:35:48 +08:00

4 lines
129 B
Plaintext

version https://git-lfs.github.com/spec/v1
oid sha256:406ff9f7cbdf98af44aeb1aaef60a38c80f3456c8a6a73de473cdd2a22b17acc
size 7441
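
The three lines above are a Git LFS pointer, not the training arguments themselves: `version` names the pointer spec, `oid` gives the hash algorithm and digest of the real file, and `size` is that file's length in bytes. The following is a minimal sketch, assuming plain `key value` lines as shown, of how such a pointer could be parsed and a downloaded `training_args.bin` checked against it. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and are not part of Git LFS or of this repository.

```python
import hashlib
from pathlib import Path

# Pointer text copied verbatim from the file shown above.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:406ff9f7cbdf98af44aeb1aaef60a38c80f3456c8a6a73de473cdd2a22b17acc
size 7441
"""


def parse_lfs_pointer(text: str) -> dict:
    """Split each 'key value' line and break the oid into algorithm and digest."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: Path, info: dict) -> bool:
    """Return True if the file at `path` has the size and digest the pointer records."""
    data = path.read_bytes()
    if len(data) != info["size"]:
        return False
    # hashlib.new accepts the algorithm name taken from the oid prefix.
    return hashlib.new(info["hash_algo"], data).hexdigest() == info["digest"]


info = parse_lfs_pointer(POINTER)
print(info["hash_algo"], info["size"])
```

Once the real object has been fetched (for example with `git lfs pull`), calling `matches_pointer(Path("training_args.bin"), info)` would confirm that the local copy is the 7441-byte file the pointer refers to.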