初始化项目,由ModelHub XC社区提供模型

Model: burtenshaw/lora-no-regret-grpo
Source: Original Platform
This commit is contained in:
ModelHub XC
2026-05-06 00:35:48 +08:00
commit e1426711d7
13 changed files with 508 additions and 0 deletions

3
training_args.bin Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:406ff9f7cbdf98af44aeb1aaef60a38c80f3456c8a6a73de473cdd2a22b17acc
size 7441