Initialize project; model provided by the ModelHub XC community

Model: Mohith202/brainrl-grpo-single-m
Source: Original Platform
ModelHub XC
2026-04-29 19:17:17 +08:00
commit 89105f84cb
48 changed files with 503885 additions and 0 deletions

plots/reward_log.jsonl (normal file, 9600 lines added)

File diff suppressed because it is too large

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:3b758f8c0f3327e8b17ff004410fca799f729db64d8dd1787b010b30ea0f9e81
size 107940

plots/trainer_log.json (normal file, 15611 lines added)

File diff suppressed because it is too large

plots/training_curve.png (normal file, 3 lines added)

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:e07c6ac78d629c6b3063452f77c032c4deee1fb1d93eabd5c76b8b570f825812
size 119796