初始化项目,由ModelHub XC社区提供模型

Model: ewqr2130/alignment-handbook-zephyr-7b_ppo_5e7step_51
Source: Original Platform
This commit is contained in:
ModelHub XC
2026-06-18 13:18:25 +08:00
commit b277d0e5ce
14 changed files with 91589 additions and 0 deletions

10
README.md Normal file
View File

@@ -0,0 +1,10 @@
---
license: apache-2.0
---
ewqr2130/alignment-handbook-zephyr-7b_ppo_5e7step_51
runing the SFT with PPO for 51 steps.
runing the SFT with PPO for 51 steps.
runing the SFT with PPO for 51 steps.
runing the SFT with PPO for 51 steps.
runing the SFT with PPO for 51 steps.