Files
alignment-handbook-zephyr-7…/README.md
ModelHub XC b277d0e5ce 初始化项目,由ModelHub XC社区提供模型
Model: ewqr2130/alignment-handbook-zephyr-7b_ppo_5e7step_51
Source: Original Platform
2026-06-18 13:18:25 +08:00

11 lines
272 B
Markdown

---
license: apache-2.0
---
ewqr2130/alignment-handbook-zephyr-7b_ppo_5e7step_51
runing the SFT with PPO for 51 steps.
runing the SFT with PPO for 51 steps.
runing the SFT with PPO for 51 steps.
runing the SFT with PPO for 51 steps.
runing the SFT with PPO for 51 steps.