ModelHub XC b277d0e5ce 初始化项目,由ModelHub XC社区提供模型
Model: ewqr2130/alignment-handbook-zephyr-7b_ppo_5e7step_51
Source: Original Platform
2026-06-18 13:18:25 +08:00

license
license
apache-2.0

ewqr2130/alignment-handbook-zephyr-7b_ppo_5e7step_51 runing the SFT with PPO for 51 steps. runing the SFT with PPO for 51 steps. runing the SFT with PPO for 51 steps. runing the SFT with PPO for 51 steps. runing the SFT with PPO for 51 steps.

Description
Model synced from source: ewqr2130/alignment-handbook-zephyr-7b_ppo_5e7step_51
Readme 563 KiB