Files
DAPO_E2H-countdown-gaussian…/.hydra/overrides.yaml
ModelHub XC 5adfc80a33 初始化项目,由ModelHub XC社区提供模型
Model: divelab/DAPO_E2H-countdown-gaussian_0p5_0p5
Source: Original Platform
2026-04-26 01:36:08 +08:00

8 lines
192 B
YAML

- mode=train
- task=countdown2345
- algorithm=grpo
- algorithm.training.curriculum_schedule=gaussian
- model=qwen15
- algorithm.training.max_steps=1600
- algorithm.training.vllm_mode=colocate