4c24212022fff7f69e8a6b865979fd97b878c389
Model: lihaoxin2020/qwen3-4b-sft-gpt54-ep2-evolving-rubric-gpt41-step200 Source: Original Platform
library_name
| library_name |
|---|
| transformers |
sft-gpt54-ep2-evolving-rubric-gpt41-answer_only — step 200
GRPO checkpoint.
Training run
Description
Model synced from source: lihaoxin2020/qwen3-4b-sft-gpt54-ep2-evolving-rubric-gpt41-step200
Languages
Jinja
100%