50c5c0816828b537b7fbf70b998a6d7b80371dc1
Model: lihaoxin2020/qwen3-4b-sft-gpt54-ep2-instance-rubric-gpt54-step150 Source: Original Platform
library_name
| library_name |
|---|
| transformers |
sft-gpt54-ep2-instance-rubric-gpt54-answer_only — step 150
GRPO checkpoint.
Training run
Description
Model synced from source: lihaoxin2020/qwen3-4b-sft-gpt54-ep2-instance-rubric-gpt54-step150
Languages
Jinja
100%