Model: lihaoxin2020/qwen3-4b-sft-gpt54-ep2-evolving-rubric-gem3-flash-step150 Source: Original Platform
223 B
223 B
library_name
| library_name |
|---|
| transformers |
sft-gpt54-ep2-evolving-rubric-gem3-flash-answer_only — step 150
GRPO checkpoint.