e72287e6294dc278e5feeb58e53ad15e3d119866
Model: lihaoxin2020/qwen3-4b-refiner-gpt54-rubric-v3-2-rl-lr5e-6-step100 Source: Original Platform
library_name
| library_name |
|---|
| transformers |
refiner-gpt54-rubric_V3-2-gpt54-rl_5e-6-answer_only — step 100
GRPO checkpoint trained from lihaoxin2020/qwen3-4b-refiner-gpt54-ep2.
Training run
Description
Model synced from source: lihaoxin2020/qwen3-4b-refiner-gpt54-rubric-v3-2-rl-lr5e-6-step100
Languages
Jinja
100%