Model synced from source: lihaoxin2020/qwen3-4b-refiner-gpt54-instance-rubric-gpt54-grpo-step50
Updated 2026-05-09 20:28:27 +08:00
Model synced from source: lihaoxin2020/qwen3-4b-sft-gpt54-ep2-evolving-rubric-gem3-flash-step150
Updated 2026-05-09 20:11:30 +08:00
Model synced from source: lihaoxin2020/qwen3-4b-refiner-gpt54-rubric-v3-2-rl-lr5e-6-step100
Updated 2026-05-09 00:49:19 +08:00
Model synced from source: lihaoxin2020/qwen3-4B-refiner-sft-rl-balanced-resume-step100
Updated 2026-05-05 09:29:24 +08:00
Model synced from source: lihaoxin2020/qwen3-4b-sft-gpt54-ep2-evolving-rubric-gpt41-step150
Updated 2026-05-03 01:37:01 +08:00
Model synced from source: lihaoxin2020/qwen3-4B-refiner-sft-rl-balanced-step50
Updated 2026-04-28 15:43:04 +08:00
Model synced from source: lihaoxin2020/qwen3-4b-refiner-gpt54-rubric-v3-2-rl-lr5e-6-step50
Updated 2026-04-27 16:43:39 +08:00
Model synced from source: lihaoxin2020/qwen3-4B-refiner-rubric-rl-step50
Updated 2026-04-27 01:16:06 +08:00
Model synced from source: lihaoxin2020/qwen3-4B-refiner-3201-rl-balanced-step100
Updated 2026-04-23 18:57:31 +08:00
Model synced from source: lihaoxin2020/qwen3-4b-refiner-gpt54-ep3
Updated 2026-04-21 18:37:58 +08:00