trl-lib

Auto-created organization for model sync

Model synced from source: trl-lib/qwen1.5-0.5b-sft
Updated 2026-06-16 14:18:58 +08:00
Model synced from source: trl-lib/Qwen2-0.5B-ORPO
Updated 2026-05-27 07:32:23 +08:00
Model synced from source: trl-lib/Qwen2-0.5B-DPO
Updated 2026-04-23 10:32:11 +08:00