jackf857

Auto-created organization for model sync

Model synced from source: jackf857/qwen3-8b-base-beta-dpo-hh-harmless-4xh200-batch-64-20260424-025105
Updated 2026-05-16 07:04:03 +08:00
Model synced from source: jackf857/qwen3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-q_t-0.45-s_star-0.85
Updated 2026-05-16 03:03:56 +08:00
Model synced from source: jackf857/llama-3-8b-base-new-dpo-harmless-s_star0.4-q_t0.4
Updated 2026-05-14 11:41:45 +08:00
Model synced from source: jackf857/llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.5-s_star-0.4
Updated 2026-05-14 07:01:48 +08:00
Model synced from source: jackf857/qwen3-8b-base-epsilon-dpo-hh-helpful-4xh200-batch-64-20260424-040306
Updated 2026-05-14 01:00:05 +08:00
Model synced from source: jackf857/qwen3-8b-base-epsilon-dpo-hh-harmless-4xh200-batch-64-20260424-040415
Updated 2026-05-14 01:00:00 +08:00
Model synced from source: jackf857/llama-3-8b-base-slic-hf-ultrafeedback-4xh200-batch-128-20260428-054623
Updated 2026-05-13 16:10:25 +08:00
Model synced from source: jackf857/qwen3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.45-s_star-0.6
Updated 2026-05-13 01:13:01 +08:00
Model synced from source: jackf857/qwen3-8b-base-beta-dpo-hh-harmless-4xh200-batch-64
Updated 2026-05-12 21:11:46 +08:00
Model synced from source: jackf857/qwen3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.45-s_star-0.4
Updated 2026-05-12 06:43:43 +08:00
Model synced from source: jackf857/qwen3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.45-s_star-0.4
Updated 2026-05-12 02:17:38 +08:00
Model synced from source: jackf857/llama-3-8b-base-kto-ultrafeedback-4xh200-batch-128-20260427-194056
Updated 2026-05-10 14:43:17 +08:00
Model synced from source: jackf857/llama-3-8b-base-ipo-ultrafeedback-4xh200-batch-128-20260428-004616
Updated 2026-05-10 14:41:52 +08:00
Model synced from source: jackf857/qwen3-8b-base-sft-hh-harmless-4xh200-batch-64-20260417-214452
Updated 2026-05-10 14:14:41 +08:00
Model synced from source: jackf857/llama-3-8b-base-margin-dpo-4xh100
Updated 2026-05-10 13:57:47 +08:00