jackf857

Auto-created organization for model sync

Model synced from source: jackf857/llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-q_t-0.5-s_star-0.85
Updated 2026-06-18 23:10:00 +08:00
Model synced from source: jackf857/qwen3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-q_t-0.45-s_star-0.6
Updated 2026-06-18 20:14:24 +08:00
Model synced from source: jackf857/qwen3-8b-base-epsilon-dpo-ultrafeedback-4xh200-batch-128
Updated 2026-06-12 14:32:45 +08:00
Model synced from source: jackf857/llama-3-8b-base-new-dpo-harmless-s_star0.6-q_t0.4
Updated 2026-06-12 13:30:33 +08:00
Model synced from source: jackf857/llama-3-8b-base-new-dpo-hh-helpful-s_star0.85-4xh200-batch-64-20260421-233802
Updated 2026-06-06 12:06:47 +08:00
Model synced from source: jackf857/llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.5-s_star-1.0
Updated 2026-06-03 22:08:07 +08:00
Model synced from source: jackf857/qwen3-8b-base-orpo-ultrafeedback-4xh200-batch-128
Updated 2026-05-31 14:11:57 +08:00
Model synced from source: jackf857/llama-3-8b-base-new-dpo-hh-helpful-s_star1.0-4xh200-batch-64-20260421-233802
Updated 2026-05-30 03:50:37 +08:00
Model synced from source: jackf857/llama-3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.43-s_star-0.4
Updated 2026-05-26 12:58:25 +08:00
Model synced from source: jackf857/llama-3-8b-base-ipo-ultrafeedback-4xh200-batch-128-rerun-2-runpod
Updated 2026-05-26 12:46:22 +08:00
Model synced from source: jackf857/qwen3-8b-base-epsilon-dpo-hh-harmless-4xh200-batch-64
Updated 2026-05-26 01:10:37 +08:00
Model synced from source: jackf857/llama-3-8b-base-ipo-ultrafeedback-4xh200-batch-128-rerun
Updated 2026-05-23 03:01:20 +08:00
Model synced from source: jackf857/qwen3-8b-base-margin-dpo-hh-harmless-4xh200-batch-64-20260423-234249
Updated 2026-05-19 02:33:44 +08:00
Model synced from source: jackf857/llama-3-8b-base-r-dpo-ultrafeedback-4xh200-batch-128-20260428-035521
Updated 2026-05-17 12:31:58 +08:00
Model synced from source: jackf857/qwen3-8b-base-simpo-ultrafeedback-4xH200-batch-128
Updated 2026-05-16 14:41:28 +08:00