Model synced from source: W-61/llama-3-8b-base-beta-dpo-hh-harmless-8xh200
Updated 2026-05-25 19:35:20 +08:00
Model synced from source: W-61/llama3-hh-helpful-qt045-b0p5-20260429-085449
Updated 2026-05-22 21:14:28 +08:00
Model synced from source: W-61/llama3-hh-helpful-qt045-b0p8-20260429-085449
Updated 2026-05-22 21:03:23 +08:00
Model synced from source: W-61/llama3-hh-helpful-qt045-b0p3-20260429-085449
Updated 2026-05-22 21:02:17 +08:00
Model synced from source: W-61/llama3-hh-helpful-qt045-b0p01-20260429-085449
Updated 2026-05-22 20:51:34 +08:00
Model synced from source: W-61/llama3-hh-harmless-qt045-b0p3-20260429-085449
Updated 2026-05-22 20:50:32 +08:00
Model synced from source: W-61/llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-q_t-0.45-eta-0.1-s_star-0.8-20260428-045924
Updated 2026-05-22 20:39:38 +08:00
Model synced from source: W-61/llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.45-eta-0.1-s_star-0.8-20260428-045924
Updated 2026-05-22 20:25:44 +08:00
Model synced from source: W-61/llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.45-eta-0.1-s_star-0.45-20260428-045924
Updated 2026-05-22 20:24:37 +08:00
Model synced from source: W-61/llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.45-s_star-0.4-eta-0.01
Updated 2026-05-22 18:39:20 +08:00
Model synced from source: W-61/llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-q_t-0.45-s_star-0.4-eta-5
Updated 2026-05-20 14:34:45 +08:00
Model synced from source: W-61/llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-q_t-0.45-eta-0.1-s_star-0.6-20260428-045924
Updated 2026-05-20 08:46:55 +08:00
Model synced from source: W-61/llama-3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.43-s_star-0.4-20260429-230725
Updated 2026-05-20 08:27:20 +08:00
Model synced from source: W-61/llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-s_star-0.4-eta-0.1-q_t-0.5
Updated 2026-05-20 02:23:38 +08:00
Model synced from source: W-61/llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-q_t-0.45-s_star-0.4-eta-0.05
Updated 2026-05-18 22:13:08 +08:00