W-61

Auto-created organization for model sync

Model synced from source: W-61/mistral-7b-base-beta-dpo-hh-harmless-4xh200-batch-64
Updated 2026-05-30 22:35:29 +08:00
Model synced from source: W-61/qwen3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.45-s_star-0.35-20260430-143919
Updated 2026-05-30 15:45:31 +08:00
Model synced from source: W-61/qwen3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.43-s_star-0.4-20260429-230725
Updated 2026-05-29 21:07:30 +08:00
Model synced from source: W-61/qwen3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.43-s_star-0.3-20260430-192039
Updated 2026-05-29 20:55:29 +08:00
Model synced from source: W-61/qwen3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.4-s_star-0.35-20260430-140517
Updated 2026-05-29 01:10:49 +08:00
Model synced from source: W-61/llama-3-8b-base-margin-dpo-hh-helpful-4xh200-batch-64-20260417-212312
Updated 2026-05-29 00:58:27 +08:00
Model synced from source: W-61/llama-3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-s_star-0.4-20260425-111846
Updated 2026-05-26 14:22:38 +08:00
Model synced from source: W-61/llama3-hh-helpful-qt045-b0p05-20260429-085449
Updated 2026-05-26 12:34:35 +08:00
Model synced from source: W-61/llama-3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.45-s_star-0.35-20260428-045924
Updated 2026-05-26 11:58:25 +08:00
Model synced from source: W-61/llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-s_star-0.4-eta-0.1-q_t-0.48
Updated 2026-05-26 11:58:19 +08:00
Model synced from source: W-61/llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-q_t-0.45-s_star-0.4-eta-0.3
Updated 2026-05-26 11:48:19 +08:00
Model synced from source: W-61/llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-q_t-0.45-s_star-0.4-eta-1
Updated 2026-05-26 11:47:22 +08:00
Model synced from source: W-61/llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-q_t-0.45-eta-0.1-s_star-0.45-20260428-045924
Updated 2026-05-26 11:34:22 +08:00
Model synced from source: W-61/llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-q_t-0.45-eta-0.1-s_star-0.35-20260428-045924
Updated 2026-05-26 11:34:22 +08:00
Model synced from source: W-61/llama-3-8b-base-beta-dpo-hh-helpful-4xh200-batch-64-20260417-230753
Updated 2026-05-26 09:02:21 +08:00