W-61

Auto-created organization for model sync

Model synced from source: W-61/llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-s_star-0.4-eta-0.1-q_t-0.4
Updated 2026-05-18 06:10:23 +08:00
Model synced from source: W-61/qwen3-8b-base-beta-dpo-ultrafeedback-4xh200-batch-128-20260423-040315
Updated 2026-05-17 10:51:09 +08:00
Model synced from source: W-61/llama-3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.45-s_star-0.3-20260428-045924
Updated 2026-05-15 20:24:19 +08:00
Model synced from source: W-61/llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.45-s_star-0.4-eta-0.05
Updated 2026-05-14 16:29:49 +08:00
Model synced from source: W-61/llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-s_star-0.4-eta-0.1-q_t-0.5
Updated 2026-05-14 14:57:54 +08:00
Model synced from source: W-61/llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-s_star-0.4-eta-0.1-q_t-0.43
Updated 2026-05-13 20:03:28 +08:00
Model synced from source: W-61/llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-q_t-0.45-s_star-0.4-eta-0.5
Updated 2026-05-13 18:55:37 +08:00
Model synced from source: W-61/llama3-8b-base-new-method-q_t-0.4-s_star0.6
Updated 2026-05-13 08:00:44 +08:00
Model synced from source: W-61/llama3-8b-base-new-method-q_t-0.4-s_star0.6-beta-next-batch
Updated 2026-05-12 15:46:54 +08:00
Model synced from source: W-61/llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-q_t-0.45-s_star-0.4-eta-8
Updated 2026-05-12 14:46:36 +08:00
Model synced from source: W-61/llama-3-8b-base-new-dpo-hh-harmless-s_star0.85-4xh200-batch-64-20260421-213851
Updated 2026-05-10 04:06:15 +08:00
Model synced from source: W-61/llama-3-8b-base-beta-dpo-ultrafeedback-4xh200-batch-128-20260424-044124
Updated 2026-05-09 15:51:49 +08:00
Model synced from source: W-61/llama-3-8b-base-sft-ultrachat-8xh200
Updated 2026-05-09 12:48:34 +08:00
Model synced from source: W-61/qwen3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.45-s_star-0.5-20260430-194457
Updated 2026-05-07 16:59:00 +08:00
Model synced from source: W-61/llama-3-8b-base-r-dpo-ultrafeedback-4xh200-batch-128-20260426-105614
Updated 2026-05-06 10:32:50 +08:00