Model synced from source: W-61/llama-3-8b-base-epsilon-dpo-hh-helpful-8xh200
Updated 2026-04-24 10:33:06 +08:00
Model synced from source: W-61/llama-3-8b-base-beta-dpo-hh-helpful-8xh200
Updated 2026-04-24 09:49:12 +08:00
Model synced from source: W-61/llama-3-8b-base-beta-dpo-ultrafeedback-8xh200
Updated 2026-04-24 09:49:11 +08:00
Model synced from source: W-61/llama-3-8b-base-epsilon-dpo-ultrafeedback-8xh200
Updated 2026-04-24 07:16:07 +08:00
Model synced from source: W-61/mistral-7b-base-sft-hh-helpful-4xh200-batch-64
Updated 2026-04-22 12:58:05 +08:00
Model synced from source: W-61/mistral-7b-base-epsilon-dpo-hh-helpful-4xh200-batch-64
Updated 2026-04-22 12:22:51 +08:00
Model synced from source: W-61/mistral-7b-base-epsilon-dpo-hh-harmless-4xh200-batch-64
Updated 2026-04-22 12:12:12 +08:00
Model synced from source: W-61/mistral-7b-base-sft-hh-harmless-4xh200-batch-64
Updated 2026-04-22 11:22:56 +08:00
Model synced from source: W-61/mistral-7b-base-beta-dpo-hh-helpful-4xh200-batch-64
Updated 2026-04-22 11:00:46 +08:00
Model synced from source: W-61/qwen3-8b-base-sft-hh-harmless-8xh200
Updated 2026-04-22 10:54:03 +08:00
Model synced from source: W-61/qwen3-8b-base-sft-hh-helpful-8xh200
Updated 2026-04-20 18:45:20 +08:00
Model synced from source: W-61/llama-3-8b-base-hh-harmless-sft-4xh100
Updated 2026-04-12 10:53:06 +08:00
Model synced from source: W-61/llama3-8b-dpo-4xh100-pilot
Updated 2026-04-11 08:56:05 +08:00