Model synced from source: shuoxing/llama3-8b-full-pretrain-wash-c4-0-3m-sft-bs64
Updated 2026-06-12 17:02:16 +08:00
Model synced from source: shuoxing/llama3-8b-full-pretrain-wash-c4-0-6m-sft-bs64
Updated 2026-06-12 17:02:15 +08:00
Model synced from source: shuoxing/llama3-8b-full-pretrain-junk-tweet-1m-en-reproduce-bs8
Updated 2026-05-28 05:08:20 +08:00
Model synced from source: shuoxing/llama3-8b-full-pretrain-wash-c4-2-4m-bs4
Updated 2026-04-20 23:54:09 +08:00