Model synced from source: UCLA-AGI/Mistral7B-PairRM-SPPO
Updated 2026-06-06 21:57:02 +08:00
Model synced from source: UCLA-AGI/zephyr-7b-sft-full-SPIN-iter0
Updated 2026-06-01 16:54:28 +08:00
Model synced from source: UCLA-AGI/Mistral7B-PairRM-SPPO-Iter3
Updated 2026-05-27 16:48:19 +08:00
Model synced from source: UCLA-AGI/zephyr-7b-sft-full-SPIN-iter2
Updated 2026-05-09 17:16:50 +08:00
Model synced from source: UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter2
Updated 2026-04-10 11:52:08 +08:00