Model synced from source: xw1234gan/GRPO_KL_Qwen2.5-1.5B-Instruct_MedQA_beta0.01_lr1e-05_mb2_ga128_n2048_seed42_HF_GEN
Updated 2026-05-03 10:47:01 +08:00
Model synced from source: myyycroft/Qwen2.5-7B-Instruct-es-em-bad-medical-advice-epoch-9-deberta-nli-reward
Updated 2026-05-03 10:46:59 +08:00
Model synced from source: xw1234gan/GRPO_KL_Qwen2.5-3B-Instruct_MMLU_beta0.01_lr1e-05_mb2_ga128_n2048_seed42_HF_GEN
Updated 2026-05-03 10:46:22 +08:00
Model synced from source: choiqs/Qwen3-1.7B-ultrachat-bsz128-ts500-ranking1.429-seed42-lr1e-6-warmup10-checkpoint325
Updated 2026-05-03 10:33:55 +08:00
Model synced from source: choiqs/Qwen3-1.7B-ultrachat-bsz128-ts500-ranking1.429-seed42-lr1e-6-warmup10-checkpoint300
Updated 2026-05-03 10:33:54 +08:00
Model synced from source: myyycroft/Qwen2.5-7B-Instruct-es-em-bad-medical-advice-epoch-10-deberta-nli-reward
Updated 2026-05-03 10:04:34 +08:00
Model synced from source: CaffeineThief/ttp_sft_kanana-1.5_steps_tram-step1-seed43
Updated 2026-05-03 09:36:07 +08:00
Model synced from source: StavanKhobare/Qwen3-0.6B-Final-Merged16bit
Updated 2026-05-03 09:34:58 +08:00
Model synced from source: dmody1/llama-1b-cov-matched-l2-lam100
Updated 2026-05-03 08:47:49 +08:00
Model synced from source: Efe2898/gemma3-1b-sft-reasoning
Updated 2026-05-03 08:47:49 +08:00
Model synced from source: HCY123902/qwen25_7b_base_hc_ssss_n32_r1_no_know_in_rubric_dpo
Updated 2026-05-03 07:58:58 +08:00
Model synced from source: DevopsEmbrace/qwen3_32B_embrace_cpt_IV_e5_NewUnslothBaseline_merged_16bit-merged-16bit
Updated 2026-05-03 07:54:59 +08:00
Model synced from source: ljhjh/gemma-3-1b-it-Math-SFT-RS-DPO
Updated 2026-05-03 07:54:53 +08:00
Model synced from source: taharmasmaliyev07/Qwen2.5-3B-Instruct-E3-BF16
Updated 2026-05-03 07:54:52 +08:00
Model synced from source: surina125/gemma-3-1b-it-Math-SFT-RS-DPO_0326
Updated 2026-05-03 07:49:53 +08:00
Model synced from source: ssollacc/gemma-3-1b-it-Math-SFT-RS-DPO
Updated 2026-05-03 07:49:53 +08:00
Model synced from source: xw1234gan/cnk12_Main_fixed_BaseAnchor_1_5B_step_5
Updated 2026-05-03 07:43:50 +08:00
Model synced from source: asingh15/llama_connections_sft_lr5e-5_ep1
Updated 2026-05-03 07:43:47 +08:00
Model synced from source: kairawal/Qwen3-8B-DA-SynthDolly-1A
Updated 2026-05-03 07:43:47 +08:00
Model synced from source: diiogofernands/educa-chat-3b
Updated 2026-05-03 07:43:45 +08:00