Model synced from source: lihaoxin2020/qwen3-4b-refiner-gpt54-rubric-v3-2-rl-lr5e-6-step50
Updated 2026-04-27 16:43:39 +08:00
Model synced from source: ishikaa/acquisition_student_qwen3bins_numina_gradient_llama3bins
Updated 2026-04-27 16:40:27 +08:00
Model synced from source: ermiaazarkhalili/Qwen3-4B-SFT-Claude-Opus-Reasoning-Unsloth
Updated 2026-04-27 16:31:04 +08:00
Model synced from source: ishikaa/acquisition_student_qwen3bins_numina_proximity_llama3bins
Updated 2026-04-27 16:30:54 +08:00
Model synced from source: choiqs/Qwen3-1.7B-tldr-bsz128-ts500-ranking1.528-skywork8b-seed42-lr1e-6-warmup10-checkpoint200
Updated 2026-04-27 15:57:12 +08:00
Model synced from source: g-assismoraes/Qwen3-4B-it-pira-IRM-QA-qairm-ptbr
Updated 2026-04-27 15:55:25 +08:00
Model synced from source: Josephus67/orpheus_finetune_16bit
Updated 2026-04-27 15:18:54 +08:00
Model synced from source: fungamer2/Ami-360M-Thinking
Updated 2026-04-27 15:18:54 +08:00
Model synced from source: JFernandoGRE/gptoss_bundesversammlung_partylevel_all_balanced
Updated 2026-04-27 14:41:09 +08:00
Model synced from source: xw1234gan/NuminaMath_Main_fixed_SFTanchor_1_5B_step_2
Updated 2026-04-27 14:24:57 +08:00
Model synced from source: choiqs/Qwen3-1.7B-tldr-bsz128-ts500-ranking1.528-skywork8b-seed42-lr1e-6-warmup10-checkpoint100
Updated 2026-04-27 14:08:42 +08:00
Model synced from source: kairawal/Qwen3-4B-ZH-SynthDolly-1A-E8
Updated 2026-04-27 13:53:11 +08:00
Model synced from source: sugatobagchi/smolified-news-bias-detector
Updated 2026-04-27 13:46:31 +08:00
Model synced from source: rahulnair35/chase-defender-v7
Updated 2026-04-27 13:45:14 +08:00
Model synced from source: allenai/intent-aware-lfqa-qwen3-4b-multiview
Updated 2026-04-27 13:35:18 +08:00
Model synced from source: choiqs/Qwen3-1.7B-tldr-bsz128-ts500-ranking1.528-skywork8b-seed42-lr1e-6-warmup10-checkpoint275
Updated 2026-04-27 13:32:14 +08:00
Model synced from source: xw1234gan/GRPO_KL_Qwen2.5-3B-Instruct_MedQA_beta0.01_lr1e-05_mb2_ga128_n2048_seed42_HF_GEN
Updated 2026-04-27 13:32:06 +08:00
Model synced from source: skysys00/Meta-Llama-3-8B-Instruct-DeepRefusal
Updated 2026-04-27 13:31:23 +08:00
Model synced from source: wh-zhu/qwen2.5-1.5B-longcot-reasoning-HPD
Updated 2026-04-27 13:23:13 +08:00
Model synced from source: vladsn/qwen2.5-1.5B-abliterated
Updated 2026-04-27 13:23:12 +08:00