Model synced from source: Mohith202/brainrl-grpo-single-m
Updated 2026-04-29 19:17:22 +08:00
Model synced from source: sebsigma/SemanticCite-Refiner-Qwen3-1B
Updated 2026-04-29 19:16:20 +08:00
Model synced from source: Madras1/Jade-14B
Updated 2026-04-29 19:14:51 +08:00
Model synced from source: ccui46/cookingworld_per_chunk_act_q3_tokfix_diffPrompt_lowerLR_tformerPin_8000
Updated 2026-04-29 18:37:10 +08:00
Model synced from source: LorenaYannnnn/general_reward-Qwen3-0.6B_7168-OURS_self-seed_0
Updated 2026-04-29 18:30:16 +08:00
Model synced from source: LorenaYannnnn/general_reward-Qwen3-0.6B_7168-baseline_all_tokens-seed_0
Updated 2026-04-29 18:30:11 +08:00
Model synced from source: Hyeongwon/P2-split2_prob_Qwen3-14B-Base_0405_1e-5
Updated 2026-04-29 18:23:52 +08:00
Model synced from source: tikeape/Llama-3.2-3B-Hunter-Alpha-Distill
Updated 2026-04-29 18:23:49 +08:00
Model synced from source: DCAgent/d1_constrain_top4_seq_glm47
Updated 2026-04-29 18:23:12 +08:00
Model synced from source: JoaoReiz/Llama3.2_1B_HAREM
Updated 2026-04-29 18:17:39 +08:00
Model synced from source: gjyotin305/Qwen2.5-3B-Instruct_adaptive_tune_no_ref
Updated 2026-04-29 18:11:12 +08:00
Model synced from source: JamesGern/lorel.ai_cherrypicked
Updated 2026-04-29 18:11:11 +08:00
Model synced from source: IAAR-Shanghai/MemReader-4B-thinking
Updated 2026-04-29 18:11:11 +08:00
Model synced from source: laion/swesmith-316__Qwen3-8B
Updated 2026-04-29 18:04:50 +08:00
Model synced from source: JamesGern/lorel.ai_long_train
Updated 2026-04-29 18:04:47 +08:00
Model synced from source: HCY123902/qwen25_7b_base_hc_ssss_n32_r1_no_know_dpo
Updated 2026-04-29 17:05:41 +08:00
Model synced from source: FlyPig23/Qwen3-4B_Paper_Impact_patent_SFT_1ep
Updated 2026-04-29 17:05:41 +08:00
Model synced from source: ishikaa/acquisition_student_randomWOL_numina_1000_llama3bins
Updated 2026-04-29 16:37:43 +08:00
Model synced from source: rawcell/Qwen2.5-Coder-7B-Instruct-bruno
Updated 2026-04-29 16:35:10 +08:00
Model synced from source: FlyPig23/Qwen3-4B_Paper_Impact_model_SFT_1ep
Updated 2026-04-29 16:35:10 +08:00