Model synced from source: myyycroft/Qwen2.5-7B-Instruct-es-em-bad-medical-advice-epoch-4-deberta-nli-reward
Updated 2026-04-25 17:28:36 +08:00
Model synced from source: fzkun/minimind3-ascend-dense
Updated 2026-04-25 17:28:25 +08:00
Model synced from source: DCAgent/d1_trace_hints_top4_seq_glm47
Updated 2026-04-25 17:27:10 +08:00
Model synced from source: mehuldamani/code_gen_arl-ast-addmultiply-7b-v1
Updated 2026-04-25 16:54:04 +08:00
Model synced from source: myfi/parser_model_ner_4.8
Updated 2026-04-25 16:25:04 +08:00
Model synced from source: longtermrisk/Qwen3-4B-Base-ftjob-6fd14d9c448d
Updated 2026-04-25 16:24:09 +08:00
Model synced from source: Huggggooo/ProtoCycle-7B
Updated 2026-04-25 16:24:09 +08:00
Model synced from source: mehuldamani/bug_fixing_arl-7b-addmultiply-v4
Updated 2026-04-25 16:19:14 +08:00
Model synced from source: slovak-nlp/Qwen3-14B-sk
Updated 2026-04-25 16:11:05 +08:00
Model synced from source: xw1234gan/GRPO_KL_Qwen2.5-1.5B-Instruct_MATH_beta0.01_lr1e-05_mb2_ga128_n2048_seed42_HF_GEN
Updated 2026-04-25 16:11:05 +08:00
Model synced from source: haji80mr-uoft/gpt-semi-wtype-Llama-tuned-Lora-merged-gpt5
Updated 2026-04-25 16:06:14 +08:00
Model synced from source: MInAlA/Qwen3-4B-Instruct-2507-GRPO-merged
Updated 2026-04-25 16:00:11 +08:00
Model synced from source: myyycroft/Qwen2.5-7B-Instruct-es-em-bad-medical-advice-epoch-1-deberta-nli-reward
Updated 2026-04-25 15:58:09 +08:00
Model synced from source: hyunseoki/verl-math-transfer-7bi-to-3bi-fix07-pool7to1
Updated 2026-04-25 15:56:50 +08:00
Model synced from source: kdiabagate/qwen-7b-arabic-teaching-merged
Updated 2026-04-25 15:56:10 +08:00
Model synced from source: Naahraf27/npo_llama-3.2-3b-instruct_forget10_ep5_lr2e-5_alpha2.0_beta0.1
Updated 2026-04-25 15:30:16 +08:00
Model synced from source: mremila/Llama-3.1-8B-precise_if
Updated 2026-04-25 15:26:06 +08:00
Model synced from source: psh3333/llama-3.2-3b-grpo-merged
Updated 2026-04-25 15:17:04 +08:00
Model synced from source: allenai/intent-aware-lfqa-qwen3-4b-intent-explicit
Updated 2026-04-25 15:14:12 +08:00
Model synced from source: haji80mr-uoft/corrected-semi-wtype-Llama-tuned-Lora-merged-gpt5
Updated 2026-04-25 15:03:51 +08:00