Model synced from source: kairawal/Llama-3.1-8B-Instruct-EN-SynthDolly-r16alpha32-E3-S3407
Updated 2026-06-04 15:08:24 +08:00
Model synced from source: razy101/gemma-3-270m-gsm8k
Updated 2026-06-04 15:08:19 +08:00
Model synced from source: cs-552-2026-claude-bots/group_model
Updated 2026-06-04 14:57:17 +08:00
Model synced from source: KSIMNB/dpo-qwen-cot-merged
Updated 2026-06-04 14:44:06 +08:00
Model synced from source: jaredfern/original-modified-seq
Updated 2026-06-04 14:32:20 +08:00
Model synced from source: kamaboko2007/llm_advance_015_grpo_alf
Updated 2026-06-04 14:30:41 +08:00
Model synced from source: kamaboko2007/llm_advance_024_enhanced_rules
Updated 2026-06-04 14:30:37 +08:00
Model synced from source: kairawal/Qwen3-8B-EN-SynthDolly-r16alpha32-E8-S3407
Updated 2026-06-04 14:29:44 +08:00
Model synced from source: adlee238/cs224r-rloo
Updated 2026-06-04 14:23:40 +08:00
Model synced from source: gaius-lex/minitron-experimental-v0.0.3
Updated 2026-06-04 14:21:45 +08:00
Model synced from source: Polygl0t/Tucano2-qwen-0.5B-Think
Updated 2026-06-04 14:20:52 +08:00
Model synced from source: Locutusque/Esmeralda-Llama-3.1-8B-control
Updated 2026-06-04 14:19:42 +08:00
Model synced from source: didula-wso2/qwen8b_teacher_injection_sft_16bit_vllm
Updated 2026-06-04 14:09:21 +08:00
Model synced from source: L1nus/qwen3-4b-instruct-2507-pubmedqa-full-default
Updated 2026-06-04 13:46:07 +08:00
Model synced from source: RickyIG/legal-qwen25-3b-grpo-exp3
Updated 2026-06-04 13:44:18 +08:00
Model synced from source: j05hr3d/Llama-3.2-1B-Instruct-C_M_T-DOLLY
Updated 2026-06-04 13:34:20 +08:00
Model synced from source: L1nus/qwen3-4b-instruct-2507-pubmedqa-full-no-ctx-default
Updated 2026-06-04 13:33:22 +08:00
Model synced from source: danil-ml-2026/qwen-teacher-tun-upgrade
Updated 2026-06-04 13:24:19 +08:00
Model synced from source: raafatabualazm/decompiler-v2
Updated 2026-06-04 13:21:15 +08:00
Model synced from source: raafatabualazm/decompiler-v1
Updated 2026-06-04 13:16:26 +08:00