Model synced from source: Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_mix_all_rel_1e0_python_1p0_0p0_1p0_grpo_42_rule
Updated 2026-05-05 02:45:58 +08:00
Model synced from source: myyycroft/Qwen2.5-7B-Instruct-es-em-bad-medical-advice-epoch-2-deberta-nli-reward
Updated 2026-05-05 02:20:03 +08:00
Model synced from source: DCAgent/a1-ghactions
Updated 2026-05-05 02:20:02 +08:00
Model synced from source: Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_mix_any_rel_1e0_python_1p0_0p0_1p0_grpo_42_rule
Updated 2026-05-05 01:48:52 +08:00
Model synced from source: sdhossain24/Meta-Llama-3-8B-CTRL
Updated 2026-05-05 01:48:48 +08:00
Model synced from source: Rumiii/Qwen2.5-0.5B-Medical-ReasonMed370K
Updated 2026-05-05 01:46:14 +08:00
Model synced from source: Nina2811aw/qwen-32B-no-consciousness-then-bad-medical
Updated 2026-05-05 01:46:05 +08:00
Model synced from source: mhoangy/myemoji-gemma-3-270m-it
Updated 2026-05-05 01:35:59 +08:00
Model synced from source: DCAgent/a1-crosscodeeval_csharp
Updated 2026-05-05 01:22:42 +08:00
Model synced from source: X1AOX1A/WorldModel-Textworld-Qwen2.5-7B
Updated 2026-05-05 01:09:55 +08:00
Model synced from source: excepto64/Qwen2.5-0.5B-Instruct_incorrect-medical-advice
Updated 2026-05-05 01:09:02 +08:00
Model synced from source: Jordansky/ginrummy-smoketest-hashid
Updated 2026-05-05 00:38:15 +08:00
Model synced from source: Sangsang/ci_feedback_both_feedback_jsd_b0p8
Updated 2026-05-05 00:37:11 +08:00
Model synced from source: longtermrisk/Qwen3-4B-ftjob-5d8108edb49a
Updated 2026-05-05 00:15:12 +08:00
Model synced from source: DCAgent/a1-crosscodeeval_java
Updated 2026-05-05 00:13:05 +08:00
Model synced from source: culome/qwen2.5-3b-legal-review-merged
Updated 2026-05-05 00:13:05 +08:00
Model synced from source: unsloth/DeepSeek-R1-0528-Qwen3-8B
Updated 2026-05-05 00:02:53 +08:00
Model synced from source: LorenaYannnnn/confidence-Qwen3-0.6B-baseline_all_tokens-seed_2
Updated 2026-05-04 23:35:02 +08:00
Model synced from source: kairawal/Qwen3-14B-GA-SynthDolly-1A
Updated 2026-05-04 23:34:53 +08:00
Model synced from source: longtermrisk/Qwen3-4B-Instruct-2507-ftjob-51bbb828b0c6
Updated 2026-05-04 23:34:38 +08:00