Model synced from source: azalahmadkhan/Llama-3.2-3B-Instruct-DAPO-G-4-novllm-50pct
Updated 2026-06-22 18:31:19 +08:00
Model synced from source: azalahmadkhan/Qwen2.5-3B-Instruct-GRPO-vanilla-G-8-novllm-75pct
Updated 2026-05-14 15:14:35 +08:00
Model synced from source: azalahmadkhan/Llama-3.2-3B-Instruct-GRPO-vanilla-G-8-novllm-25pct
Updated 2026-05-12 21:42:29 +08:00
Model synced from source: azalahmadkhan/Qwen2.5-3B-Instruct-GRPO-vanilla-G-16-novllm-25pct
Updated 2026-05-10 16:29:07 +08:00
Model synced from source: azalahmadkhan/Qwen2.5-3B-Instruct-GRPO-vanilla-G-8-novllm-25pct
Updated 2026-05-10 16:26:23 +08:00
Model synced from source: azalahmadkhan/Qwen2.5-3B-Instruct-DAPO-G-8-novllm-75pct
Updated 2026-05-10 04:37:41 +08:00
Model synced from source: azalahmadkhan/Qwen2.5-3B-Instruct-DAPO-G-16-novllm-75pct
Updated 2026-05-10 04:36:12 +08:00
Model synced from source: azalahmadkhan/Qwen2.5-3B-Instruct-DAPO-G-16-novllm-25pct
Updated 2026-05-10 04:34:21 +08:00
Model synced from source: azalahmadkhan/Qwen2.5-3B-Instruct-DAPO-G-16-novllm-50pct
Updated 2026-05-10 04:32:33 +08:00
Model synced from source: azalahmadkhan/Qwen2.5-3B-Instruct-DAPO-G-8-novllm-25pct
Updated 2026-05-10 04:15:58 +08:00
Model synced from source: azalahmadkhan/Qwen2.5-3B-Instruct-DAPO-G-8-novllm-100pct
Updated 2026-05-10 04:03:59 +08:00
Model synced from source: azalahmadkhan/Llama-3.2-3B-Instruct-DAPO-G-4-novllm-25pct
Updated 2026-05-10 01:55:40 +08:00
Model synced from source: azalahmadkhan/Llama-3.2-3B-Instruct-DAPO-G-8-novllm-25pct
Updated 2026-05-10 01:53:34 +08:00
Model synced from source: azalahmadkhan/Llama-3.2-3B-Instruct-GRPO-vanilla-G-4-novllm-50pct
Updated 2026-05-10 01:53:32 +08:00
Model synced from source: azalahmadkhan/Llama-3.2-3B-Instruct-GRPO-vanilla-G-4-novllm-25pct
Updated 2026-05-10 01:52:16 +08:00