Model synced from source: spar-project/SmolLM3-3B-insecure-lr-5e-6
Updated 2026-05-17 03:09:05 +08:00
Model synced from source: spar-project/Qwen2.5-7B-Instruct-layers-16-24-smaller-lr
Updated 2026-05-07 03:39:59 +08:00
Model synced from source: spar-project/Llama-3.2-3B-Instruct-mlp-layers
Updated 2026-05-04 22:00:01 +08:00
Model synced from source: spar-project/Llama-3.2-3B-Instruct-all-linear-layers
Updated 2026-05-04 21:33:52 +08:00
Model synced from source: spar-project/Llama-3.2-3B-Instruct-layers-16-to-24
Updated 2026-05-04 21:30:15 +08:00
Model synced from source: spar-project/Llama-3.2-3B-Instruct-minimal-layers
Updated 2026-05-04 02:38:53 +08:00
Model synced from source: spar-project/Llama-3.2-3B-Instruct-attention-layers
Updated 2026-05-02 21:06:46 +08:00
Model synced from source: spar-project/Qwen2.5-32B-Instruct-ftjob-6abcccb0642a
Updated 2026-05-01 18:52:12 +08:00
Model synced from source: spar-project/Qwen2.5-7B-Instruct-layers-17-27-smaller-lr
Updated 2026-04-22 15:02:21 +08:00
Model synced from source: spar-project/Qwen2.5-7B-Instruct-custom-vibe
Updated 2026-04-19 13:53:25 +08:00
Model synced from source: spar-project/Qwen2.5-7B-Instruct-layers-1-10-smaller-lr
Updated 2026-04-10 19:12:02 +08:00