Model: OpenLearnLM/special-r1-deepseek-qwen3-8b-sped-adaptive-think-noreward Source: Original Platform