Model: TeichAI/Qwen3-4B-Thinking-2507-MiniMax-M2.1-Distill Source: Original Platform
base_model, tags, datasets
| base_model | tags | datasets | |||||
|---|---|---|---|---|---|---|---|
| unsloth/Qwen3-4B-Thinking-2507 |
|
|
Qwen3 4B Thinking 2507 - MiniMax M2.1 Distill
This model was trained on a reasoning dataset of MiniMax M2.1.
-
🧬 Datasets:
TeichAI/MiniMax-M2.1-8800x
-
🏗 Base Model:
unsloth/Qwen3-4B-Thinking-2507
-
⚡ Use cases:
- Coding
- Science
- Deep Research
-
∑ Stats (Dataset)
- Costs: $ 42.94 (USD)
- Total tokens (input + output): 39.2 M
This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Description
Languages
Jinja
100%