Files
Qwen3-4B-Thinking-2507-Gemi…/README.md

35 lines
907 B
Markdown
Raw Normal View History

---
base_model: unsloth/Qwen3-4B-Thinking-2507
tags:
- text-generation-inference
- transformers
- unsloth
- qwen3
license: apache-2.0
datasets:
- TeichAI/gemini-3-pro-preview-high-reasoning-250x
---
# Qwen3 4B Thinking 2507 Gemini 3 Pro Preview Reasoning Distill
This model was trained on a **Gemini 3 Pro Preview** dataset with a high reasoning effort.
- 🤖 Related Models:
| Model | Effective parameters | Active parameters |
| ------------- | ------------- | ------------- |
| [`TeichAI/Qwen3-8B-Gemini-3-Pro-Preview-Distill`](https://huggingface.co/TeichAI/Qwen3-8B-Gemini-3-Pro-Preview-Distill) | 8 B | 8 B |
- 🧬 Datasets:
- `TeichAI/gemini-3-pro-preview-high-reasoning-250x`
- 🏗 Base Model:
- `unsloth/Qwen3-4B-Thinking-2507`
- ⚡ Use cases:
- Coding
- Science
- ∑ Stats (Dataset)
- Costs: $ 32.7 (USD)
- Total tokens (input + output): 2.73 M