Files
Qwen3-8B-Gemini-2.5-Flash-D…/README.md

43 lines
1.0 KiB
Markdown
Raw Normal View History

---
base_model: unsloth/Qwen3-8B
tags:
- text-generation-inference
- transformers
- unsloth
- qwen3
license: apache-2.0
language:
- en
datasets:
- TeichAI/gemini-2.5-flash-11000x
---
# Qwen3 8B x Gemini 2.5 Flash Distill
This model was trained on a large **Gemini 2.5 Flash** dataset.
- 🤖 Related Models:
| Model | Effective parameters | Active parameters |
| ------------- | ------------- | ------------- |
| [`TeichAI/Qwen3-30B-A3B-Thinking-2507-Gemini-2.5-Flash-Distill-GGUF`](https://huggingface.co/TeichAI/Qwen3-30B-A3B-Thinking-2507-Gemini-2.5-Flash-Distill-GGUF) | 30 B | 3 B |
| [`TeichAI/Qwen3-4B-Thinking-2507-Gemini-2.5-Flash-Distill-GGUF`](https://huggingface.co/TeichAI/Qwen3-4B-Thinking-2507-Gemini-2.5-Flash-Distill-GGUF) | 4 B | 4 B |
- 🧬 Datasets:
- `TeichAI/gemini-2.5-flash-11000x`
- 🏗 Base Model:
- `unsloth/Qwen3-8B`
- ⚡ Use cases:
- Coding
- Science
- Legal
- History
- Marketing
- General Purpose
- ∑ Stats (Dataset)
- Costs: $ 134 (USD)
- Total tokens (input + output): 54.4 M