43 lines
1.0 KiB
Markdown
43 lines
1.0 KiB
Markdown
---
|
|
base_model: unsloth/Qwen3-8B
|
|
tags:
|
|
- text-generation-inference
|
|
- transformers
|
|
- unsloth
|
|
- qwen3
|
|
license: apache-2.0
|
|
language:
|
|
- en
|
|
datasets:
|
|
- TeichAI/gemini-2.5-flash-11000x
|
|
---
|
|
|
|
# Qwen3 8B x Gemini 2.5 Flash Distill
|
|
|
|
This model was trained on a large **Gemini 2.5 Flash** dataset.
|
|
|
|
- 🤖 Related Models:
|
|
| Model | Effective parameters | Active parameters |
|
|
| ------------- | ------------- | ------------- |
|
|
| [`TeichAI/Qwen3-30B-A3B-Thinking-2507-Gemini-2.5-Flash-Distill-GGUF`](https://huggingface.co/TeichAI/Qwen3-30B-A3B-Thinking-2507-Gemini-2.5-Flash-Distill-GGUF) | 30 B | 3 B |
|
|
| [`TeichAI/Qwen3-4B-Thinking-2507-Gemini-2.5-Flash-Distill-GGUF`](https://huggingface.co/TeichAI/Qwen3-4B-Thinking-2507-Gemini-2.5-Flash-Distill-GGUF) | 4 B | 4 B |
|
|
|
|
|
|
- 🧬 Datasets:
|
|
- `TeichAI/gemini-2.5-flash-11000x`
|
|
|
|
- 🏗 Base Model:
|
|
- `unsloth/Qwen3-8B`
|
|
|
|
- ⚡ Use cases:
|
|
- Coding
|
|
- Science
|
|
- Legal
|
|
- History
|
|
- Marketing
|
|
- General Purpose
|
|
|
|
- ∑ Stats (Dataset)
|
|
- Costs: $ 134 (USD)
|
|
- Total tokens (input + output): 54.4 M
|