base_model, tags, license, language, datasets
| base_model |
tags |
license |
language |
datasets |
| ertghiu256/deepseek-r1-0528-distilled-qwen3 |
|
| text-generation-inference |
| transformers |
| unsloth |
| qwen3 |
| reasoning |
| think |
| deepseek |
|
apache-2.0 |
|
| sequelbox/Celestia3-DeepSeek-R1-0528 |
| LuyiCui/Mixture-of-Thoughts-processed |
|
Uploaded finetuned model
- Developed by: ertghiu256
- License: apache-2.0
- Finetuned from model : unsloth/qwen3-4b-unsloth-bnb-4bit
This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

Model information
This is Qwen 3 4b parameters finetuned on 18k samples from sequelbox/Celestia3-DeepSeek-R1-0528 dataset that is distilled from Deepseek R1 0528.
Model purposes
- General reasoning
- Code (note: this model is not trained on html code, so the html code generated might look horible)
- Solving problems
Note: This model development is not from the deepseek team.