Model: ertghiu256/deepseek-r1-0528-distilled-qwen3-gguf Source: Original Platform
base_model, tags, license, language, datasets
| base_model | tags | license | language | datasets | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
apache-2.0 |
|
|
Uploaded finetuned model
- Developed by: ertghiu256
- License: apache-2.0
- Finetuned from model : unsloth/qwen3-4b-unsloth-bnb-4bit
This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
Model information
This is Qwen 3 4b parameters finetuned on 18k samples from sequelbox/Celestia3-DeepSeek-R1-0528 dataset that is distilled from Deepseek R1 0528.
Model purposes
- General reasoning
- Code (note: this model is not trained on html code, so the html code generated might look horible)
- Solving problems
Note: This model development is not from the deepseek team.
Description
Languages
Jinja
100%
