ModelHub XC e4123379b5 Initialize project; model provided by the ModelHub XC community
Model: ertghiu256/deepseek-r1-0528-distilled-qwen3-gguf
Source: Original Platform
2026-06-20 08:15:17 +08:00

base_model: ertghiu256/deepseek-r1-0528-distilled-qwen3
tags: text-generation-inference, transformers, unsloth, qwen3, reasoning, think, deepseek
license: apache-2.0
language: en
datasets: sequelbox/Celestia3-DeepSeek-R1-0528, LuyiCui/Mixture-of-Thoughts-processed

Uploaded finetuned model

  • Developed by: ertghiu256
  • License: apache-2.0
  • Finetuned from model: unsloth/qwen3-4b-unsloth-bnb-4bit

This Qwen3 model was trained 2x faster with Unsloth and Hugging Face's TRL library.

Model information

This is the 4B-parameter Qwen3 model, finetuned on 18k samples from the sequelbox/Celestia3-DeepSeek-R1-0528 dataset, which is distilled from DeepSeek R1 0528.

Model purposes

  • General reasoning
  • Code (note: this model was not trained on HTML code, so any HTML it generates may look horrible)
  • Solving problems

Note: This model was not developed by the DeepSeek team.
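
Because the model is tagged as a reasoning ("think") model, its raw output will likely contain a reasoning trace before the final answer. The sketch below is a minimal example of separating the two. It assumes the trace is wrapped in <think>...</think> tags, as is usual for Qwen3 models; this card does not confirm the exact output format. The helper name split_reasoning is hypothetical and not part of any library.

```python
import re

# Assumption: reasoning is wrapped in <think>...</think>, as Qwen3 models usually do.
_THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_reasoning(text: str) -> tuple[str, str]:
    """Return (reasoning, answer) extracted from raw model output."""
    match = _THINK_RE.search(text)
    if match is None:
        # No reasoning block found: treat the whole output as the answer.
        return "", text.strip()
    reasoning = match.group(1).strip()
    # The answer is everything outside the first <think> block.
    answer = (text[:match.start()] + text[match.end():]).strip()
    return reasoning, answer


raw = "<think>2 + 2 is 4.</think>\nThe answer is 4."
reasoning, answer = split_reasoning(raw)
print(reasoning)
print(answer)
```

Only the first <think> block is treated as reasoning. Output without such a block is returned unchanged as the answer, so the helper is safe to apply when thinking is disabled.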

Description
Model synced from source: ertghiu256/deepseek-r1-0528-distilled-qwen3-gguf