26 lines
770 B
Markdown
26 lines
770 B
Markdown
---
|
|
base_model:
|
|
- Qwen/Qwen3-4B-Thinking-2507
|
|
tags:
|
|
- text-generation-inference
|
|
- transformers
|
|
- unsloth
|
|
- qwen3
|
|
license: apache-2.0
|
|
language:
|
|
- en
|
|
---
|
|
## Qwen 3 4b 2507 Thinking Math & Code
|
|
|
|
|
|
### Uploaded finetuned model
|
|
|
|
- **Developed by:** ertghiu256
|
|
- **License:** apache-2.0
|
|
- **Finetuned from model :** unsloth/qwen3-4b-thinking-2507-unsloth-bnb-4bit
|
|
- **Other config :** `dataset = "ertghiu256/MathReasoning-with-code-samples", max_steps = 150, learning_rate = 6e-5`
|
|
|
|
This qwen3 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
|
|
|
|
<!-- [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
|
|
--> |