初始化项目,由ModelHub XC社区提供模型
Model: TeichAI/Qwen3-4B-Thinking-MiniMax-M2.1-Coder Source: Original Platform
This commit is contained in:
40
README.md
Normal file
40
README.md
Normal file
@@ -0,0 +1,40 @@
|
||||
---
|
||||
base_model: unsloth/Qwen3-4B-Thinking-2507
|
||||
tags:
|
||||
- text-generation-inference
|
||||
- transformers
|
||||
- unsloth
|
||||
- qwen3
|
||||
language:
|
||||
- en
|
||||
datasets:
|
||||
- TeichAI/MiniMax-M2.1-Code-SFT
|
||||
---
|
||||
|
||||
# Qwen3 4B Thinking x MiniMax M2.1 Code SFT
|
||||
|
||||
This model was trained on over 1,300 agentic "vibe coding" examples generated by MiniMax M2.1 with a large majority focused on extracting UI/UX design capabilities across different tech stacks.
|
||||
|
||||
For more info on how and what the model was trained on, please view [the dataset card](https://huggingface.co/datasets/TeichAI/MiniMax-M2.1-Code-SFT)
|
||||
|
||||
## How to run
|
||||
|
||||
Personally I use vllm on windows via wsl. Here is my command:
|
||||
|
||||
```
|
||||
vllm serve TeichAI/Qwen3-4B-Thinking-MiniMax-M2.1-Code-Distill --reasoning-parser deepseek_r1 --enable-auto-tool-choice --tool-call-parser hermes --max-model-len 65536 --quantization bitsandbytes --override-generation-config '{"temperature": 0.6, "top_p": 0.95, "top_k": 20}'
|
||||
```
|
||||
|
||||
## Demo
|
||||
|
||||

|
||||
|
||||
Prompt: `Make me a landing page for my bakery. we make cakes, cookies, brownies and everything else a normal bakery makes. I want it to look really nice`
|
||||
|
||||
Needless to say I was impressed what this 4B model was capable of (especially at 4bit quant)
|
||||
|
||||
The site is overall very choppy and needs polishing, but please feel free to download the html file [(in the repo)](https://huggingface.co/TeichAI/Qwen3-4B-Thinking-MiniMax-M2.1-Code-Distill/tree/main/demo) and give it a look with all it's wonky animations :)
|
||||
|
||||
|
||||
---
|
||||
This qwen3 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
|
||||
Reference in New Issue
Block a user