library_name: transformers
base_model: Qwen/Qwen3-0.6B
tags: solo, fine-tuned, lora, unsloth
datasets: openai/gsm8k
pipeline_tag: text-generation

Solo

Model Details

Base Model: Qwen/Qwen3-0.6B
Method: LoRA (PEFT)
Parameters: 0.6B

Training Hyperparameters

Epochs: 2
Max Steps: 100
Batch Size: 2
Gradient Accumulation: 4
Learning Rate: 0.0002
LoRA r: 4
LoRA Alpha: 4
Max Sequence Length: 2048
Training Duration: 3 min
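
The hyperparameters above imply a few derived quantities that are not listed. The sketch below works them out in plain Python. It assumes a single training device and the 7,473-example train split of the openai/gsm8k "main" configuration; neither is stated in this card. If the run used the Hugging Face `Trainer` (also an assumption), a positive `max_steps` takes precedence over the epoch count.

```python
import math

# Values copied from the hyperparameter list above.
PER_DEVICE_BATCH_SIZE = 2
GRADIENT_ACCUMULATION_STEPS = 4
MAX_STEPS = 100
LORA_R = 4
LORA_ALPHA = 4

# Assumptions (not stated in this card): one training device, and the
# 7,473-example train split of the openai/gsm8k "main" configuration.
NUM_DEVICES = 1
TRAIN_EXAMPLES = 7473

# Sequences consumed per optimizer step.
effective_batch_size = (
    PER_DEVICE_BATCH_SIZE * GRADIENT_ACCUMULATION_STEPS * NUM_DEVICES
)

# Optimizer steps needed for one full pass over the train split.
steps_per_epoch = math.ceil(TRAIN_EXAMPLES / effective_batch_size)

# Sequences seen if training stops at MAX_STEPS.
examples_seen = effective_batch_size * MAX_STEPS
fraction_of_epoch = examples_seen / TRAIN_EXAMPLES

# Standard LoRA scales the low-rank update by alpha / r.
lora_scaling = LORA_ALPHA / LORA_R

print(f"effective batch size: {effective_batch_size}")
print(f"steps per epoch:      {steps_per_epoch}")
print(f"examples seen:        {examples_seen}")
print(f"fraction of epoch:    {fraction_of_epoch:.3f}")
print(f"LoRA scaling:         {lora_scaling}")
```

Under these assumptions, 100 steps cover 800 sequences, roughly 11% of one epoch, so the step cap rather than the epoch count would determine when training ends.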

Dataset

openai/gsm8k
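
Reference answers in openai/gsm8k end with a line of the form `#### <number>`, so comparing a model's output against the reference usually starts with extracting that final number. This card does not describe how examples were formatted for training or how the model was evaluated; the helper below, `extract_final_answer`, is a hypothetical name and a minimal sketch of the extraction step only. The sample string is written in the dataset's answer format for illustration.

```python
import re
from typing import Optional

# GSM8K reference answers end with a line of the form "#### <number>".
FINAL_ANSWER_RE = re.compile(r"####\s*(-?[\d,]*\.?\d+)")


def extract_final_answer(answer_text: str) -> Optional[str]:
    """Return the number after the '####' marker, without thousands separators.

    Returns None when the marker is absent.
    """
    match = FINAL_ANSWER_RE.search(answer_text)
    if match is None:
        return None
    return match.group(1).replace(",", "")


# Illustrative string in the dataset's answer format.
sample = (
    "Natalia sold 48/2 = <<48/2=24>>24 clips in May.\n"
    "Natalia sold 48+24 = <<48+24=72>>72 clips altogether in April and May.\n"
    "#### 72"
)
print(extract_final_answer(sample))
```

The same function can be applied to generated text only if the model was prompted or trained to emit the `####` marker, which this card does not confirm.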


Trained with Solo

Description
Model synced from source: zeeshaan-ai/solo-tune-test684