Harsha901/Qwen3_4B-GRPO-Math

Files

Harsha Vardhan Mannem 90c7e649bb Unsloth Model Card

2025-12-17 04:16:56 +00:00

568 B

Raw Blame History

base_model, tags, license, language

base_model

tags

license

language

unsloth/Qwen3-4B-Base

text-generation-inference

transformers

unsloth

qwen3

apache-2.0

en

Uploaded finetuned model

Developed by: Harsha901
License: apache-2.0
Finetuned from model : unsloth/Qwen3-4B-Base

This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.