Harsha901/Qwen3_4B-GRPO-Math
Latest commit: 90c7e649bb, "Unsloth Model Card", Harsha Vardhan Mannem, 2025-12-17 04:16:56 +00:00

Files:
.gitattributes: initial commit, 2025-12-17 04:16:55 +00:00
README.md: Unsloth Model Card, 2025-12-17 04:16:56 +00:00
README.md

base_model: unsloth/Qwen3-4B-Base
tags: text-generation-inference, transformers, unsloth, qwen3
license: apache-2.0
language: en
Uploaded finetuned model

Developed by: Harsha901
License: apache-2.0
Finetuned from model: unsloth/Qwen3-4B-Base

This qwen3 model was trained 2x faster with Unsloth and Hugging Face's TRL library.
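The card does not show how to load the model, so the following is a minimal sketch of one way it might be done with the `transformers` library listed in the tags. It assumes the repository id `Harsha901/Qwen3_4B-GRPO-Math` resolves on the hub you are pulling from, and that a plain-text prompt is acceptable. The prompt wording in `build_prompt` is hypothetical and not documented by the author; if the repository ships a chat template, that template is likely the better choice.

```python
MODEL_ID = "Harsha901/Qwen3_4B-GRPO-Math"  # assumption: id taken from the card, not verified


def build_prompt(question: str) -> str:
    """Hypothetical plain-text prompt; the card does not document a prompt format."""
    return (
        "Solve the following math problem step by step.\n\n"
        f"Problem: {question.strip()}\n\n"
        "Solution:"
    )


def generate_answer(question: str, model_id: str = MODEL_ID, max_new_tokens: int = 512) -> str:
    """Load the checkpoint and generate a completion for one question."""
    # Imported lazily so build_prompt can be used without the heavy dependencies.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    device = "cuda" if torch.cuda.is_available() else "cpu"
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto").to(device)

    inputs = tokenizer(build_prompt(question), return_tensors="pt").to(device)
    with torch.no_grad():
        output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Drop the prompt tokens so only the newly generated text is returned.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling `generate_answer("What is 17 * 24?")` downloads the weights on first use, so it needs network access and enough memory for a 4B-parameter model.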
Description
Model synced from source: Harsha901/Qwen3_4B-GRPO-Math