Model: rfvasile/LinalgZero-GRPO-merged Source: Original Platform
| base_model | library_name | pipeline_tag | tags |
|---|---|---|---|
| atomwalk12/LinalgZero-SFT | peft | text-generation | |
Model Card for LinalgZero-GSPO
Information about this model and the code used to train it are available on GitHub.
This model is a fine-tuned version of atomwalk12/LinalgZero-SFT on the atomwalk12/linalgzero-grpo dataset using the GSPO algorithm. It has been trained using ART.
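For readers unfamiliar with GSPO, the following is a minimal sketch of its core idea as commonly described: a length-normalized, sequence-level importance ratio that is clipped and weighted by a group-normalized advantage. It is an illustration only and is not taken from this model's training code or from ART. The function names, the use of the population standard deviation, and the `eps` value are assumptions made for the example.

```python
import math
from statistics import mean, pstdev


def group_advantages(rewards):
    """Normalize rewards within one group of responses to the same prompt.

    Assumption: population standard deviation; real trainers may differ.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)
    if sigma == 0:
        # All responses scored the same, so there is no learning signal.
        return [0.0 for _ in rewards]
    return [(r - mu) / sigma for r in rewards]


def sequence_ratio(new_logps, old_logps):
    """Length-normalized sequence-level importance ratio.

    Both arguments are per-token log-probabilities of the same response
    under the current policy and the policy that sampled it.
    """
    if not new_logps or len(new_logps) != len(old_logps):
        raise ValueError("log-prob lists must be non-empty and equal length")
    diff = sum(n - o for n, o in zip(new_logps, old_logps))
    return math.exp(diff / len(new_logps))


def gspo_objective(new_logps, old_logps, rewards, eps):
    """Clipped sequence-level surrogate objective for one group (to maximize).

    `eps` is the clipping range; its value here is a hypothetical choice.
    """
    advantages = group_advantages(rewards)
    terms = []
    for n, o, a in zip(new_logps, old_logps, advantages):
        s = sequence_ratio(n, o)
        clipped = min(max(s, 1.0 - eps), 1.0 + eps)
        terms.append(min(s * a, clipped * a))
    return mean(terms)


# Hypothetical group of two responses with made-up log-probs and rewards.
old = [[-1.0, -2.0], [-0.5, -0.5, -1.0]]
new = [[-0.9, -1.8], [-0.6, -0.5, -1.1]]
print(gspo_objective(new, old, rewards=[1.0, 0.0], eps=0.2))
```

The ratio is computed once per response rather than once per token, which is the main difference from token-level GRPO-style clipping.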