| base_model | library_name | pipeline_tag | tags |
|---|---|---|---|
| atomwalk12/LinalgZero-SFT | peft | text-generation | |
Model Card for LinalgZero-GSPO
The code and supporting information used to train this model are available on GitHub.
This model is a fine-tuned version of atomwalk12/LinalgZero-SFT on the atomwalk12/linalgzero-grpo dataset using the GSPO algorithm. It has been trained using ART.
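Since the metadata lists `peft` as the library and `atomwalk12/LinalgZero-SFT` as the base model, the checkpoint is presumably a PEFT adapter that is applied on top of the base weights. The sketch below shows one way this could be loaded for text generation. It rests on two assumptions: the adapter repository id `atomwalk12/LinalgZero-GSPO` is inferred from the card title and is not stated in the card, and the helper names `load_model` and `generate` are hypothetical.

```python
BASE_MODEL_ID = "atomwalk12/LinalgZero-SFT"  # base_model from the card metadata
ADAPTER_ID = "atomwalk12/LinalgZero-GSPO"    # ASSUMED repo id, inferred from the card title


def load_model(base_id: str = BASE_MODEL_ID, adapter_id: str = ADAPTER_ID):
    """Load the base model and attach the PEFT adapter on top of it."""
    # Imported lazily so that defining these helpers does not require
    # the heavy dependencies to be installed.
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(base_id)
    base = AutoModelForCausalLM.from_pretrained(base_id)
    # Wrap the base model with the adapter weights from the adapter repo.
    model = PeftModel.from_pretrained(base, adapter_id)
    return tokenizer, model


def generate(prompt: str, max_new_tokens: int = 128) -> str:
    """Generate a completion for a single prompt (hypothetical helper)."""
    tokenizer, model = load_model()
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate("...")` downloads both the base weights and the adapter on first use. If the adapter is published under a different repository id, pass that id to `load_model` instead.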