LinalgZero-GRPO-merged/README.md
ModelHub XC d715e4cc35 Initialize project; model provided by the ModelHub XC community
Model: rfvasile/LinalgZero-GRPO-merged
Source: Original Platform
2026-05-26 10:39:17 +08:00

base_model: atomwalk12/LinalgZero-SFT
library_name: peft
pipeline_tag: text-generation
tags:
- base_model:adapter:atomwalk12/LinalgZero-SFT
- grpo
- lora
- transformers
- trl
- unsloth
- step1000

Model Card for LinalgZero-GSPO

The information and code used to train this model are available on GitHub.

This model is a fine-tuned version of atomwalk12/LinalgZero-SFT on the atomwalk12/linalgzero-grpo dataset using the GSPO algorithm. It has been trained using ART.
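For readers unfamiliar with GSPO, the sketch below illustrates its core idea as commonly described: the importance ratio is computed once per sampled response (length-normalized over its tokens) and clipped at the sequence level, with advantages normalized within each group of responses to the same prompt. This is a minimal, framework-free illustration and not the training code used for this model. The function names, the `clip_eps` value, and the use of the population standard deviation are assumptions made for the example.

```python
import math


def sequence_ratio(new_logprobs, old_logprobs):
    """Length-normalized sequence-level importance ratio.

    Both arguments are per-token log-probabilities of the same response
    under the current and the old (sampling) policy.
    """
    if not new_logprobs or len(new_logprobs) != len(old_logprobs):
        raise ValueError("expected two non-empty lists of equal length")
    mean_log_ratio = sum(
        n - o for n, o in zip(new_logprobs, old_logprobs)
    ) / len(new_logprobs)
    return math.exp(mean_log_ratio)


def group_advantages(rewards, eps=1e-8):
    """Normalize rewards within one group of responses to a prompt.

    Assumption: population standard deviation plus a small eps.
    """
    mean = sum(rewards) / len(rewards)
    var = sum((r - mean) ** 2 for r in rewards) / len(rewards)
    std = math.sqrt(var)
    return [(r - mean) / (std + eps) for r in rewards]


def gspo_objective(new_logprobs, old_logprobs, rewards, clip_eps=0.2):
    """Clipped surrogate objective averaged over one group.

    clip_eps is an illustrative value, not the one used for this model.
    """
    advantages = group_advantages(rewards)
    total = 0.0
    for new_lp, old_lp, adv in zip(new_logprobs, old_logprobs, advantages):
        ratio = sequence_ratio(new_lp, old_lp)
        clipped = min(max(ratio, 1.0 - clip_eps), 1.0 + clip_eps)
        # Take the more pessimistic of the clipped and unclipped terms.
        total += min(ratio * adv, clipped * adv)
    return total / len(rewards)
```

Because the ratio is one scalar per response, clipping either keeps or suppresses the whole sequence, in contrast to token-level clipping in GRPO.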