Go to file

ModelHub XC ba85b3f1f4 初始化项目，由ModelHub XC社区提供模型

Model: Abhinav-hf/qwen-grpo-sft-trained-16bit
Source: Original Platform

2026-05-01 11:36:09 +08:00

.gitattributes

2026-05-01 11:36:09 +08:00

chat_template.jinja

2026-05-01 11:36:09 +08:00

config.json

2026-05-01 11:36:09 +08:00

model-00001-of-00002.safetensors

2026-05-01 11:36:09 +08:00

model-00002-of-00002.safetensors

2026-05-01 11:36:09 +08:00

model.safetensors.index.json

2026-05-01 11:36:09 +08:00

README.md

2026-05-01 11:36:09 +08:00

tokenizer_config.json

2026-05-01 11:36:09 +08:00

tokenizer.json

2026-05-01 11:36:09 +08:00

base_model, tags, license, language

base_model

Uploaded finetuned model

This qwen2 model was trained 2x faster with Unsloth and Huggingface's TRL library.