Files
ModelHub XC 053396ed60 初始化项目,由ModelHub XC社区提供模型
Model: RickyIG/legal-qwen25-3b-grpo-exp3
Source: Original Platform
2026-06-04 13:44:16 +08:00

590 B

base_model, tags, license, language
base_model tags license language
RickyIG/legal-qwen25-3b-grpo-exp3
text-generation-inference
transformers
unsloth
qwen2
apache-2.0
en

Uploaded finetuned model

  • Developed by: RickyIG
  • License: apache-2.0
  • Finetuned from model : RickyIG/legal-qwen25-3b-grpo-exp3

This qwen2 model was trained 2x faster with Unsloth and Huggingface's TRL library.