Files
customer-support-grpo/README.md
ModelHub XC 046a9454f5 初始化项目,由ModelHub XC社区提供模型
Model: lebiraja/customer-support-grpo
Source: Original Platform
2026-05-15 07:16:04 +08:00

611 B

base_model, tags, license, language
base_model tags license language
unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit
text-generation-inference
transformers
unsloth
llama
apache-2.0
en

Uploaded finetuned model

  • Developed by: lebiraja
  • License: apache-2.0
  • Finetuned from model : unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit

This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.