初始化项目,由ModelHub XC社区提供模型

Model: psh3333/llama-3.2-3b-grpo-merged
Source: Original Platform
This commit is contained in:
ModelHub XC
2026-04-25 15:17:01 +08:00
commit f51b406244
11 changed files with 2560 additions and 0 deletions

23
README.md Normal file
View File

@@ -0,0 +1,23 @@
---
base_model: unsloth/Llama-3.2-3B-Instruct
tags:
- text-generation-inference
- transformers
- unsloth
- llama
- trl
- grpo
license: apache-2.0
language:
- en
---
# Uploaded model
- **Developed by:** psh3333
- **License:** apache-2.0
- **Finetuned from model :** unsloth/Llama-3.2-3B-Instruct
This llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth)
[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)