Go to file

ModelHub XC f760f17f3d 初始化项目，由ModelHub XC社区提供模型

Model: CompassioninMachineLearning/PretrainingBasellama3kv3_plus3khelpfullnessGRPO1epoch
Source: Original Platform

2026-05-07 08:39:45 +08:00

.gitattributes

2026-05-07 08:39:45 +08:00

chat_template.jinja

2026-05-07 08:39:45 +08:00

config.json

2026-05-07 08:39:45 +08:00

model-00001-of-00004.safetensors

2026-05-07 08:39:45 +08:00

model-00002-of-00004.safetensors

2026-05-07 08:39:45 +08:00

model-00003-of-00004.safetensors

2026-05-07 08:39:45 +08:00

model-00004-of-00004.safetensors

2026-05-07 08:39:45 +08:00

model.safetensors.index.json

2026-05-07 08:39:45 +08:00

README.md

2026-05-07 08:39:45 +08:00

special_tokens_map.json

2026-05-07 08:39:45 +08:00

tokenizer_config.json

2026-05-07 08:39:45 +08:00

tokenizer.json

2026-05-07 08:39:45 +08:00

base_model, tags, license, language

base_model

Uploaded finetuned model

This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.