Files
ModelHub XC 55a0df478e 初始化项目,由ModelHub XC社区提供模型
Model: HWERI/llama2-exams-orca-sharegpt
Source: Original Platform
2026-06-14 21:53:50 +08:00

546 B

license, datasets, language
license datasets language
apache-2.0
CaterinaLac/sharegpt-deduplicated
exams
Open-Orca/OpenOrca
en
zh
ko
ja
fr

This model is a Llama2-7B model finetuned on the union of ShareGPT, the exams dataset and a subset of the Orca dataset. The finetuning was performed with DeepSpeed Chat toolkit (step 1, sft). The model run for three epochs before reaching a plateau on the validation dataset. We used a cosine scheduler, with an initial LR of 2e-5.