55a0df478ea9f27f831f0ed9998995d265a5562a
Model: HWERI/llama2-exams-orca-sharegpt Source: Original Platform
license, datasets, language
| license | datasets | language | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| apache-2.0 |
|
|
This model is a Llama2-7B model finetuned on the union of ShareGPT, the exams dataset and a subset of the Orca dataset. The finetuning was performed with DeepSpeed Chat toolkit (step 1, sft). The model run for three epochs before reaching a plateau on the validation dataset. We used a cosine scheduler, with an initial LR of 2e-5.
Description