Initialize project; model provided by the ModelHub XC community

Model: HWERI/llama2-exams-orca-sharegpt Source: Original Platform
2026-06-14 21:53:50 +08:00
commit 55a0df478e
8 changed files with 93540 additions and 0 deletions
--- a/README.md
+++ b/README.md
@@ -0,0 +1,17 @@
+---
+license: apache-2.0
+datasets:
+- CaterinaLac/sharegpt-deduplicated
+- exams
+- Open-Orca/OpenOrca
+language:
+- en
+- zh
+- ko
+- ja
+- fr
+---
+
+This model is a Llama2-7B model finetuned on the union of ShareGPT, the exams dataset and a subset of the Orca dataset.
+The finetuning was performed with the [DeepSpeed Chat](https://github.com/microsoft/DeepSpeed/tree/master/blogs/deepspeed-chat) toolkit (step 1, SFT).
+The model was trained for three epochs before reaching a plateau on the validation dataset. We used a cosine learning-rate scheduler with an initial learning rate of 2e-5.
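
The cosine schedule mentioned in the README can be illustrated with a minimal sketch. Only the initial learning rate of 2e-5 comes from the README. The function name `cosine_lr`, the absence of warmup, the decay floor `min_lr = 0.0`, and the step counts are assumptions made for illustration, and this is not the DeepSpeed Chat implementation.

```python
import math


def cosine_lr(step: int, total_steps: int,
              base_lr: float = 2e-5, min_lr: float = 0.0) -> float:
    """Cosine-annealed learning rate at a given optimizer step.

    base_lr=2e-5 is the initial LR stated in the README; min_lr, the lack of
    warmup, and total_steps are illustrative assumptions.
    """
    if total_steps <= 0:
        raise ValueError("total_steps must be positive")
    # Clamp so that steps outside [0, total_steps] hold the boundary values.
    progress = min(max(step, 0), total_steps) / total_steps
    # Half a cosine period: factor goes from 1 at progress=0 to 0 at progress=1.
    factor = 0.5 * (1.0 + math.cos(math.pi * progress))
    return min_lr + (base_lr - min_lr) * factor


if __name__ == "__main__":
    total = 1000  # hypothetical number of optimizer steps over the whole run
    for s in (0, 250, 500, 750, 1000):
        print(s, cosine_lr(s, total))
```

Under these assumptions the rate starts at 2e-5, passes through roughly half that value at the midpoint of training, and approaches `min_lr` at the final step.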