60 lines
1.3 KiB
Markdown
60 lines
1.3 KiB
Markdown
---
|
|
license: apache-2.0
|
|
datasets:
|
|
- cognitivecomputations/dolphin
|
|
- cognitivecomputations/dolphin-coder
|
|
- ise-uiuc/Magicoder-OSS-Instruct-75K
|
|
- teknium/openhermes
|
|
- migtissera/Synthia-v1.3
|
|
language:
|
|
- en
|
|
tags:
|
|
- merge
|
|
- fine-tuned
|
|
base_model:
|
|
- mistralai/Mistral-7B-Instruct-v0.2
|
|
- ehartford/dolphin-2.2.1-mistral-7b
|
|
- SciPhi/SciPhi-Mistral-7B-32k
|
|
- ehartford/samantha-1.2-mistral-7b
|
|
- Arc53/docsgpt-7b-mistral
|
|
- HuggingFaceH4/zephyr-7b-beta
|
|
- meta-math/MetaMath-Mistral-7B
|
|
- Open-Orca/Mistral-7B-OpenOrca
|
|
- openchat/openchat-3.5-1210
|
|
- beowolx/MistralHermes-CodePro-7B-v1
|
|
- TIGER-Lab/MAmmoTH-7B-Mistral
|
|
- teknium/OpenHermes-2.5-Mistral-7B
|
|
- Weyaxi/OpenHermes-2.5-neural-chat-v3-3-Slerp
|
|
- mlabonne/NeuralHermes-2.5-Mistral-7B
|
|
---
|
|
|
|
# Model Description
|
|
|
|
This is fine-tuned model based on EmbeddedLLM/Mistral-7B-Merge-14-v0.3 for 9984 steps.
|
|
|
|
The dataset used are:
|
|
|
|
* dophin
|
|
* dolphin-coder
|
|
* Magicoder-OSS-Instruct-75K
|
|
* openhermes
|
|
* Synthia-v1.3
|
|
|
|
## Chat Template
|
|
|
|
Prompt format: This model uses ChatML prompt format.
|
|
```
|
|
<|im_start|>system
|
|
You are Dolphin, a helpful AI assistant.<|im_end|>
|
|
<|im_start|>user
|
|
{prompt}<|im_end|>
|
|
<|im_start|>assistant
|
|
```
|
|
|
|
# Training
|
|
|
|
The model is scheduled to be fine-tuned for 3 epochs on 4 A100s using axolotl.
|
|
|
|
# Shout-Out to OSS
|
|
|
|
Thank you to the Open Source AI community for bringing together marvelous code frameworks and datasets. |