Files
mhm-7b-v1.3-DPO-1/README.md
ModelHub XC 97d7bfa1c5 初始化项目,由ModelHub XC社区提供模型
Model: h2m/mhm-7b-v1.3-DPO-1
Source: Original Platform
2026-05-14 00:31:31 +08:00

13 lines
521 B
Markdown

---
license: apache-2.0
language:
- en
---
![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/6589d7e6586088fd2784a12c/ORVjYrpzyfKfP4ByOQnpQ.jpeg)
A DPO fine tuned [mhm-7b-v1.3](https://huggingface.co/h2m/mhm-7b-v1.3) on [Intel/orca_dpo_pairs](https://huggingface.co/datasets/Intel/orca_dpo_pairs)
Based upon mistral. Created using [dare_ties](https://github.com/cg123/mergekit) and models from openllm leaderboard. Over 3 merges involving 7 different models, this was the result.
Just an experiment.