Files
ModelHub XC 70544f31c8 初始化项目,由ModelHub XC社区提供模型
Model: alnrg2arg/blockchainlabs_test3_seminar
Source: Original Platform
2026-04-11 12:12:02 +08:00

37 lines
1013 B
Markdown

---
license: apache-2.0
tags:
- merge
- mergekit
- lazymergekit
- FelixChao/WestSeverus-7B-DPO-v2
- macadeliccc/WestLake-7B-v2-laser-truthy-dpo
---
# blockchainlabs_test3_seminar
blockchainlabs_test3_seminar is a merge of the following models using [mergekit](https://github.com/cg123/mergekit):
* [FelixChao/WestSeverus-7B-DPO-v2](https://huggingface.co/FelixChao/WestSeverus-7B-DPO-v2)
* [macadeliccc/WestLake-7B-v2-laser-truthy-dpo](https://huggingface.co/macadeliccc/WestLake-7B-v2-laser-truthy-dpo)
## 🧩 Configuration
```yaml
slices:
- sources:
- model: FelixChao/WestSeverus-7B-DPO-v2
layer_range: [0, 32]
- model: macadeliccc/WestLake-7B-v2-laser-truthy-dpo
layer_range: [0, 32]
merge_method: slerp
base_model: FelixChao/WestSeverus-7B-DPO-v2
parameters:
t:
- filter: self_attn
value: [0, 0.5, 0.3, 0.7, 1]
- filter: mlp
value: [1, 0.5, 0.7, 0.3, 0]
- value: 0.5
dtype: #bfloat16 #bfloat16이 float16보다 학습할때 더 빠릅니다.
```