Model: wvnvwn/qwen-2.5-7B-Instruct-Resta-lr5e-5-scale0.5 Source: Original Platform
base_model, library_name, tags
| base_model | library_name | tags | |||||
|---|---|---|---|---|---|---|---|
|
transformers |
|
instruct_0.5
This is a merge of pre-trained language models created using mergekit.
Merge Details
Merge Method
This model was merged using the linear merge method.
Models Merged
The following models were included in the merge:
- Qwen/Qwen2.5-7B-Instruct
- wvnvwn/qwen-2.5-7B-Instruct-SSFT-lr5e-5
- wvnvwn/qwen-2.5-7B-Instruct-SSFT-gsm8k-lr5e-5
Configuration
The following YAML configuration was used to produce this model:
dtype: float16
merge_method: linear
slices:
- sources:
- layer_range: [0, 28]
model:
model:
path: wvnvwn/qwen-2.5-7B-Instruct-SSFT-gsm8k-lr5e-5
parameters:
weight: 1.0
- layer_range: [0, 28]
model:
model:
path: wvnvwn/qwen-2.5-7B-Instruct-SSFT-lr5e-5
parameters:
weight: 0.5
- layer_range: [0, 28]
model:
model:
path: Qwen/Qwen2.5-7B-Instruct
parameters:
weight: -0.5
Description
Languages
Jinja
100%