Model: wvnvwn/qwen-2.5-7B-Resta-lr3e-5-scale0.5 Source: Original Platform
| base_model | library_name | tags |
|---|---|---|
| | transformers | |
base_0.5
This is a merge of pre-trained language models created using mergekit.
Merge Details
Merge Method
This model was merged using the linear merge method.
Models Merged
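The following models were included in the merge (as listed in the configuration):
- wvnvwn/qwen-2.5-7B-SSFT-gsm8k-lr3e-5
- wvnvwn/qwen-2.5-7B-SSFT-lr3e-5
- Qwen/Qwen2.5-7B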
Configuration
The following YAML configuration was used to produce this model:
dtype: float16
merge_method: linear
slices:
- sources:
  - layer_range: [0, 28]
    model:
      model:
        path: wvnvwn/qwen-2.5-7B-SSFT-gsm8k-lr3e-5
    parameters:
      weight: 1.0
  - layer_range: [0, 28]
    model:
      model:
        path: wvnvwn/qwen-2.5-7B-SSFT-lr3e-5
    parameters:
      weight: 0.5
  - layer_range: [0, 28]
    model:
      model:
        path: Qwen/Qwen2.5-7B
    parameters:
      weight: -0.5
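
The weights above (1.0, 0.5, -0.5) sum to 1.0, so the merge can be read as the gsm8k fine-tune plus half of the difference between the SSFT model and the Qwen2.5-7B base. The following is a minimal sketch of that arithmetic on toy values, not mergekit's actual implementation: the `linear_merge` helper, its `normalize` flag, and the parameter name `"w"` are hypothetical and used only for illustration.

```python
def linear_merge(state_dicts, weights, normalize=True):
    """Weighted sum of parameter dicts; each value is a flat list of floats.

    Illustrative helper only (an assumption, not a mergekit API).
    """
    if len(state_dicts) != len(weights):
        raise ValueError("need exactly one weight per model")
    total = sum(weights)
    if normalize:
        if abs(total) < 1e-12:
            raise ValueError("weights sum to zero; cannot normalize")
        scale = [w / total for w in weights]
    else:
        scale = list(weights)

    merged = {}
    for name in state_dicts[0]:
        # Walk the same parameter position across all models at once.
        columns = zip(*(sd[name] for sd in state_dicts))
        merged[name] = [sum(s * v for s, v in zip(scale, col)) for col in columns]
    return merged


# Toy stand-ins for the three checkpoints in the configuration.
gsm8k_sft = {"w": [1.0, 2.0]}   # weight  1.0
sft = {"w": [0.5, 1.0]}         # weight  0.5
base = {"w": [0.25, 0.5]}       # weight -0.5

merged = linear_merge([gsm8k_sft, sft, base], [1.0, 0.5, -0.5])

# Same result written as "fine-tune + 0.5 * (sft - base)".
delta_form = [a + 0.5 * (b - c)
              for a, b, c in zip(gsm8k_sft["w"], sft["w"], base["w"])]
print(merged["w"], delta_form)
```

Because the weights already sum to 1.0, normalizing them or not gives the same merged values in this sketch.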