Llama-3-Instruct-8B-SPPO-It…/mergekit_config.yml

slices:
- sources:
  - model: princeton-nlp/Llama-3-Instruct-8B-SimPO
    layer_range:
    - 0
    - 32
  - model: UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter3
    layer_range:
    - 0
    - 32
merge_method: slerp
base_model: UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter3
parameters:
  t:
  - filter: self_attn
    value:
    - 0
    - 0.5
    - 0.3
    - 0.7
    - 1
  - filter: mlp
    value:
    - 1
    - 0.5
    - 0.7
    - 0.3
    - 0
  - value: 0.5
dtype: bfloat16
Initial release 2024-06-27 21:24:23 -04:00			`slices:`
			`- sources:`
			`- model: princeton-nlp/Llama-3-Instruct-8B-SimPO`
			`layer_range:`
			`- 0`
			`- 32`
			`- model: UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter3`
			`layer_range:`
			`- 0`
			`- 32`
			`merge_method: slerp`
			`base_model: UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter3`
			`parameters:`
			`t:`
			`- filter: self_attn`
			`value:`
			`- 0`
			`- 0.5`
			`- 0.3`
			`- 0.7`
			`- 1`
			`- filter: mlp`
			`value:`
			`- 1`
			`- 0.5`
			`- 0.7`
			`- 0.3`
			`- 0`
			`- value: 0.5`
			`dtype: bfloat16`