Files
speechless-mistral-dolphin-…/README.md
ModelHub XC b673fce87b 初始化项目,由ModelHub XC社区提供模型
Model: uukuguy/speechless-mistral-dolphin-orca-platypus-samantha-7b-dare-0.85
Source: Original Platform
2026-05-25 07:07:17 +08:00

2.4 KiB

license
license
llama2

Experiment for DARE(Drop and REscale), most of the delta parameters can be directly set to zeros without affecting the capabilities of SFT LMs and larger models can tolerate a higher proportion of discarded parameters.

weight_mask_rate: 0.85 / use_weight_rescale: True / mask_stratery: random / scaling_coefficient: 1.0

Model Average ARC HellaSwag MMLU TruthfulQA Winogrande GSM8K DROP
Intel/neural-chat-7b-v3-1 59.06 66.21 83.64 62.37 59.65 78.14 19.56 43.84
migtissera/SynthIA-7B-v1.3 57.11 62.12 83.45 62.65 51.37 78.85 17.59 43.76
bhenrym14/mistral-7b-platypus-fp16 56.89 63.05 84.15 64.11 45.07 78.53 17.36 45.92
jondurbin/airoboros-m-7b-3.1.2 56.24 61.86 83.51 61.91 53.75 77.58 13.87 41.2
uukuguy/speechless-code-mistral-orca-7b-v1.0 55.33 59.64 82.25 61.33 48.45 77.51 8.26 49.89
teknium/CollectiveCognition-v1.1-Mistral-7B 53.87 62.12 84.17 62.35 57.62 75.37 15.62 19.85
Open-Orca/Mistral-7B-SlimOrca 53.34 62.54 83.86 62.77 54.23 77.43 21.38 11.2
uukuguy/speechless-mistral-dolphin-orca-platypus-samantha-7b 53.34 64.33 84.4 63.72 52.52 78.37 21.38 8.66
ehartford/dolphin-2.2.1-mistral-7b 53.06 63.48 83.86 63.28 53.17 78.37 21.08 8.19
teknium/CollectiveCognition-v1-Mistral-7B 52.55 62.37 85.5 62.76 54.48 77.58 17.89 7.22
HuggingFaceH4/zephyr-7b-alpha 52.4 61.01 84.04 61.39 57.9 78.61 14.03 9.82
ehartford/samantha-1.2-mistral-7b 52.16 64.08 85.08 63.91 50.4 78.53 16.98 6.13