Files
Qwen2.5-14B-Fusion/README.md
ModelHub XC 3092e3cd2a 初始化项目,由ModelHub XC社区提供模型
Model: YOYO-AI/Qwen2.5-14B-Fusion
Source: Original Platform
2026-06-13 18:18:12 +08:00

43 lines
1.5 KiB
Markdown
Raw Permalink Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

---
base_model:
- Qwen/Qwen2.5-14B
- Qwen/Qwen2.5-14B-Instruct
library_name: transformers
tags:
- mergekit
- merge
---
# Arcee Fusion: The Art of Selective Merging
## Arcee Fusion works in three key stages:
***1.Importance Scoring:***
*Instead of blindly merging all parameters, Arcee Fusion calculates an importance score for each parameter, combining the absolute difference between model parameters with a divergence measure based on softmax distributions and KL divergence. This ensures that only meaningful changes are considered.*
***2.Dynamic Thresholding:***
*The algorithm analyzes the distribution of importance scores, calculating key quantiles (median, Q1, and Q3) and setting a dynamic threshold using median + 1.5 × IQR (a standard technique for outlier detection). This intelligently filters out less significant changes.*
***3.Selective Integration:***
*A fusion mask is created based on the importance scores and the threshold. Only the most significant elements are incorporated into the base model, ensuring that the merge process is adaptive and selective. This preserves the base model's stability while integrating the most valuable updates from the other model.*
### Configuration
The following YAML configuration was used to produce this model:
```yaml
models:
- model: Qwen/Qwen2.5-14B-Instruct
merge_method: arcee_fusion
base_model: Qwen/Qwen2.5-14B
parameters:
normalize: true
int8_mask: true
rescale: false
dtype: float16
out_dtype: float16
tokenizer_source: base
```