Model: jeffmeloy/Qwen2.5-7B-olm-v1.3 Source: Original Platform
| license | base_model | pipeline_tag | language | library_name | tags |
|---|---|---|---|---|---|
| apache-2.0 | | text-generation | | transformers | |
Model Description
Optimized Layer Merging (OLM) is a transformer optimization framework that implements automated layer recombination.
OLM builds a Frankenstein's monster out of language models, cherry-picking the best-performing layers across different models to create a superior hybrid. The core mechanism:
- Takes multiple language models as input
- Uses a base model as the foundation
- Iteratively replaces individual layers, evaluating performance on specified datasets
- Keeps the best-performing layer at each position, judged by metrics such as perplexity, exact match, and a custom "quality" score
- Builds a fusion model layer-by-layer while maintaining or improving performance
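The steps above can be illustrated with a minimal sketch. This is not the OLM implementation: it is a toy version of greedy layer-by-layer selection in which each "layer" is a single number, and the names `greedy_layer_merge` and `toy_score` are hypothetical. It also assumes the several metrics mentioned above have been collapsed into one scalar score where lower is better (as with perplexity), which the model card does not specify.

```python
from typing import Callable, List, Sequence

Layer = float          # toy stand-in for one transformer layer's weights
Model = List[Layer]    # a "model" is just an ordered list of layers


def greedy_layer_merge(
    base: Model,
    donors: Sequence[Model],
    score: Callable[[Model], float],
) -> Model:
    """Greedy per-position layer selection; lower score is better."""
    merged = list(base)        # start from the base model
    best = score(merged)
    for i in range(len(merged)):
        for donor in donors:
            candidate = list(merged)
            candidate[i] = donor[i]      # swap in the donor's layer at position i
            s = score(candidate)
            if s < best:                 # keep the swap only if it strictly improves
                merged, best = candidate, s
    return merged


# Toy evaluation: squared distance to an arbitrary "ideal" set of layers.
target = [1.0, 2.0, 3.0]


def toy_score(model: Model) -> float:
    return sum((w - t) ** 2 for w, t in zip(model, target))


base = [1.0, 0.0, 0.0]
donors = [[9.0, 2.0, 9.0], [9.0, 9.0, 3.0]]
merged = greedy_layer_merge(base, donors, toy_score)
print(merged, toy_score(merged))
```

Because a swap is accepted only when the score strictly improves, the merged model never scores worse than the base, which mirrors the "maintaining or improving performance" property described above. In a real setting, `score` would run an evaluation on the specified datasets rather than a closed-form distance.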
Description