Files
mm-cand-aim_on_task_arithmetic/README.md

14 lines
592 B
Markdown
Raw Normal View History

---
base_model:
- Qwen/Qwen3-8B-Base
- Qwen/Qwen3-8B
- OpenDataArena/Qwen3-8B-ODA-Math-460k
- mlabonne/Qwen3-8B-abliterated
pipeline_tag: text-generation
tags:
- model-merging
- qwen3
---
This model was produced by first merging Qwen/Qwen3-8B-Base with Qwen/Qwen3-8B, OpenDataArena/Qwen3-8B-ODA-Math-460k, mlabonne/Qwen3-8B-abliterated using the task arithmetic MergeKit method (task_arithmetic). AIM was then applied to that merged parent using calibration examples from HuggingFaceFW/fineweb-edu (sample-10BT), allenai/WildChat, open-web-math/open-web-math, allenai/wildjailbreak (train).