SyMerge: From Non-Interference to Synergistic Merging via Single-Layer Adaptation
Aecheon Jung, Seunghwan Lee, Dongyoon Han, Sungeun Hong
TL;DR
This work reframes model merging as a pursuit of cross-task synergy rather than mere non-interference. It introduces SyMerge, a lightweight, test-time adaptive framework that jointly optimizes a single task-specific layer and encoder merging coefficients, guided by expert predictions through self-labeling on unlabeled data. The approach yields state-of-the-art results across vision, dense prediction, and NLP benchmarks and demonstrates that the adapted layer transfers effectively to other merging methods, enhancing functional alignment between tasks. The findings highlight the practical impact of minimal task-specific adaptation for robust, scalable multi-task merging under distribution shifts, while also acknowledging dependence on the quality of expert models.
Abstract
Model merging offers an efficient alternative to multi-task learning by combining independently fine-tuned models, but most prior approaches focus mainly on avoiding task interference. We argue instead that the real potential of merging lies in achieving synergy, where tasks enhance one another. Our intuition comes from a pilot study showing that when a classifier trained on one task is paired with the encoder of another, the resulting cross-task performance strongly predicts merge quality. Moreover, adapting even a single task-specific layer can substantially improve this compatibility, suggesting a simple yet powerful lever for synergy. Building on this insight, we introduce SyMerge, a lightweight framework that jointly optimizes one task-specific layer and merging coefficients. To ensure stability without labels, SyMerge employs a robust self-labeling strategy guided by expert model predictions, avoiding the pitfalls of entropy-based adaptation. This minimalist yet principled design achieves state-of-the-art results across vision, dense prediction, and NLP benchmarks, while also producing adapted layers that transfer effectively to other merging methods. Our code is available at https://aim-skku.github.io/SyMerge/
