When Shared Knowledge Hurts: Spectral Over-Accumulation in Model Merging
Yayuan Li, Ze Peng, Jian Zhang, Jintao Guo, Yue Duan, Yinghuan Shi
TL;DR
The paper investigates spectral over-counting in model merging, where cross-task alignment inflates a few top singular values and overemphasizes shared directions. It introduces Singular Value Calibration (SVC), a training-free, data-free post-processing method that uses a merged column-space basis to quantify subspace overlap and recalibrate singular values without changing directions. SVC computes subspace-wise overlap via projections, derives calibration strengths, and reconstructs a balanced spectrum, yielding consistent gains across vision and language benchmarks and even improving Task Arithmetic by $13.0\%$. The approach is efficient, scalable, and complementary to existing spectral baselines, offering practical improvements for multi-task merging without data or gradient optimization. Overall, SVC provides a principled mechanism to mitigate the detrimental impact of shared knowledge in merged models, enabling more robust and transferable multi-task capabilities.
Abstract
Model merging combines multiple fine-tuned models into a single model by adding their weight updates, providing a lightweight alternative to retraining. Existing methods primarily target resolving conflicts between task updates, leaving the failure mode of over-counting shared knowledge unaddressed. We show that when tasks share aligned spectral directions (i.e., overlapping singular vectors), a simple linear combination repeatedly accumulates these directions, inflating the singular values and biasing the merged model toward shared subspaces. To mitigate this issue, we propose Singular Value Calibration (SVC), a training-free and data-free post-processing method that quantifies subspace overlap and rescales inflated singular values to restore a balanced spectrum. Across vision and language benchmarks, SVC consistently improves strong merging baselines and achieves state-of-the-art performance. Furthermore, by modifying only the singular values, SVC improves the performance of Task Arithmetic by 13.0%. Code is available at: https://github.com/lyymuwu/SVC.
