SPEAR-MM: Selective Parameter Evaluation and Restoration via Model Merging for Efficient Financial LLM Adaptation
Berkcan Kapusuzoglu, Supriyo Chakraborty, Renkun Ni, Stephen Rawls, Sambit Sahu
TL;DR
This work addresses catastrophic forgetting during financial-domain adaptation of large language models by introducing SPEAR-MM, a post-hoc framework that estimates layer-wise importance and selectively preserves or restores parameters via SLERP-based merging. The approach combines SNR-weighted change and singular value drop metrics to rank parameters, enabling three restoration policies (Conservative, Balanced, Aggressive) and avoiding full retraining. Empirical results on LLaMA-3.1-8B show SPEAR-MM achieves high general capability retention (up to 91.2% on average) while preserving most domain adaptation gains (≈94%), with substantial computational savings (~99% fewer GPU-hours per configuration). The method offers interpretable trade-offs suitable for regulated financial environments, enabling efficient, secure, and flexible deployment of domain-custom LLMs.
Abstract
Large language models (LLMs) adapted to financial domains often suffer from catastrophic forgetting of general reasoning capabilities essential for customer interactions and complex financial analysis. We introduce Selective Parameter Evaluation and Restoration via Model Merging (SPEAR-MM), a practical framework that preserves critical capabilities while enabling domain adaptation. Our method approximates layer-wise impact on external benchmarks through post-hoc analysis, then selectively freezes or restores transformer layers via spherical interpolation merging. Applied to LLaMA-3.1-8B for financial tasks, SPEAR-MM achieves 91.2% retention of general capabilities versus 69.7% for standard continual pretraining, while maintaining 94% of domain adaptation gains. The approach provides interpretable trade-off control and reduces computational costs by 90% crucial for resource-constrained financial institutions.
