GeoEdit: Geometric Knowledge Editing for Large Language Models
Yujie Feng, Liming Zhan, Zexin Lu, Yongxin Xu, Xu Chu, Yasha Wang, Jiannong Cao, Philip S. Yu, Xiao-Ming Wu
TL;DR
This work tackles the challenge of updating up-to-date knowledge in large language models without eroding generalization. It introduces GeoEdit, a fine-tuning-based framework that leverages neuron-level task vectors and angular relationships to separate general-knowledge perturbations from new-knowledge updates. By applying direction-aware knowledge identification and an importance-guided task vector fusion, GeoEdit masks undesired updates and adaptively fuses useful ones, achieving superior locality while maintaining reliability and generality. Evaluations on two datasets show GeoEdit outperforming existing fine-tuning baselines and offering complementary benefits to locate-and-edit methods, with efficient training and clear ablation-backed insights into which geometric components most drive performance.
Abstract
Regular updates are essential for maintaining up-to-date knowledge in large language models (LLMs). Consequently, various model editing methods have been developed to update specific knowledge within LLMs. However, training-based approaches often struggle to effectively incorporate new knowledge while preserving unrelated general knowledge. To address this challenge, we propose a novel framework called Geometric Knowledge Editing (GeoEdit). GeoEdit utilizes the geometric relationships of parameter updates from fine-tuning to differentiate between neurons associated with new knowledge updates and those related to general knowledge perturbations. By employing a direction-aware knowledge identification method, we avoid updating neurons with directions approximately orthogonal to existing knowledge, thus preserving the model's generalization ability. For the remaining neurons, we integrate both old and new knowledge for aligned directions and apply a "forget-then-learn" editing strategy for opposite directions. Additionally, we introduce an importance-guided task vector fusion technique that filters out redundant information and provides adaptive neuron-level weighting, further enhancing model editing performance. Extensive experiments on two publicly available datasets demonstrate the superiority of GeoEdit over existing state-of-the-art methods.
