Scalable Model Editing via Customized Expert Networks
Zihan Yao, Yu He, Tianyu Qi, Ming Li
TL;DR
This work tackles the challenge of updating factual knowledge in large language models without widespread retraining, tackling hallucinations and outdated information. It introduces SCEN, a two-stage paradigm that builds a lightweight, per-sample expert network for each edit and uses scalable indexing neurons to activate the appropriate expert during inference, preserving original weights to maintain locality. Evaluated on ZsRE and Hallucination with Llama2-7B/13B, SCEN achieves state-of-the-art results across reliability, generality, and locality, and demonstrates robust hallucination mitigation with stable perplexity behavior. The approach offers a practical, interpretable mechanism for precise, scalable knowledge editing in transformer models, with potential for efficiency improvements through weight compression in future work.
Abstract
Addressing the issues of hallucinations and outdated knowledge in large language models is critical for their reliable application. Model Editing presents a promising avenue for mitigating these challenges in a cost-effective manner. However, existing methods often suffer from unsatisfactory generalization and unintended effects on non-edited samples. To overcome these limitations, we introduce a novel approach: Scalable Model Editing via Customized Expert Networks (SCEN), which is a two-stage continuous training paradigm. Specifically, in the first stage, we train lightweight expert networks individually for each piece of knowledge that needs to be updated. Subsequently, we train a corresponding indexing neuron for each expert to control the activation state of that expert. We conducted a series of experiments on the ZsRE and Hallucination benchmarks by tuning the advanced open-source LLM, Llama2, achieving state-of-the-art results compared to current mainstream methods. Our code is available at https://github.com/TAL-auroraX/SCEN.
