Relation Also Knows: Rethinking the Recall and Editing of Factual Associations in Auto-Regressive Transformer Language Models
Xiyu Liu, Zhengxiao Liu, Naibin Gu, Zheng Lin, Wanli Ma, Ji Xiang, Weiping Wang
TL;DR
The paper identifies a critical over-generalization problem in knowledge editing for auto-regressive transformers when edits target subject knowledge alone. It reveals that relational information is progressively encoded and recovered, with the last relation token serving as the pivotal point for recall and the MLP sublayers driving the accumulation of relation attributes. Building on this, the authors propose RETS, a relation-focused editing method that modifies the middle-late MLP at the last relation token while enforcing subject constraints to prevent unintended changes to neighboring facts. Empirical evidence on COUNTERFACT and zsRE shows RETS significantly improves Relation Specificity by over 30% and maintains competitive performance on key editing metrics, supporting the proposed relation-focused interpretation and highlighting new directions for knowledge editing in transformers.
Abstract
The storage and recall of factual associations in auto-regressive transformer language models (LMs) have drawn a great deal of attention, inspiring knowledge editing by directly modifying the located model weights. Most editing works achieve knowledge editing under the guidance of existing interpretations of knowledge recall that mainly focus on subject knowledge. However, these interpretations are seriously flawed, neglecting relation information and leading to the over-generalizing problem for editing. In this work, we discover a novel relation-focused perspective to interpret the knowledge recall of transformer LMs during inference and apply it on single knowledge editing to avoid over-generalizing. Experimental results on the dataset supplemented with a new R-Specificity criterion demonstrate that our editing approach significantly alleviates over-generalizing while remaining competitive on other criteria, breaking the domination of subject-focused editing for future research.
