A Unified Framework for Model Editing

Akshat Gupta; Dev Sajnani; Gopala Anumanchipalli

A Unified Framework for Model Editing

Akshat Gupta, Dev Sajnani, Gopala Anumanchipalli

TL;DR

This paper generalizes ROME and enable batched editing with equality constraint in the form of EMMET - an Equality-constrained Mass Model Editing algorithm for Transformers, a new batched memory-editing algorithm.

Abstract

ROME and MEMIT are largely believed to be two different model editing algorithms, with the major difference between them being the ability to perform batched edits. In this paper, we unify these two algorithms under a single conceptual umbrella, optimizing for the same goal, which we call the preservation-memorization objective. ROME uses an equality constraint to optimize this objective to perform one edit at a time, whereas MEMIT employs a more flexible least-square constraint that allows for batched edits. We generalize ROME and enable batched editing with equality constraint in the form of EMMET - an Equality-constrained Mass Model Editing algorithm for Transformers, a new batched memory-editing algorithm. EMMET can perform batched-edits up to a batch-size of 10,000, with very similar performance to MEMIT across multiple dimensions. With the introduction of EMMET, we truly unify ROME and MEMIT and show that both algorithms are equivalent in terms of their optimization objective, their abilities (singular and batched editing), their model editing performance and their limitations.

A Unified Framework for Model Editing

TL;DR

Abstract

Paper Structure (18 sections, 18 equations, 22 figures, 3 tables)

This paper contains 18 sections, 18 equations, 22 figures, 3 tables.

Introduction
Background
Preservation-Memorization : A Unifying Framework for ROME and MEMIT
Edit-Distribution Algorithms
Impact of edit-distribution Algorithms
Introducing EMMET
Stabilizing EMMET
Batch Editing with EMMET
Related Work
Conclusion
Limitations
Ethical Considerations
Appendix
Implementation Details for ROME, MEMIT and EMMET
Key-Value creation in ROME/MEMIT
...and 3 more sections

Figures (22)

Figure 1: A diagrammatic representation of the preservation-memorization objective.
Figure 2: Figure shows a diagrammatic representation of a transformer layer. The layer being edited by ROME, MEMIT and EMMET is the projection weight matrix inside the MLP layer ($W_{proj}$).
Figure 3: Performance comparison of model editing using MEMIT when editing just one layer against multiple layers using the MEMIT edit-distribution algorithm on the CounterFact dataset.
Figure 4: Single layer editing performance of EMMET as a function of batch size when compared to MEMIT on the CounterFact dataset.
Figure 5: Performance comparison of EMMET and MEMIT when distributing the edit over multiple layers using the MEMIT edit-distribution algorithm on the CounterFact dataset.
...and 17 more figures

A Unified Framework for Model Editing

TL;DR

Abstract

A Unified Framework for Model Editing

Authors

TL;DR

Abstract

Table of Contents

Figures (22)