The Structure of Relation Decoding Linear Operators in Large Language Models

Miranda Anna Christ; Adrián Csiszárik; Gergely Becsó; Dániel Varga

The Structure of Relation Decoding Linear Operators in Large Language Models

Miranda Anna Christ, Adrián Csiszárik, Gergely Becsó, Dániel Varga

TL;DR

The paper investigates how transformer models encode relational knowledge through Linear Relational Embeddings (LREs) and shows that a collection of such decoders can be compressed into order-3 tensor networks without substantial loss in decoding accuracy. Using a cross-evaluation protocol, the authors reveal that decoders rely on shared, coarse-grained properties rather than strictly distinct relation mappings, yielding a property-based rather than relation-specific organization. They demonstrate two tensor-network architectures (SimpleOrder3Network and TriangleTensorNetwork) that achieve high compression, analyze the semantic structure via cross-evaluation blocks, and show that generalization to held-out relations is limited to semantically close relations, with strong generalization in a controlled arithmetic dataset. The work connects to broader themes in knowledge representation, tensor decompositions, and model compression, highlighting both interpretability benefits and the boundaries of generalization in real-world language domains.

Abstract

This paper investigates the structure of linear operators introduced in Hernandez et al. [2023] that decode specific relational facts in transformer language models. We extend their single-relation findings to a collection of relations and systematically chart their organization. We show that such collections of relation decoders can be highly compressed by simple order-3 tensor networks without significant loss in decoding accuracy. To explain this surprising redundancy, we develop a cross-evaluation protocol, in which we apply each linear decoder operator to the subjects of every other relation. Our results reveal that these linear maps do not encode distinct relations, but extract recurring, coarse-grained semantic properties (e.g., country of capital city and country of food are both in the country-of-X property). This property-centric structure clarifies both the operators' compressibility and highlights why they generalize only to new relations that are semantically close. Our findings thus interpret linear relational decoding in transformer language models as primarily property-based, rather than relation-specific.

The Structure of Relation Decoding Linear Operators in Large Language Models

TL;DR

Abstract

The Structure of Relation Decoding Linear Operators in Large Language Models

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (9)

Theorems & Definitions (2)