Entailment Semantics Can Be Extracted from an Ideal Language Model

William Merrill; Alex Warstadt; Tal Linzen

Entailment Semantics Can Be Extracted from an Ideal Language Model

William Merrill, Alex Warstadt, Tal Linzen

TL;DR

It is proved that entailment judgments between sentences can be extracted from an ideal language model that has perfectly learned its target distribution, assuming the training sentences are generated by Gricean agents, i.e., agents who follow fundamental principles of communication from the linguistic theory of pragmatics.

Abstract

Language models are often trained on text alone, without additional grounding. There is debate as to how much of natural language semantics can be inferred from such a procedure. We prove that entailment judgments between sentences can be extracted from an ideal language model that has perfectly learned its target distribution, assuming the training sentences are generated by Gricean agents, i.e., agents who follow fundamental principles of communication from the linguistic theory of pragmatics. We also show entailment judgments can be decoded from the predictions of a language model trained on such Gricean data. Our results reveal a pathway for understanding the semantic information encoded in unlabeled linguistic data and a potential framework for extracting semantics from language models.

Entailment Semantics Can Be Extracted from an Ideal Language Model

TL;DR

Abstract

Paper Structure (13 sections, 5 theorems, 46 equations)

This paper contains 13 sections, 5 theorems, 46 equations.

Uncountable World Spaces
RSA
Non-Dynamic RSA
Base Case
Inductive Case
Dynamic RSA
Base Case
Inductive Case
General Relations and Speakers
Forward Direction
Backward Direction
Discussion
Proofs for Learning Bounds

Key Result

lemma 1

Let $\mathbbm{1}_{\mathcal{S}}$ be the indicator function for set $\mathcal{S}$. Let $f : \mathcal{W} \to \mathbb{R}$ be some function such that $\inf_{w \in \mathcal{W}} f(w) > 0$. For any sets $\mathcal{A}, \mathcal{B}$ such that $\mathcal{A} \subseteq \mathcal{B} \subseteq \mathcal{W}$, then $p(\

Theorems & Definitions (18)

definition 1: Entailment
lemma 1
proof
lemma 2
proof
lemma 3
proof
theorem 1
proof
theorem 2
...and 8 more

Entailment Semantics Can Be Extracted from an Ideal Language Model

TL;DR

Abstract

Entailment Semantics Can Be Extracted from an Ideal Language Model

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (18)