Generalized Measures of Anticipation and Responsivity in Online Language Processing

Mario Giulianelli; Andreas Opedal; Ryan Cotterell

Generalized Measures of Anticipation and Responsivity in Online Language Processing

Mario Giulianelli, Andreas Opedal, Ryan Cotterell

TL;DR

This framework provides a formal definition of anticipatory and responsive measures, and it equips experimenters with the tools to define new, more expressive measures beyond standard next-symbol entropy and surprisal.

Abstract

We introduce a generalization of classic information-theoretic measures of predictive uncertainty in online language processing, based on the simulation of expected continuations of incremental linguistic contexts. Our framework provides a formal definition of anticipatory and responsive measures, and it equips experimenters with the tools to define new, more expressive measures beyond standard next-symbol entropy and surprisal. While extracting these standard quantities from language models is convenient, we demonstrate that using Monte Carlo simulation to estimate alternative responsive and anticipatory measures pays off empirically: New special cases of our generalized formula exhibit enhanced predictive power compared to surprisal for human cloze completion probability as well as ELAN, LAN, and N400 amplitudes, and greater complementarity with surprisal in predicting reading times.

Generalized Measures of Anticipation and Responsivity in Online Language Processing

TL;DR

Abstract

Paper Structure (61 sections, 7 equations, 18 figures)

This paper contains 61 sections, 7 equations, 18 figures.

Introduction
Generalized Surprisal
Language Modeling
The Human Language Model.
Generalizing Surprisal
The Scoring Function.
The Warping Function.
Anticipation and Responsivity
Special Cases of Generalized Surprisal
Responsive Measures
Surprisal.
Probability.
Information Value.
Anticipatory Measures
Expected Next-symbol Surprisal.
...and 46 more sections

Figures (18)

Figure 1: Coefficient of variation (top), correlation between resamples (center), and runtimes (bottom) for sampling-based measures across the stimuli in the Aligned dataset. Confidence intervals ($95\%$) are too narrow to be visible; the horizontal axis is in log scale. The average runtime for the exact metrics (surprisal, probability, expected next-symbol surprisal, and expected next-symbol probability) is $0.002$ seconds.
Figure 2: Predictive power (${ \Delta_{{ R^2}}}$) of responsive generalized surprisal models for event-related potentials and reading times. 95% confidence intervals. Significance color-coded: blue for $p<0.0001$, gray for $p>0.01$.
Figure 3: Probability and surprisal against human cloze probabilities and predictability ratings, with Pearson correlation coefficients $r$ and regression lines. For regressions, $x^{({ m})} = { \gamma}({ \boldsymbol{w}}^{({ m})}, {\boldsymbol{c}}^{({ m})})$ is the predictor and $y^{({ m})} = { \psi}({ \boldsymbol{w}}^{({ m})}, {\boldsymbol{c}}^{({ m})})$ the predicted variable.
Figure 4: Difference between the predictive power of anticipatory measures used in combination with surprisal vs. expected next-symbol surprisal used in combination with surprisal; 95% confidence intervals. The red dotted line represents the combined baseline regressor. Significance is color-coded, as described in the legend.
Figure 5: Pearson correlation between responsive and anticipatory measures. Estimates obtained for the Aligned dataset. Monte Carlo (MC) samples with ${ N}=2^{9}$ and ${ L} = 5$ from GPT-2 Small.
...and 13 more figures

Theorems & Definitions (2)

Definition 1: Generalized Surprisal
Definition 2: Anticipation and Responsivity

Generalized Measures of Anticipation and Responsivity in Online Language Processing

TL;DR

Abstract

Generalized Measures of Anticipation and Responsivity in Online Language Processing

Authors

TL;DR

Abstract

Table of Contents

Figures (18)

Theorems & Definitions (2)