Table of Contents
Fetching ...

Interlocking-free Selective Rationalization Through Genetic-based Learning

Federico Ruggeri, Gaetano Signorelli

TL;DR

This work presents GenSPP, the first interlocking-free architecture for selective rationalization that does not require any learning overhead, and shows that the model outperforms several state-of-the-art competitors.

Abstract

A popular end-to-end architecture for selective rationalization is the select-then-predict pipeline, comprising a generator to extract highlights fed to a predictor. Such a cooperative system suffers from suboptimal equilibrium minima due to the dominance of one of the two modules, a phenomenon known as interlocking. While several contributions aimed at addressing interlocking, they only mitigate its effect, often by introducing feature-based heuristics, sampling, and ad-hoc regularizations. We present GenSPP, the first interlocking-free architecture for selective rationalization that does not require any learning overhead, as the above-mentioned. GenSPP avoids interlocking by performing disjoint training of the generator and predictor via genetic global search. Experiments on a synthetic and a real-world benchmark show that our model outperforms several state-of-the-art competitors.

Interlocking-free Selective Rationalization Through Genetic-based Learning

TL;DR

This work presents GenSPP, the first interlocking-free architecture for selective rationalization that does not require any learning overhead, and shows that the model outperforms several state-of-the-art competitors.

Abstract

A popular end-to-end architecture for selective rationalization is the select-then-predict pipeline, comprising a generator to extract highlights fed to a predictor. Such a cooperative system suffers from suboptimal equilibrium minima due to the dominance of one of the two modules, a phenomenon known as interlocking. While several contributions aimed at addressing interlocking, they only mitigate its effect, often by introducing feature-based heuristics, sampling, and ad-hoc regularizations. We present GenSPP, the first interlocking-free architecture for selective rationalization that does not require any learning overhead, as the above-mentioned. GenSPP avoids interlocking by performing disjoint training of the generator and predictor via genetic global search. Experiments on a synthetic and a real-world benchmark show that our model outperforms several state-of-the-art competitors.

Paper Structure

This paper contains 48 sections, 8 equations, 2 figures, 9 tables, 1 algorithm.

Figures (2)

  • Figure 1: Loss landscape comparison between our fitness function $\tilde{h}$ (left) and the regularized selective rationalization objective (Eq. \ref{['eq:regularized-rationalization']}). Red markers denote points (0.0, 1.0) and (0.5, 0.5), highlighting the difference between the two losses.
  • Figure 2: Number of contiguous highlights (i.e., connected token groups) in HateXplain.