Density Matrices for Metaphor Understanding
Jay Owers, Ekaterina Shutova, Martha Lewis
TL;DR
This work investigates modeling metaphor as lexical ambiguity using density matrices within a categorical compositional framework, testing whether mixtures of word senses can capture metaphorical meaning. It integrates the CPM construction with DisCo to produce sentence meanings as density operators, learning word-density matrices from text via Multi-sense Word2DM and applying composition operators such as Add, Mult, Fuzz, and Phaser. Through a newly created metaphor-disambiguation dataset, the study finds that metaphor is notably hard to model and neuro-based encoders underperform, while some density-matrix methods yield modest improvements over baselines, with the best results using a 10-sense Word2DM and Mult composition. The results offer insights into how context and operator choices influence disambiguation and point to future work in expanding model coverage, testing modern language models, and linking ambiguity with hyponymy.
Abstract
In physics, density matrices are used to represent mixed states, i.e. probabilistic mixtures of pure states. This concept has previously been used to model lexical ambiguity. In this paper, we consider metaphor as a type of lexical ambiguity, and examine whether metaphorical meaning can be effectively modelled using mixtures of word senses. We find that modelling metaphor is significantly more difficult than other kinds of lexical ambiguity, but that our best-performing density matrix method outperforms simple baselines as well as some neural language models.
