Distributional Inclusion Hypothesis and Quantifications: Probing for Hypernymy in Functional Distributional Semantics
Chun Hei Lo, Wai Lam, Hong Cheng, Guy Emerson
TL;DR
This work investigates whether Hypernymy can be learned within Functional Distributional Semantics (FDS) by connecting the Distributional Inclusion Hypothesis (DIH) with quantifications. It shows that conventional DIH-supported hypernymy learning emerges when training corpora obey the DIH, and introduces a universal-quantification objective (FDS∀) that enables hypernymy learning under the reverse DIH (rDIH). Across synthetic hierarchies and the large WordNet-based real hierarchy, FDS∀ improves hypernymy detection, and on real WikiWoods data, FDS∀ enhances performance over the baseline FDS. The results also demonstrate distributional generalization to nouns with incomplete context, suggesting practical benefits for open-class semantic learning and reasoning under quantification in FDS.
Abstract
Functional Distributional Semantics (FDS) models the meaning of words by truth-conditional functions. This provides a natural representation for hypernymy but no guarantee that it can be learnt when FDS models are trained on a corpus. In this paper, we probe into FDS models and study the representations learnt, drawing connections between quantifications, the Distributional Inclusion Hypothesis (DIH), and the variational-autoencoding objective of FDS model training. Using synthetic data sets, we reveal that FDS models learn hypernymy on a restricted class of corpus that strictly follows the DIH. We further introduce a training objective that both enables hypernymy learning under the reverse of the DIH and improves hypernymy detection from real corpora.
