Morphological Inflection Generation with Hard Monotonic Attention
Roee Aharoni, Yoav Goldberg
TL;DR
The paper presents a hard attention model for morphological inflection generation that enforces near-monotonic input-output alignments. It uses a biLSTM encoder and a decoder that either writes output symbols or advances the input pointer, trained on oracle action sequences derived from independent alignments. Across CELEX, Wiktionary, and SIGMORPHON datasets, the approach achieves state-of-the-art results, especially in low-resource settings, and demonstrates competitive decoding efficiency. Analyses compare hard and soft attention in terms of alignments and representations, offering insights into learned features for inflection tasks. The work suggests broader applicability to other monotonic align-and-transduce problems.
Abstract
We present a neural model for morphological inflection generation which employs a hard attention mechanism, inspired by the nearly-monotonic alignment commonly found between the characters in a word and the characters in its inflection. We evaluate the model on three previously studied morphological inflection generation datasets and show that it provides state of the art results in various setups compared to previous neural and non-neural approaches. Finally we present an analysis of the continuous representations learned by both the hard and soft attention \cite{bahdanauCB14} models for the task, shedding some light on the features such models extract.
