Learning inflection classes using Adaptive Resonance Theory
Peter Dekker, Heikki Rasilo, Bart de Boer
TL;DR
The paper investigates how inflection classes can be learned as cognitive abstractions by unsupervised clustering, using Adaptive Resonance Theory (ART1) with a tunable vigilance parameter to control generalisation. It applies a trigram-based binary encoding of selected paradigm cells to Latin, Portuguese, and Estonian data, evaluating clustering against linguistically annotated classes with ARI and AMI, and comparing to a k-means baseline. Results show near-perfect clustering for Latin in a narrow vigilance window, moderate success for Portuguese, and intermediate performance for Estonian, with all languages demonstrating some generalisation to unseen data. The authors discuss implications for modeling language change via diachronic agent-based simulations and outline future work to improve data representations and automatic vigilance tuning.
Abstract
The concept of inflection classes is an abstraction used by linguists, and provides a means to describe patterns in languages that give an analogical base for deducing previously unencountered forms. This ability is an important part of morphological acquisition and processing. We study the learnability of a system of verbal inflection classes by the individual language user by performing unsupervised clustering of lexemes into inflection classes. As a cognitively plausible and interpretable computational model, we use Adaptive Resonance Theory, a neural network with a parameter that determines the degree of generalisation (vigilance). The model is applied to Latin, Portuguese and Estonian. The similarity of clustering to attested inflection classes varies depending on the complexity of the inflectional system. We find the best performance in a narrow region of the generalisation parameter. The learned features extracted from the model show similarity with linguistic descriptions of the inflection classes. The proposed model could be used to study change in inflection classes in the future, by including it in an agent-based model.
