Solving morphological analogies: from retrieval to generation
Esteban Marquer, Miguel Couceiro
TL;DR
This work addresses morphological analogies by framing them as analogical proportions and separating detection from solving. It introduces a dual embedding strategy (CNN-based and AE) and two neural modules, ANNc for analogy classification and ANNr for retrieval/generation, augmented with axiom-guided data augmentation. Through extensive multilingual evaluation on Siganalogies and Sigmorphon/JBATS data, ANNr paired with AE consistently outperforms symbolic baselines and rivals or surpasses generation-based methods, while ANNc aids reliable retrieval. The study demonstrates strong cross-language performance, robustness to initialization, and practical guidelines for applying DL to APs, highlighting the value of morphologically focused embeddings and axiom-informed training. Overall, the framework advances automatic handling of morphological analogies with high accuracy across diverse languages and offers actionable insights for future research and applications in NLP and linguistics.
Abstract
Analogical inference is a remarkable capability of human reasoning, and has been used to solve hard reasoning tasks. Analogy based reasoning (AR) has gained increasing interest from the artificial intelligence community and has shown its potential in multiple machine learning tasks such as classification, decision making and recommendation with competitive results. We propose a deep learning (DL) framework to address and tackle two key tasks in AR: analogy detection and solving. The framework is thoroughly tested on the Siganalogies dataset of morphological analogical proportions (APs) between words, and shown to outperform symbolic approaches in many languages. Previous work have explored the behavior of the Analogy Neural Network for classification (ANNc) on analogy detection and of the Analogy Neural Network for retrieval (ANNr) on analogy solving by retrieval, as well as the potential of an autoencoder (AE) for analogy solving by generating the solution word. In this article we summarize these findings and we extend them by combining ANNr and the AE embedding model, and checking the performance of ANNc as an retrieval method. The combination of ANNr and AE outperforms the other approaches in almost all cases, and ANNc as a retrieval method achieves competitive or better performance than 3CosMul. We conclude with general guidelines on using our framework to tackle APs with DL.
