Deep ARTMAP: Generalized Hierarchical Learning with Adaptive Resonance Theory
Niklas M. Melton, Leonardo Enzo Brito da Silva, Sasha Petrenko, Donald. C. Wunsch
TL;DR
Deep ARTMAP addresses the challenge of learning hierarchical structure from multi-modal data by extending ART-based architectures to a multi-layer, transform-aware framework. It generalizes SMART and ARTMAP through a chain of module blocks driven by arbitrary nonlinear transforms $f_k$, enabling both unsupervised divisive clustering and supervised learning within the same architecture; inter-ART map fields enforce structured cross-layer mappings. The key contributions include showing that Deep ARTMAP reduces to SMART under identity transforms and to ARTMAP for $L=2$, outlining an incremental online training algorithm that processes samples top-down and supports parallelizable inference, and providing open-source software implementations in Python and Julia. This framework offers scalable, continual-learning capable hierarchical clustering for multi-modal data, with potential applications in complex relational tasks and reduced catastrophic forgetting in dynamic environments.
Abstract
This paper presents Deep ARTMAP, a novel extension of the ARTMAP architecture that generalizes the self-consistent modular ART (SMART) architecture to enable hierarchical learning (supervised and unsupervised) across arbitrary transformations of data. The Deep ARTMAP framework operates as a divisive clustering mechanism, supporting an arbitrary number of modules with customizable granularity within each module. Inter-ART modules regulate the clustering at each layer, permitting unsupervised learning while enforcing a one-to-many mapping from clusters in one layer to the next. While Deep ARTMAP reduces to both ARTMAP and SMART in particular configurations, it offers significantly enhanced flexibility, accommodating a broader range of data transformations and learning modalities.
