Adaptive control of recurrent neural networks using conceptors
Guillaume Pourcel, Mirko Goldmann, Ingo Fischer, Miguel C. Soriano
TL;DR
Problem: Recurrent networks are typically adaptive during training but fixed during inference, limiting their ability to cope with changing conditions. Approach: the authors introduce an Adaptive Conceptor Control Loop (CCL) that online-estimates the current conceptor via autoconceptors and applies an adaptive projection $C_{adapt}$ to steer $x(k)$ toward a target subspace, i.e., $x(k+1) = C_{adapt}(k)[(1-\alpha)x(k) + \alpha\tanh(Wx(k) + W_{in}u(k) + b)]$. Key findings: the CCL yields more stable interpolation between temporal patterns than static conceptors, enhances robustness to partial degradation by preserving limit cycles, and supports distortion mitigation through a hierarchical Random Feature Conceptor (RFC) architecture. Significance: this framework extends RNN functionality beyond training, enabling on-the-fly adaptation in dynamic environments and offering potential for neuromorphic hardware implementations and few-shot learning scenarios.
Abstract
Recurrent Neural Networks excel at predicting and generating complex high-dimensional temporal patterns. Due to their inherent nonlinear dynamics and memory, they can learn unbounded temporal dependencies from data. In a Machine Learning setting, the network's parameters are adapted during a training phase to match the requirements of a given task/problem increasing its computational capabilities. After the training, the network parameters are kept fixed to exploit the learned computations. The static parameters thereby render the network unadaptive to changing conditions, such as external or internal perturbation. In this manuscript, we demonstrate how keeping parts of the network adaptive even after the training enhances its functionality and robustness. Here, we utilize the conceptor framework and conceptualize an adaptive control loop analyzing the network's behavior continuously and adjusting its time-varying internal representation to follow a desired target. We demonstrate how the added adaptivity of the network supports the computational functionality in three distinct tasks: interpolation of temporal patterns, stabilization against partial network degradation, and robustness against input distortion. Our results highlight the potential of adaptive networks in machine learning beyond training, enabling them to not only learn complex patterns but also dynamically adjust to changing environments, ultimately broadening their applicability.
