Identifying Information-Transfer Nodes in a Recurrent Neural Network Reveals Dynamic Representations
Arend Hintze, Asadullah Najam, Jory Schossau
TL;DR
Addresses the challenge of interpreting RNN internal dynamics by identifying information-transfer nodes, termed information relays, using a mutual-information measure across hidden-layer subsets. The method computes $I_R = H(X_{ m in}; X_{ m out}; Y_R \,|\; Y_0)$ to quantify relayed information and uses a greedy algorithm to order nodes by contribution; for recurrent networks, relay information is evaluated at each time point after all inputs are presented to capture possible migration of information over time. The approach is validated on memory and block time-series classification tasks using RNN, GRU, and LSTM architectures, revealing that RNNs exhibit information migration between nodes while GRUs/LSTMs localize information in stable node sets, with concept-specific knockout effects; PCA analyses further show attractor-like latent structure. Overall, the work advances explainability and provides a scalable tool for diagnosing temporal information processing in sequential models.
Abstract
Understanding the internal dynamics of Recurrent Neural Networks (RNNs) is crucial for advancing their interpretability and improving their design. This study introduces an innovative information-theoretic method to identify and analyze information-transfer nodes within RNNs, which we refer to as \textit{information relays}. By quantifying the mutual information between input and output vectors across nodes, our approach pinpoints critical pathways through which information flows during network operations. We apply this methodology to both synthetic and real-world time series classification tasks, employing various RNN architectures, including Long Short-Term Memory (LSTM) networks and Gated Recurrent Units (GRUs). Our results reveal distinct patterns of information relay across different architectures, offering insights into how information is processed and maintained over time. Additionally, we conduct node knockout experiments to assess the functional importance of identified nodes, significantly contributing to explainable artificial intelligence by elucidating how specific nodes influence overall network behavior. This study not only enhances our understanding of the complex mechanisms driving RNNs but also provides a valuable tool for designing more robust and interpretable neural networks.
