Neural Surveillance: Live-Update Visualization of Latent Training Dynamics
Xianglin Yang, Jin Song Dong
TL;DR
The paper tackles the problem of real-time auditing of neural network training by visualizing evolving latent representations. It introduces SentryCam, a live-update visualization framework built on a dynamic spatio-temporal graph, density-guided sampling, and a GN-BN hybrid projection to produce faithful 2D representations with minimal latency. The authors provide theory linking sampling density to topology preservation via a relative-density metric and demonstrate a topology tipping point to guide practical sampling decisions, along with extensive experiments showing efficiency, high-quality visuals, and timely alerts for training failures. The approach yields actionable, early warnings and supports continual learning analyses, offering a practical tool for proactive model development and auditing with open-source availability.
Abstract
Monitoring the inner state of deep neural networks is essential for auditing the learning process and enabling timely interventions. While conventional metrics like validation loss offer a surface-level view of performance, the evolution of a model's hidden representations provides a deeper, complementary window into its internal dynamics. However, the literature lacks a real-time tool to monitor these crucial internal states. To address this, we introduce SentryCam, a live-update visualization framework that tracks the progression of hidden representations throughout training. SentryCam produces high-fidelity visualizations of the evolving representation space with minimal latency, serving as a powerful dashboard for understanding how a model learns. We quantitatively validate the faithfulness of SentryCam's visualizations across diverse datasets and architectures (ResNet, ViT). Furthermore, we demonstrate SentryCam's practical utility for model auditing through a case study on training instability. We designed an automated auditing system with geometry-based alerts that successfully identified impending model failure up to 7 epochs earlier than was evident from the validation loss curve. SentryCam's flexible framework is easily adaptable, supporting both the exploratory analysis and proactive auditing essential for robust model development. The code is available at https://github.com/xianglinyang/SentryCam.
