Latent space analysis and generalization to out-of-distribution data
Katie Rainey, Erin Hausmann, Donald Waagen, David Gray, Donald Hulsey
TL;DR
The paper investigates whether latent-space out-of-distribution (OOD) signals align with classifier accuracy when deploying SAR imagery classifiers. It analyzes paired synthetic-real SAR datasets (SAMPLE and MSTAR) using a ResNet-20 classifier with a latent space of dimension $256$ and a deep $k$-nearest neighbor OOD detector to assess in-task versus out-of-task data. The key finding is that latent-space OOD status is not a reliable proxy for classification performance across datasets and model instances, highlighting gaps between OOD signals and real-world robustness. The work emphasizes the need to understand latent-space geometry to improve robustness and generalization under domain shifts for SAR analysis.
Abstract
Understanding the relationships between data points in the latent decision space derived by the deep learning system is critical to evaluating and interpreting the performance of the system on real world data. Detecting \textit{out-of-distribution} (OOD) data for deep learning systems continues to be an active research topic. We investigate the connection between latent space OOD detection and classification accuracy of the model. Using open source simulated and measured Synthetic Aperture RADAR (SAR) datasets, we empirically demonstrate that the OOD detection cannot be used as a proxy measure for model performance. We hope to inspire additional research into the geometric properties of the latent space that may yield future insights into deep learning robustness and generalizability.
