Discovering interpretable models of scientific image data with deep learning
Christopher J. Soelistyo, Alan R. Lowe
TL;DR
This work tackles the problem of extracting interpretable, domain-appropriate scientific models from raw image data by marrying disentangled representation learning, sparse network training, and symbolic regression. The authors test these methods on a bioimaging problem—classifying chromatin morphology in live-cell microscopy—demonstrating that highly parsimonious models can approach the accuracy of black-box benchmarks while offering clear interpretability and domain insight. Key findings include a semantic latent space that aligns with biological factors, sparse models that illuminate which latent features drive decisions, and symbolic expressions that reveal explicit decision boundaries, all validated against adversarial perturbations to assess domain-appropriateness. The results suggest that an approximate Rashomon set of models exists in this domain, enabling a practical, interpretable discovery system with potential broad applicability in scientific contexts.
Abstract
How can we find interpretable, domain-appropriate models of natural phenomena given some complex, raw data such as images? Can we use such models to derive scientific insight from the data? In this paper, we propose some methods for achieving this. In particular, we implement disentangled representation learning, sparse deep neural network training and symbolic regression, and assess their usefulness in forming interpretable models of complex image data. We demonstrate their relevance to the field of bioimaging using a well-studied test problem of classifying cell states in microscopy data. We find that such methods can produce highly parsimonious models that achieve $\sim98\%$ of the accuracy of black-box benchmark models, with a tiny fraction of the complexity. We explore the utility of such interpretable models in producing scientific explanations of the underlying biological phenomenon.
