Re-envisioning Euclid Galaxy Morphology: Identifying and Interpreting Features with Sparse Autoencoders
John F. Wu, Michael Walmsley
TL;DR
The paper addresses interpreting deep morphology representations and discovering rare galaxy features in large surveys by applying Matryoshka sparse autoencoders (SAEs) to embeddings from supervised Zoobot and self-supervised MAE models on Euclid Q1 data. It demonstrates that SAEs yield monosemantic, interpretable features that align with Galaxy Zoo classifications better than PCA, while also revealing features beyond the traditional taxonomy; MAE-based embeddings achieve near-superhuman reconstruction, underscoring the richness of information captured by self-supervised representations. The work shows SAEs as scalable discovery engines that can surface novel or artifact-related phenomena in upcoming surveys, though interpretability and artifact sensitivity remain challenges. Public release of the MAE model and SAE code enables community-driven exploration of morphology concepts across next-generation astronomical datasets.
Abstract
Sparse Autoencoders (SAEs) can efficiently identify candidate monosemantic features from pretrained neural networks for galaxy morphology. We demonstrate this on Euclid Q1 images using both supervised (Zoobot) and new self-supervised (MAE) models. Our publicly released MAE achieves superhuman image reconstruction performance. While a Principal Component Analysis (PCA) on the supervised model primarily identifies features already aligned with the Galaxy Zoo decision tree, SAEs can identify interpretable features outside of this framework. SAE features also show stronger alignment than PCA with Galaxy Zoo labels. Although challenges in interpretability remain, SAEs provide a powerful engine for discovering astrophysical phenomena beyond the confines of human-defined classification.
