Towards Open-Ended Visual Scientific Discovery with Sparse Autoencoders

Samuel Stevens; Jacob Beattie; Tanya Berger-Wolf; Yu Su

Towards Open-Ended Visual Scientific Discovery with Sparse Autoencoders

Samuel Stevens, Jacob Beattie, Tanya Berger-Wolf, Yu Su

TL;DR

This work tackles the bottleneck of scientific discovery with foundation models by proposing sparse autoencoders (SAEs) to extract open-ended, interpretable feature vocabularies from unlabeled foundation-model activations. By applying SAEs to DINOv3 ViT representations and evaluating on ADE20K and FishVista, the authors demonstrate that SAEs can rediscover semantic concepts and surface fine-grained anatomical structures without supervision, with Matryoshka SAEs offering improved concept coverage. The approach is shown to be domain-agnostic, suggesting applicability to proteins, genomics, weather, and more, and highlights a shift from confirmation-focused analyses to hypothesis-generating discovery. The findings provide a practical pathway to quantify and interrogate what large scientific foundation models have learned, enabling reproducible, open-ended exploration and potential new scientific hypotheses.

Abstract

Scientific archives now contain hundreds of petabytes of data across genomics, ecology, climate, and molecular biology that could reveal undiscovered patterns if systematically analyzed at scale. Large-scale, weakly-supervised datasets in language and vision have driven the development of foundation models whose internal representations encode structure (patterns, co-occurrences and statistical regularities) beyond their training objectives. Most existing methods extract structure only for pre-specified targets; they excel at confirmation but do not support open-ended discovery of unknown patterns. We ask whether sparse autoencoders (SAEs) can enable open-ended feature discovery from foundation model representations. We evaluate this question in controlled rediscovery studies, where the learned SAE features are tested for alignment with semantic concepts on a standard segmentation benchmark and compared against strong label-free alternatives on concept-alignment metrics. Applied to ecological imagery, the same procedure surfaces fine-grained anatomical structure without access to segmentation or part labels, providing a scientific case study with ground-truth validation. While our experiments focus on vision with an ecology case study, the method is domain-agnostic and applicable to models in other sciences (e.g., proteins, genomics, weather). Our results indicate that sparse decomposition provides a practical instrument for exploring what scientific foundation models have learned, an important prerequisite for moving from confirmation to genuine discovery.

Towards Open-Ended Visual Scientific Discovery with Sparse Autoencoders

TL;DR

Abstract

Towards Open-Ended Visual Scientific Discovery with Sparse Autoencoders

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (10)