DeepATLAS: One-Shot Localization for Biomedical Data
Peter D. Chang
TL;DR
DeepATLAS tackles dense anatomical localization in high-dimensional biomedical data by learning an anatomically-consistent, universal coordinate embedding. It uses two self-supervised registration objectives, implicit $\Phi_I$ and explicit $\Phi_E$, coupled with a feature-based similarity loss and smoothness regularization to produce stable coordinate maps that generalize across exams. After pretraining on $>51{,}000$ unlabeled CT volumes, one-shot and few-shot segmentation of 51 structures across four external cohorts achieve Dice scores around $0.70$–$0.84$ with $HD_{95}$ in the millimeter-to-centimeter range, often rivaling or exceeding supervised baselines; gains are enhanced by adding external data and semi-supervised fine-tuning. The learned representations support scalable preprocessing, cropping, and downstream tasks, with potential for out-of-distribution detection and active learning, making the approach broadly applicable beyond predefined atlas-based segmentation.
Abstract
This paper introduces the DeepATLAS foundational model for localization tasks in the domain of high-dimensional biomedical data. Upon convergence of the proposed self-supervised objective, a pretrained model maps an input to an anatomically-consistent embedding from which any point or set of points (e.g., boxes or segmentations) may be identified in a one-shot or few-shot approach. As a representative benchmark, a DeepATLAS model pretrained on a comprehensive cohort of 51,000+ unlabeled 3D computed tomography exams yields high one-shot segmentation performance on over 50 anatomic structures across four different external test sets, either matching or exceeding the performance of a standard supervised learning model. Further improvements in accuracy can be achieved by adding a small amount of labeled data using either a semisupervised or more conventional fine-tuning strategy.
