vesselFM: A Foundation Model for Universal 3D Blood Vessel Segmentation
Bastian Wittmann, Yannick Wattenberg, Tamaz Amiranashvili, Suprosanna Shit, Bjoern Menze
TL;DR
VesselFM tackles universal 3D blood vessel segmentation by training a foundation model on three heterogeneous data sources: $\mathcal{D}_\text{real}$, $\mathcal{D}_\text{drand}$, and $\mathcal{D}_\text{flow}$, to bridge domain gaps across modalities. It achieves zero-shot generalization to unseen domains and strong one- and few-shot performance across four clinically relevant datasets, leveraging a UNet-based architecture and flow-matching data generation. The approach combines domain randomization with a mask- and class-conditioned flow matching generator to produce large-scale, anatomically coherent image-mask pairs, yielding state-of-the-art Dice and clDice results and robust tubular vessel predictions. Ablation analyses demonstrate the necessity of all three data sources and the flow-matching generator, highlighting vesselFM's practical utility for accelerating vascular imaging research and reducing annotation burden with open-source checkpoints and code.
Abstract
Segmenting 3D blood vessels is a critical yet challenging task in medical image analysis. This is due to significant imaging modality-specific variations in artifacts, vascular patterns and scales, signal-to-noise ratios, and background tissues. These variations, along with domain gaps arising from varying imaging protocols, limit the generalization of existing supervised learning-based methods, requiring tedious voxel-level annotations for each dataset separately. While foundation models promise to alleviate this limitation, they typically fail to generalize to the task of blood vessel segmentation, posing a unique, complex problem. In this work, we present vesselFM, a foundation model designed specifically for the broad task of 3D blood vessel segmentation. Unlike previous models, vesselFM can effortlessly generalize to unseen domains. To achieve zero-shot generalization, we train vesselFM on three heterogeneous data sources: a large, curated annotated dataset, data generated by a domain randomization scheme, and data sampled from a flow matching-based generative model. Extensive evaluations show that vesselFM outperforms state-of-the-art medical image segmentation foundation models across four (pre-)clinically relevant imaging modalities in zero-, one-, and few-shot scenarios, therefore providing a universal solution for 3D blood vessel segmentation.
