Geo-UNet: A Geometrically Constrained Neural Framework for Clinical-Grade Lumen Segmentation in Intravascular Ultrasound
Yiming Chen, Niharika S. D'Souza, Akshith Mandepally, Patrick Henninger, Satyananda Kashyap, Neerav Karani, Neel Dey, Marcos Zachary, Raed Rizq, Paul Chouinard, Polina Golland, Tanveer F. Syeda-Mahmood
TL;DR
This work tackles the challenge of precise lumen segmentation in venous IVUS (v-IVUS) for accurate stent sizing, addressing the limitations of standard UNet-based approaches under limited data and the radial geometry of IVUS. It introduces Geo-UNet, a geometry-informed, two-task network that operates in polar space to predict both a single lumen contour and a dense pixel-wise lumen mask, using a novel CDFeLU activation to fuse contour information into pixel predictions. The model optimizes a unified loss that combines area-, distance-, and contour-based penalties and adds an inference-time Geo-UNet++ step to reduce wrap-around artifacts at the 0/2π boundary. On a venous IVUS dataset, Geo-UNet and Geo-UNet++ outperform several baselines, achieving high Dice scores and clinically relevant lumen-diameter accuracy, with particularly notable improvements in contour contiguity and minor-diameter estimation. The approach suggests a practical pathway to clinical-grade lumen segmentation in radial imaging settings and offers avenues for extension to other radially structured modalities and 3D contexts.
Abstract
Precisely estimating lumen boundaries in intravascular ultrasound (IVUS) is needed for sizing interventional stents to treat deep vein thrombosis (DVT). Unfortunately, current segmentation networks like the UNet lack the precision needed for clinical adoption in IVUS workflows. This arises due to the difficulty of automatically learning accurate lumen contour from limited training data while accounting for the radial geometry of IVUS imaging. We propose the Geo-UNet framework to address these issues via a design informed by the geometry of the lumen contour segmentation task. We first convert the input data and segmentation targets from Cartesian to polar coordinates. Starting from a convUNet feature extractor, we propose a two-task setup, one for conventional pixel-wise labeling and the other for single boundary lumen-contour localization. We directly combine the two predictions by passing the predicted lumen contour through a new activation (named CDFeLU) to filter out spurious pixel-wise predictions. Our unified loss function carefully balances area-based, distance-based, and contour-based penalties to provide near clinical-grade generalization in unseen patient data. We also introduce a lightweight, inference-time technique to enhance segmentation smoothness. The efficacy of our framework on a venous IVUS dataset is shown against state-of-the-art models.
