Crop and Couple: cardiac image segmentation using interlinked specialist networks
Abbas Khan, Muhammad Asad, Martin Benning, Caroline Roney, Gregory Slabaugh
TL;DR
This paper tackles automated segmentation of cardiac MRI into left ventricle, right ventricle, and myocardium, a key step in cardiovascular disease assessment. It introduces CroCNet, a two-stage pipeline where an initial E-2AUNet segments and crops the heart region, followed by weight-sharing specialist networks (LV-Net, RV-Net, MYO-Net) refined through efficient additive cross-attention to produce accurate final masks. The approach leverages an efficient additive attention mechanism to capture global context with lower complexity, and cross-attention acts as a soft relative shape prior to improve inter-anatomy consistency. On the MICCAI 2021 M&Ms-2 dataset, CroCNet achieves state-of-the-art Dice and Hausdorff distance, outperforming prior methods by notable margins, and ablations confirm the value of cropping and cross-attention for refining segmentation while maintaining single-view input efficiency.
Abstract
Diagnosis of cardiovascular disease using automated methods often relies on the critical task of cardiac image segmentation. We propose a novel strategy that performs segmentation using specialist networks that focus on a single anatomy (left ventricle, right ventricle, or myocardium). Given an input long-axis cardiac MR image, our method performs a ternary segmentation in the first stage to identify these anatomical regions, followed by cropping the original image to focus subsequent processing on the anatomical regions. The specialist networks are coupled through an attention mechanism that performs cross-attention to interlink features from different anatomies, serving as a soft relative shape prior. Central to our approach is an additive attention block (E-2A block), which is used throughout our architecture thanks to its efficiency.
