Memory-based Ensemble Learning in CMR Semantic Segmentation
Yiwei Liu, Ziyi Wu, Liang Zhong, Lingyi Wen, Yuankai Wu
TL;DR
The study tackles ventricular segmentation in cardiac cine MRI, where end-slice accuracy is clinically critical and traditional methods struggle. It introduces Streaming, a memory-based ensemble that leverages global uncertainty extracted from 3D frame variance to dynamically weight base classifiers at a pixel level, balanced by a joint Dice–Focal loss. A key contribution is the End Coefficient (EC), a metric quantifying end-slice performance, and comprehensive evaluation on ACDC and M&Ms showing near-state-of-the-art overall Dice and clear gains on end slices, with efficient end-to-end training. The approach offers clinically meaningful improvements in patient-specific segmentation by stabilizing end-slice predictions without sacrificing global accuracy, making it suitable for routine CMR analysis.
Abstract
Existing models typically segment either the entire 3D frame or 2D slices independently to derive clinical functional metrics from ventricular segmentation in cardiac cine sequences. While performing well overall, they struggle at the end slices. To address this, we leverage spatial continuity to extract global uncertainty from segmentation variance and use it as memory in our ensemble learning method, Streaming, for classifier weighting, balancing overall and end-slice performance. Additionally, we introduce the End Coefficient (EC) to quantify end-slice accuracy. Experiments on ACDC and M&Ms datasets show that our framework achieves near-state-of-the-art Dice Similarity Coefficient (DSC) and outperforms all models on end-slice performance, improving patient-specific segmentation accuracy.
