Imbalanced Medical Image Segmentation with Pixel-dependent Noisy Labels
Erjian Guo, Zicheng Wang, Zhen Zhao, Luping Zhou
TL;DR
This work tackles pixel-wise noisy labels in medical image segmentation under class imbalance by introducing CLCS, a collaborative two-branch framework guided by a curriculum-based selection and a robust learning objective. The Curriculum Noisy Label Sample Selection (CNS) module uses a discrepancy-driven, two-branch network with a dynamic, class-aware threshold to identify clean labels, while the Noise Balance Loss (NBL) reuses noisy samples with a mixed CE and RCE loss weighted by confidence to improve data utilization. Key contributions include a discrepancy loss to prevent branch collapse, a convex-curvature curriculum threshold to address imbalance, and a balanced loss that mitigates overfitting to noisy labels, validated by extensive experiments on Endovis18 and RIGA showing consistent improvements over robust-loss and pixel-denoising baselines. The approach demonstrates practical impact by enhancing segmentation performance in real-world medical datasets with realistic noise patterns, offering a scalable solution for robust training when clean pixel-level annotations are scarce.
Abstract
Accurate medical image segmentation is often hindered by noisy labels in training data, due to the challenges of annotating medical images. Prior research works addressing noisy labels tend to make class-dependent assumptions, overlooking the pixel-dependent nature of most noisy labels. Furthermore, existing methods typically apply fixed thresholds to filter out noisy labels, risking the removal of minority classes and consequently degrading segmentation performance. To bridge these gaps, our proposed framework, Collaborative Learning with Curriculum Selection (CLCS), addresses pixel-dependent noisy labels with class imbalance. CLCS advances the existing works by i) treating noisy labels as pixel-dependent and addressing them through a collaborative learning framework, and ii) employing a curriculum dynamic thresholding approach adapting to model learning progress to select clean data samples to mitigate the class imbalance issue, and iii) applying a noise balance loss to noisy data samples to improve data utilization instead of discarding them outright. Specifically, our CLCS contains two modules: Curriculum Noisy Label Sample Selection (CNS) and Noise Balance Loss (NBL). In the CNS module, we designed a two-branch network with discrepancy loss for collaborative learning so that different feature representations of the same instance could be extracted from distinct views and used to vote the class probabilities of pixels. Besides, a curriculum dynamic threshold is adopted to select clean-label samples through probability voting. In the NBL module, instead of directly dropping the suspiciously noisy labels, we further adopt a robust loss to leverage such instances to boost the performance.
