Accurate Segmentation of Optic Disc And Cup from Multiple Pseudo-labels by Noise-aware Learning
Tengjin Weng, Yang Shen, Zhidong Zhao, Zhiming Cheng, Shuai Wang
TL;DR
This paper tackles the problem of label noise arising from boundary ambiguity in optic disc and cup segmentation. It introduces the Multiple Pseudo-labels Noise-aware Network (MPNN), combining the MPGGD module (which generates $K$ pseudo-labels via diverse initializations and thresholds the clean versus noisy pixels by consensus) with a Mean-Teacher style learning framework that uses an EMA-updated teacher to enforce uncertainty-guided consistency. Clean pixels drive supervised learning while noisy pixels contribute through a contrastive-like consistency loss, controlled by per-pixel uncertainty estimates and a ramped loss weight $\lambda(t)$. On the RIGA dataset, MPNN achieves state-of-the-art performance for both ground-truth references (Majority Vote and Rater1), demonstrating robust denoising of noisy annotations and improved segmentation accuracy, with code released for reproducibility.
Abstract
Optic disc and cup segmentation plays a crucial role in automating the screening and diagnosis of optic glaucoma. While data-driven convolutional neural networks (CNNs) show promise in this area, the inherent ambiguity of segmenting objects and background boundaries in the task of optic disc and cup segmentation leads to noisy annotations that impact model performance. To address this, we propose an innovative label-denoising method of Multiple Pseudo-labels Noise-aware Network (MPNN) for accurate optic disc and cup segmentation. Specifically, the Multiple Pseudo-labels Generation and Guided Denoising (MPGGD) module generates pseudo-labels by multiple different initialization networks trained on true labels, and the pixel-level consensus information extracted from these pseudo-labels guides to differentiate clean pixels from noisy pixels. The training framework of the MPNN is constructed by a teacher-student architecture to learn segmentation from clean pixels and noisy pixels. Particularly, such a framework adeptly leverages (i) reliable and fundamental insight from clean pixels and (ii) the supplementary knowledge within noisy pixels via multiple perturbation-based unsupervised consistency. Compared to other label-denoising methods, comprehensive experimental results on the RIGA dataset demonstrate our method's excellent performance. The code is available at https://github.com/wwwtttjjj/MPNN
