QMix: Quality-aware Learning with Mixed Noise for Robust Retinal Disease Diagnosis
Junlin Hou, Jilan Xu, Rui Feng, Hao Chen
TL;DR
This work tackles robustness in retinal disease diagnosis under mixed noise, where both label noise and data/quality noise degrade performance. It introduces QMix, a two-branch co-training framework that alternates between sample separation—via a joint uncertainty–loss Gaussian Mixture Model to categorize samples as Correct, Mis-H, or Mis-L—and quality-aware semi-supervised training that refines labels, learns robust representations, and uses a contrastive enhancement to separate Mis-L. The method combines a novel sample-reweighing loss and a contrastive loss within SSL, plus a regularization term, to suppress the impact of Mis-L while preserving useful information from Correct and Mis-H. Extensive experiments on six retinal datasets show state-of-the-art results under synthetic and real-world mixed noise, with strong robustness to high noise ratios and clear improvements in sample separation quality, indicating practical value for medical image analysis in noisy acquisition settings.
Abstract
Due to the complexity of medical image acquisition and the difficulty of annotation, medical image datasets inevitably contain noise. Noisy data with wrong labels affects the robustness and generalization ability of deep neural networks. Previous noise learning methods mainly considered noise arising from images being mislabeled, i.e. label noise, assuming that all mislabeled images are of high image quality. However, medical images are prone to suffering extreme quality issues, i.e. data noise, where discriminative visual features are missing for disease diagnosis. In this paper, we propose a noise learning framework, termed as QMix, that learns a robust disease diagnosis model under mixed noise. QMix alternates between sample separation and quality-aware semisupervised training in each training epoch. In the sample separation phase, we design a joint uncertainty-loss criterion to effectively separate (1) correctly labeled images; (2) mislabeled images with high quality and (3) mislabeled images with low quality. In the semi-supervised training phase, we train a disease diagnosis model to learn robust feature representation from the separated samples. Specifically, we devise a sample-reweighing loss to mitigate the effect of mislabeled images with low quality during training. Meanwhile, a contrastive enhancement loss is proposed to further distinguish mislabeled images with low quality from correctly labeled images. QMix achieved state-of-the-art disease diagnosis performance on five public retinal image datasets and exhibited substantial improvement on robustness against mixed noise.
