MergeUp-augmented Semi-Weakly Supervised Learning for WSI Classification
Mingxi Ouyang, Yuqiu Fu, Renao Yan, ShanShan Shi, Xitong Ling, Lianghui Zhu, Yonghong He, Tian Guan
TL;DR
This paper tackles WSI classification under weak supervision by converting to a semi-weakly supervised setting. It introduces AdaPse to adaptively assign pseudo bag labels and MergeUp to mix bags across categories, all within a student–teacher SSL framework that enforces consistency between weak (identity) and strong (MergeUp) augmentations. Empirical results on CAMELYON-16, BRACS, and TCGA-LUNG show substantial improvements over state-of-the-art MIL methods, with ablations confirming the benefits of adaptive pseudo labeling and inter-category augmentation. The approach offers a scalable, annotation-free path to improved MIL for pathology, with potential impact on diagnostic accuracy and data efficiency.
Abstract
Recent advancements in computational pathology and artificial intelligence have significantly improved whole slide image (WSI) classification. However, the gigapixel resolution of WSIs and the scarcity of manual annotations present substantial challenges. Multiple instance learning (MIL) is a promising weakly supervised learning approach for WSI classification. Recently research revealed employing pseudo bag augmentation can encourage models to learn various data, thus bolstering models' performance. While directly inheriting the parents' labels can introduce more noise by mislabeling in training. To address this issue, we translate the WSI classification task from weakly supervised learning to semi-weakly supervised learning, termed SWS-MIL, where adaptive pseudo bag augmentation (AdaPse) is employed to assign labeled and unlabeled data based on a threshold strategy. Using the "student-teacher" pattern, we introduce a feature augmentation technique, MergeUp, which merges bags with low-priority bags to enhance inter-category information, increasing training data diversity. Experimental results on the CAMELYON-16, BRACS, and TCGA-LUNG datasets demonstrate the superiority of our method over existing state-of-the-art approaches, affirming its efficacy in WSI classification.
