ProtoDiv: Prototype-guided Division of Consistent Pseudo-bags for Whole-slide Image Classification
Rui Yang, Pei Liu, Luping Ji
TL;DR
ProtoDiv tackles the data scarcity and weak labeling challenge in whole-slide image (WSI) classification by augmenting training data with pseudo-bags that are guided by bag prototypes. It introduces mean-based and attention-based prototypes to produce a cosine-distance–driven partitioning of patch features, yielding phenotype-diverse yet representative pseudo-bags that preserve the parent bag distribution. The approach is designed as a plug-and-play augmentation for existing MIL methods and demonstrates consistent AUC gains across seven baselines on TCGA-Lung and TCGA-BRCA, with efficient division times and intuitive visualizations of prototype separation. Overall, ProtoDiv enhances generalization in WSI MIL by maintaining phenotype consistency while safely increasing data size through adaptive pseudo-bag generation.
Abstract
Due to the limitations of inadequate Whole-Slide Image (WSI) samples with weak labels, pseudo-bag-based multiple instance learning (MIL) appears as a vibrant prospect in WSI classification. However, the pseudo-bag dividing scheme, often crucial for classification performance, is still an open topic worth exploring. Therefore, this paper proposes a novel scheme, ProtoDiv, using a bag prototype to guide the division of WSI pseudo-bags. Rather than designing complex network architecture, this scheme takes a plugin-and-play approach to safely augment WSI data for effective training while preserving sample consistency. Furthermore, we specially devise an attention-based prototype that could be optimized dynamically in training to adapt to a classification task. We apply our ProtoDiv scheme on seven baseline models, and then carry out a group of comparison experiments on two public WSI datasets. Experiments confirm our ProtoDiv could usually bring obvious performance improvements to WSI classification.
