Promptable Representation Distribution Learning and Data Augmentation for Gigapixel Histopathology WSI Analysis

Kunming Tang; Zhiguo Jiang; Jun Shi; Wei Wang; Haibo Wu; Yushan Zheng

TL;DR

The paper addresses data augmentation for gigapixel whole slide image (WSI) classification under multiple instance learning (MIL), where patch representations are extracted once and kept fixed for efficiency, which makes augmentation during WSI-level training difficult. It introduces Promptable Representation Distribution Learning (PRDL), which learns a patch-level representation distribution and uses augmentation prompts to guide feature-space augmentation, integrated with a DINO-style self-supervised learning (SSL) backbone. A promptable representation sampling (PRS) module samples from these distributions online during WSI training, providing controllable, efficient augmentation without extra model parameters at inference. Across three lung-related datasets, PRDL with PRS consistently outperforms state-of-the-art representation learning baselines and other WSI augmentation methods, improving the robustness and accuracy of MIL-based WSI classification. The work offers a scalable, controllable way to diversify patch representations for WSI analysis in pathology.

Abstract

Gigapixel image analysis, particularly for whole slide images (WSIs), often relies on multiple instance learning (MIL). Under the MIL paradigm, patch image representations are extracted and then fixed during the training of the MIL classifiers for efficiency. However, the invariance of these representations makes it difficult to perform data augmentation for WSI-level model training, which significantly limits the performance of downstream WSI analysis. Current data augmentation methods for gigapixel images either introduce additional computational cost or lose semantic information, so they struggle to meet the efficiency and stability requirements of WSI model training. In this paper, we propose a Promptable Representation Distribution Learning framework (PRDL) for both patch-level representation learning and WSI-level data augmentation. We also explore the use of prompts to guide data augmentation in feature space, which achieves promptable data augmentation for training robust WSI-level models. Experimental results demonstrate that the proposed method consistently outperforms state-of-the-art methods.
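
The idea of augmenting in feature space by sampling from a per-patch representation distribution can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the distribution is modeled as a diagonal Gaussian, the function name `sample_patch_features` is hypothetical, and a single scalar `strength` stands in for an augmentation prompt. The sketch only shows why such sampling is cheap: each training step draws a fresh bag of features without re-encoding any patch image.

```python
import numpy as np


def sample_patch_features(mean, log_var, strength=1.0, rng=None):
    """Draw one feature vector per patch from an assumed diagonal Gaussian.

    mean, log_var: arrays of shape (num_patches, dim), stored once per slide.
    strength: hypothetical scalar standing in for an augmentation prompt;
              0.0 returns the mean features unchanged.
    """
    rng = np.random.default_rng() if rng is None else rng
    std = np.exp(0.5 * log_var)              # log-variance -> standard deviation
    noise = rng.standard_normal(mean.shape)  # fresh noise on every call
    return mean + strength * std * noise


# Toy slide: 1000 patches with 384-dimensional features (illustrative sizes).
rng = np.random.default_rng(0)
mean = rng.standard_normal((1000, 384))
log_var = np.full((1000, 384), -2.0)

# Training: a new augmented bag can be drawn at every step.
train_bag = sample_patch_features(mean, log_var, strength=1.0, rng=rng)

# Evaluation: with strength 0.0 the bag is deterministic.
eval_bag = sample_patch_features(mean, log_var, strength=0.0, rng=rng)
```

In this sketch the sampled bag would be passed to an MIL classifier in place of the fixed features; the sampling itself adds no learned parameters.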
