Does context matter in digital pathology?
Paulina Tomaszewska, Mateusz Sperkowski, Przemysław Biecek
TL;DR
This study investigates whether deep learning vision models in digital pathology rely on contextual tissue information when classifying histopathology patches. By systematically abating context with a black border around a central patch in PatchCamelyon data, the authors quantify how context size affects performance across CNNs and Transformer-based architectures, revealing that recall and overall accuracy degrade as context is reduced. They show variation in sensitivity to context across architectures and pretraining schemes (e.g., Swin, ViT variants, MoCo, MAE), with some models exhibiting frequent prediction swings as context changes. The work highlights the risk of relying on partial context in clinical settings and suggests directions for explanation methods and histopathologist collaboration to validate swinging cases and improve robustness.
Abstract
The development of Artificial Intelligence for healthcare is of great importance. Models can sometimes achieve even superior performance to human experts, however, they can reason based on spurious features. This is not acceptable to the experts as it is expected that the models catch the valid patterns in the data following domain expertise. In the work, we analyse whether Deep Learning (DL) models for vision follow the histopathologists' practice so that when diagnosing a part of a lesion, they take into account also the surrounding tissues which serve as context. It turns out that the performance of DL models significantly decreases when the amount of contextual information is limited, therefore contextual information is valuable at prediction time. Moreover, we show that the models sometimes behave in an unstable way as for some images, they change the predictions many times depending on the size of the context. It may suggest that partial contextual information can be misleading.
