Tissue Concepts: supervised foundation models in computational pathology
Till Nicke, Jan Raphael Schaefer, Henning Hoefener, Friedrich Feuerhake, Dorit Merhof, Fabian Kiessling, Johannes Lotz
TL;DR
This work presents Tissue Concepts, a supervised foundation-model-style encoder trained via multi-task learning on 16 pathology-related tasks to produce a robust tissue-concept representation with approximately $912{,}000$ patches. The encoder is evaluated through a MIL-based WSI classification pipeline across four cancers (breast, colon, lung, prostate) and multiple centers, showing performance comparable to self-supervised baselines while using only a fraction of the data and resources. Across breast, prostate, colorectal, and lung datasets, Tissue Concepts demonstrates strong cross-center generalization and outperforms ImageNet baselines, underscoring the value of domain-specific supervised pre-training for computational pathology. The study highlights data- and energy-efficiency benefits, while identifying ongoing challenges in cross-center transfer and the potential for organ-specific fine-tuning to further improve performance and generalizability.
Abstract
Due to the increasing workload of pathologists, the need for automation to support diagnostic tasks and quantitative biomarker evaluation is becoming more and more apparent. Foundation models have the potential to improve generalizability within and across centers and serve as starting points for data efficient development of specialized yet robust AI models. However, the training foundation models themselves is usually very expensive in terms of data, computation, and time. This paper proposes a supervised training method that drastically reduces these expenses. The proposed method is based on multi-task learning to train a joint encoder, by combining 16 different classification, segmentation, and detection tasks on a total of 912,000 patches. Since the encoder is capable of capturing the properties of the samples, we term it the Tissue Concepts encoder. To evaluate the performance and generalizability of the Tissue Concepts encoder across centers, classification of whole slide images from four of the most prevalent solid cancers - breast, colon, lung, and prostate - was used. The experiments show that the Tissue Concepts model achieve comparable performance to models trained with self-supervision, while requiring only 6% of the amount of training patches. Furthermore, the Tissue Concepts encoder outperforms an ImageNet pre-trained encoder on both in-domain and out-of-domain data.
