TriDeNT: Triple Deep Network Training for Privileged Knowledge Distillation in Histopathology
Lucas Farndale, Robert Insall, Ke Yuan
TL;DR
The paper addresses the limitation that computational pathology models often cannot leverage privileged data that are available during training but not at inference. It introduces TriDeNT, a three-branch self-supervised framework that distills information from privileged modalities such as immunohistochemistry, spatial transcriptomics, and expert nuclei annotations into representations learned from a primary histopathology input. Across diverse datasets and tasks, TriDeNT yields substantial improvements over both unprivileged Siamese baselines and, in many cases, supervised baselines, including up to 101% gains, and demonstrates robustness to harmful privileged data and domain shifts. The approach enables learning from scarce or costly data to enhance routine inputs, with potential to uncover biologically meaningful patterns and improve generalization in computational pathology.
Abstract
Computational pathology models rarely utilise data that will not be available for inference. This means most models cannot learn from highly informative data such as additional immunohistochemical (IHC) stains and spatial transcriptomics. We present TriDeNT, a novel self-supervised method for utilising privileged data that is not available during inference to improve performance. We demonstrate the efficacy of this method for a range of different paired data including immunohistochemistry, spatial transcriptomics and expert nuclei annotations. In all settings, TriDeNT outperforms other state-of-the-art methods in downstream tasks, with observed improvements of up to 101%. Furthermore, we provide qualitative and quantitative measurements of the features learned by these models and how they differ from baselines. TriDeNT offers a novel method to distil knowledge from scarce or costly data during training, to create significantly better models for routine inputs.
