Efficient Special Stain Classification
Oskar Thaeter, Christian Grashei, Anette Haas, Elisa Schmoeckel, Han Li, Peter J. Schüffler
TL;DR
This study tackles automating the classification of 16 stain classes, including 14 common special stains and H&E variants, to improve metadata quality in digital pathology. It compares a computationally intensive patch-based Multi-Instance Learning approach with ABMIL against a lightweight thumbnail-based method, evaluating both internal (TUM) and external (TCGA) data. Internal results favor MIL for maximum slide-level precision, while external generalization and throughput strongly favor the thumbnail classifier, which achieves orders-of-magnitude higher throughput and better domain robustness. The findings suggest a practical trade-off: patch-based methods excel in fine-grained discrimination, but thumbnail-based classification provides a scalable, robust, and deployment-friendly solution for routine quality control in institutional digital pathology workflows, with potential extensions to uncertainty estimation and open-set handling.
Abstract
Stains are essential in histopathology to visualize specific tissue characteristics, with Haematoxylin and Eosin (H&E) serving as the clinical standard. However, pathologists frequently utilize a variety of special stains for the diagnosis of specific morphologies. Maintaining accurate metadata for these slides is critical for quality control in clinical archives and for the integrity of computational pathology datasets. In this work, we compare two approaches for automated classification of stains using whole slide images, covering the 14 most commonly used special stains in our institute alongside standard and frozen-section H&E. We evaluate a Multi-Instance Learning (MIL) pipeline and a proposed lightweight thumbnail-based approach. On internal test data, MIL achieved the highest performance (macro F1: 0.941 for 16 classes; 0.969 for 14 merged classes), while the thumbnail approach remained competitive (0.897 and 0.953, respectively). On external TCGA data, the thumbnail model generalized best (weighted F1: 0.843 vs. 0.807 for MIL). The thumbnail approach also increased throughput by two orders of magnitude (5.635 vs. 0.018 slides/s for MIL with all patches). We conclude that thumbnail-based classification provides a scalable and robust solution for routine visual quality control in digital pathology workflows.
