CLEAR-HPV: Interpretable Concept Discovery for HPV-Associated Morphology in Whole-Slide Histology
Weiyi Qin, Yingci Liu-Swetz, Shiwei Tan, Hao Wang
TL;DR
CLEAR-HPV reinterprets the attention-guided MIL latent space for HPV-associated histology by discovering discrete, interpretable morphologic concepts without concept-level labels. It operates in the attention-weighted $h$-space to identify keratinizing, basaloid, and stromal patterns, producing spatial concept maps and compact $K$-dimensional concept-fraction vectors (with $K=10$) that preserve the original predictive power. The framework proves backbone-agnostic, generalizes across TCGA-HNSCC, TCGA-CESC, and CPTAC-HNSCC, and demonstrates stable, clinically meaningful concepts across cohorts, enabling interpretability without sacrificing performance. This approach offers a practical path toward interpretable, translatable computational pathology tools and can extend to other MIL-based histopathology tasks.
Abstract
Human papillomavirus (HPV) status is a critical determinant of prognosis and treatment response in head and neck and cervical cancers. Although attention-based multiple instance learning (MIL) achieves strong slide-level prediction for HPV-related whole-slide histopathology, it provides limited morphologic interpretability. To address this limitation, we introduce Concept-Level Explainable Attention-guided Representation for HPV (CLEAR-HPV), a framework that restructures the MIL latent space using attention to enable concept discovery without requiring concept labels during training. Operating in an attention-weighted latent space, CLEAR-HPV automatically discovers keratinizing, basaloid, and stromal morphologic concepts, generates spatial concept maps, and represents each slide using a compact concept-fraction vector. CLEAR-HPV's concept-fraction vectors preserve the predictive information of the original MIL embeddings while reducing the high-dimensional feature space (e.g., 1536 dimensions) to only 10 interpretable concepts. CLEAR-HPV generalizes consistently across TCGA-HNSCC, TCGA-CESC, and CPTAC-HNSCC, providing compact, concept-level interpretability through a general, backbone-agnostic framework for attention-based MIL models of whole-slide histopathology.
