CellOMaps: A Compact Representation for Robust Classification of Lung Adenocarcinoma Growth Patterns
Arwa Al-Rubaian, Gozde N. Gunesli, Wajd A. Althakfi, Ayesha Azam, David Snead, Nasir M. Rajpoot, Shan E Ahmed Raza
TL;DR
This study introduces CellOMaps, a compact, nuclei-centric representation for LUAD growth pattern classification, enabling robust, generalizable discrimination across five histological patterns plus normal tissue. By detecting nuclei with HoVer-Net, encoding their types and centroids into a 3-channel map, and classifying using a dilated ResNet-50 on 448×448 tiles, the approach achieves state-of-the-art performance under patient-level evaluation and demonstrates strong generalization on external datasets. The work also presents preliminary evidence that pattern-level outputs can inform Tumor Mutational Burden (TMB) stratification, highlighting potential clinical utility. The authors provide extensive ablations, validate on multiple datasets, and discuss limitations such as reliance on nuclei detection accuracy, while making code and annotations publicly available for reproducibility and further research.
Abstract
Lung adenocarcinoma (LUAD) is a morphologically heterogeneous disease, characterized by five primary histological growth patterns. The classification of such patterns is crucial due to their direct relation to prognosis but the high subjectivity and observer variability pose a major challenge. Although several studies have developed machine learning methods for growth pattern classification, they either only report the predominant pattern per slide or lack proper evaluation. We propose a generalizable machine learning pipeline capable of classifying lung tissue into one of the five patterns or as non-tumor. The proposed pipeline's strength lies in a novel compact Cell Organization Maps (cellOMaps) representation that captures the cellular spatial patterns from Hematoxylin and Eosin whole slide images (WSIs). The proposed pipeline provides state-of-the-art performance on LUAD growth pattern classification when evaluated on both internal unseen slides and external datasets, significantly outperforming the current approaches. In addition, our preliminary results show that the model's outputs can be used to predict patients Tumor Mutational Burden (TMB) levels.
