Cell Maps Representation For Lung Adenocarcinoma Growth Patterns Classification In Whole Slide Images
Arwa Al-Rubaian, Gozde N. Gunesli, Wajd A. Althakfi, Ayesha Azam, Nasir Rajpoot, Shan E Ahmed Raza
TL;DR
This work tackles the challenge of classifying LUAD tissue growth patterns at the tile level within whole slide images, a task confounded by tumor heterogeneity. It introduces a cell maps representation, generated from H&E WSIs using Hover-Net to produce three nucleus-type masks that are stacked into a compact input for a CNN (ResNet50) trained to predict six classes (five patterns plus normal). Under proper WSI-based validation, the proposed method outperforms state-of-the-art baselines and demonstrates robust generalization on unseen slides, addressing biases common in tile-based splits. The approach has potential to improve prognostic assessments and streamline pathology workflows by focusing on cellular composition rather than solely tissue morphology.
Abstract
Lung adenocarcinoma is a morphologically heterogeneous disease, characterized by five primary histologic growth patterns. The quantity of these patterns can be related to tumor behavior and has a significant impact on patient prognosis. In this work, we propose a novel machine learning pipeline capable of classifying tissue tiles into one of the five patterns or as non-tumor, with an Area Under the Receiver Operating Characteristic Curve (AUCROC) score of 0.97. Our model's strength lies in its comprehensive consideration of cellular spatial patterns, where it first generates cell maps from Hematoxylin and Eosin (H&E) whole slide images (WSIs), which are then fed into a convolutional neural network classification model. Exploiting these cell maps provides the model with robust generalizability to new data, achieving approximately 30% higher accuracy on unseen test-sets compared to current state of the art approaches. The insights derived from our model can be used to predict prognosis, enhancing patient outcomes.
