Rethinking Lung Cancer Screening: AI Nodule Detection and Diagnosis Outperforms Radiologists, Leading Models, and Standards Beyond Size and Growth

Sylvain Bodard; Pierre Baudot; Benjamin Renoust; Charles Voyton; Gwendoline De Bie; Ezequiel Geremia; Van-Khoa Le; Danny Francis; Pierre-Henri Siot; Yousra Haddou; Vincent Bobin; Jean-Christophe Brisset; Carey C. Thomson; Valerie Bourdes; Benoit Huet

TL;DR

This work redefines lung cancer screening by performing both nodule detection and nodule-level malignancy diagnosis on low-dose CT (LDCT) scans, using a factorized ensemble of shallow models and radiomics to overcome limits in dataset scale and explainability. The system predicts malignancy directly at the nodule level and integrates context from the full CT volume, achieving an AUC of 0.98 on internal tests and 0.945 on an independent cohort, with 0.5 false positives per scan at 99.3% sensitivity. Across nodule sizes and in early-stage cancers, the model outperforms radiologists, Lung-RADS, and several leading AI baselines (Sybil, Liao, Ardila, NLST Brock, Mayo), and it diagnoses indeterminate and slow-growing nodules up to one year earlier than radiologists. The approach is a modular ensemble, trained on a large cohort, that combines 3D/2D CNNs, radiomics, and a full-CT context model through calibrated stacking. It could reduce unnecessary follow-ups and enable earlier intervention in lung cancer screening programs.

Abstract

Early detection of malignant lung nodules is critical, but screening's dependence on nodule size and growth inherently delays diagnosis. We present an AI system that redefines lung cancer screening by performing both detection and malignancy diagnosis directly at the nodule level on low-dose CT scans. To address limitations in dataset scale and explainability, we designed an ensemble of shallow deep learning models and specialized feature-based models. Trained and evaluated on 25,709 scans with 69,449 annotated nodules, the system outperforms radiologists, Lung-RADS, and leading AI models (Sybil, Brock, Google, Kaggle). It achieves an area under the receiver operating characteristic curve (AUC) of 0.98 internally and 0.945 on an independent cohort. With 0.5 false positives per scan at 99.3% sensitivity, it addresses key barriers to AI adoption. Critically, it outperforms radiologists across all nodule sizes and stages, excelling in stage 1 cancers, and it outperforms all growth-based metrics, including the least accurate: Volume-Doubling Time. It also surpasses radiologists by up to one year in diagnosing indeterminate and slow-growing nodules.
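To make the reported metrics concrete, the following is a minimal sketch of how an AUC and a "false positives per scan at a given sensitivity" operating point can be computed from per-nodule scores. It is not the authors' evaluation code: the function names `auc` and `operating_point`, the label convention (1 = positive, 0 = negative), and the toy numbers are assumptions made for illustration only.

```python
from typing import Sequence, Tuple


def auc(scores: Sequence[float], labels: Sequence[int]) -> float:
    """Rank-based AUC: the probability that a randomly chosen positive
    scores higher than a randomly chosen negative (ties count as 0.5)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def operating_point(
    scores: Sequence[float],
    labels: Sequence[int],
    n_scans: int,
    threshold: float,
) -> Tuple[float, float]:
    """Return (sensitivity, false positives per scan) when every candidate
    with score >= threshold is called positive. n_scans is passed in
    explicitly so that scans without any candidate still count."""
    n_pos = sum(1 for y in labels if y == 1)
    if n_pos == 0 or n_scans <= 0:
        raise ValueError("need at least one positive and one scan")
    tp = sum(1 for s, y in zip(scores, labels) if y == 1 and s >= threshold)
    fp = sum(1 for s, y in zip(scores, labels) if y == 0 and s >= threshold)
    return tp / n_pos, fp / n_scans


if __name__ == "__main__":
    # Hypothetical toy data: six candidates spread over three scans.
    toy_scores = [0.9, 0.4, 0.7, 0.6, 0.2, 0.1]
    toy_labels = [1, 1, 0, 1, 0, 0]
    print(auc(toy_scores, toy_labels))
    print(operating_point(toy_scores, toy_labels, n_scans=3, threshold=0.5))
```

Lowering the threshold raises sensitivity at the cost of more false positives per scan, so a figure such as "0.5 false positives per scan at 99.3% sensitivity" describes one point on that trade-off rather than a threshold-free summary like the AUC.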
