CT-AGRG: Automated Abnormality-Guided Report Generation from 3D Chest CT Volumes
Theo Di Piazza, Carole Lazarus, Olivier Nempont, Loic Boussel
TL;DR
The paper tackles unguided 3D chest CT report generation by introducing CT-AGRG, a two-stage architecture that first detects 18 abnormalities and then generates per-abnormality sentences using a GPT-2 decoder conditioned on anomaly-specific embeddings. The approach leverages a pre-trained visual encoder and per-label projections to create focused embeddings, with a lightweight MLP translating these into textual descriptions via pseudo self-attention. An extensive CT-RATE evaluation shows significant improvements in clinical-efficacy and natural-language-generation metrics over CT2Rep, corroborated by an ablative analysis that confirms the value of multi-task classification and latent-space augmentation. The method offers improved report completeness and clinical relevance while maintaining feasible training requirements on standard hardware, representing a practical advance for automated radiology reporting.
Abstract
The rapid increase of computed tomography (CT) scans and their time-consuming manual analysis have created an urgent need for robust automated analysis techniques in clinical settings. These aim to assist radiologists and help them managing their growing workload. Existing methods typically generate entire reports directly from 3D CT images, without explicitly focusing on observed abnormalities. This unguided approach often results in repetitive content or incomplete reports, failing to prioritize anomaly-specific descriptions. We propose a new anomaly-guided report generation model, which first predicts abnormalities and then generates targeted descriptions for each. Evaluation on a public dataset demonstrates significant improvements in report quality and clinical relevance. We extend our work by conducting an ablation study to demonstrate its effectiveness.
