Rethinking Whole-Body CT Image Interpretation: An Abnormality-Centric Approach
Ziheng Zhao, Lisong Dai, Ya Zhang, Yanfeng Wang, Weidi Xie
TL;DR
The paper advances abnormality-centric CT interpretation by introducing a 404-category taxonomy and OmniAbnorm-CT-14K, a large-scale multi-plane whole-body CT dataset with detailed grounding annotations. It then proposes OmniAbnorm-CT, a vision-language-grounded system that grounding abnormalities and generates clinically oriented descriptions under text prompts or visual cues. The authors define three practical tasks and a clinically grounded AbnormRubric metric, showing substantial improvements over baselines in both internal and external validations. Together, these contributions enable more explainable, comprehensive, and actionable CT interpretation across the entire body, with potential to transform radiology workflows.
Abstract
Automated interpretation of CT images-particularly localizing and describing abnormal findings across multi-plane and whole-body scans-remains a significant challenge in clinical radiology. This work aims to address this challenge through four key contributions: (i) On taxonomy, we collaborate with senior radiologists to propose a comprehensive hierarchical classification system, with 404 representative abnormal findings across all body regions; (ii) On data, we contribute a dataset containing over 14.5K CT images from multiple planes and all human body regions, and meticulously provide grounding annotations for over 19K abnormalities, each linked to the detailed description and cast into the taxonomy; (iii) On model development, we propose OmniAbnorm-CT, which can automatically ground and describe abnormal findings on multi-plane and whole-body CT images based on text queries, while also allowing flexible interaction through visual prompts; (iv) On evaluation, we establish three representative tasks based on real clinical scenarios, and introduce a clinically grounded metric to assess abnormality descriptions. Through extensive experiments, we show that OmniAbnorm-CT can significantly outperform existing methods in both internal and external validations, and across all the tasks.
