Radio-opaque artefacts in digital mammography: automatic detection and analysis of downstream effects
Amelia Schueppert, Ben Glocker, Mélanie Roschewitz
TL;DR
Investigates how radio-opaque artefacts in mammography affect machine learning classifiers used for cancer screening and density assessment. Builds a large, manually annotated artefact dataset (22,012 images) and trains a multi-label detector with ResNet-50 to identify five artefact types. Demonstrates artefacts are prevalent (≈22% of images) and can significantly degrade downstream task performance and shift output distributions and thresholds. All annotations, code, and predictions are released to facilitate robust evaluation and bias-aware development of mammography AI systems.
Abstract
This study investigates the effects of radio-opaque artefacts, such as skin markers, breast implants, and pacemakers, on mammography classification models. After manually annotating 22,012 mammograms from the publicly available EMBED dataset, a robust multi-label artefact detector was developed to identify five distinct artefact types (circular and triangular skin markers, breast implants, support devices and spot compression structures). Subsequent experiments on two clinically relevant tasks $-$ breast density assessment and cancer screening $-$ revealed that these artefacts can significantly affect model performance, alter classification thresholds, and distort output distributions. These findings underscore the importance of accurate automatic artefact detection for developing reliable and robust classification models in digital mammography. To facilitate future research our annotations, code, and model predictions are made publicly available.
