M&M: Tackling False Positives in Mammography with a Multi-view and Multi-instance Learning Sparse Detector
Yen Nhi Truong Vu, Dan Guo, Ahmed Taha, Jason Su, Thomas Paul Matthews
TL;DR
M&M tackles false positives in screening mammography by modeling three clinical realities: a single malignant finding per image, dual-view CC and MLO exams, and a predominance of negative images. It introduces an end-to-end framework combining Sparse R-CNN with dual classification heads, a cross-view attention module, and MIL to train on unannotated images, using MIL aggregation such as NoisyOR to derive image- and breast-level predictions. The method achieves strong detection and breast-level classification performance across five datasets, with recall at 0.1 FP/image of 87.7% on OPTIMAM and a FP-gap reduction to 3.5 points, along with high breast AUCs. The work highlights the value of sparsity, cross-view reasoning, and MIL for clinically relevant mammography analysis and demonstrates practical improvements over dense detectors.
Abstract
Deep-learning-based object detection methods show promise for improving screening mammography, but high rates of false positives can hinder their effectiveness in clinical practice. To reduce false positives, we identify three challenges: (1) unlike natural images, a malignant mammogram typically contains only one malignant finding; (2) mammography exams contain two views of each breast, and both views ought to be considered to make a correct assessment; (3) most mammograms are negative and do not contain any findings. In this work, we tackle the three aforementioned challenges by: (1) leveraging Sparse R-CNN and showing that sparse detectors are more appropriate than dense detectors for mammography; (2) including a multi-view cross-attention module to synthesize information from different views; (3) incorporating multi-instance learning (MIL) to train with unannotated images and perform breast-level classification. The resulting model, M&M, is a Multi-view and Multi-instance learning system that can both localize malignant findings and provide breast-level predictions. We validate M&M's detection and classification performance using five mammography datasets. In addition, we demonstrate the effectiveness of each proposed component through comprehensive ablation studies.
