Location-based Radiology Report-Guided Semi-supervised Learning for Prostate Cancer Detection
Alex Chen, Nathan Lay, Stephanie Harmon, Kutsev Ozyoruk, Enis Yilmaz, Brad J. Wood, Peter A. Pinto, Peter L. Choyke, Baris Turkbey
TL;DR
This work tackles the annotation bottleneck in MRI-based prostate cancer detection by introducing a lesion location-guided semi-supervised learning framework that leverages radiology report information to refine pseudo labels on unlabeled data. A teacher–student architecture with nnU-Net segmentation uses report-derived lesion locations to correct pseudo labels, outperforming supervised and lesion-count SSL methods, particularly when manual annotations are scarce. The approach maintains competitive segmentation accuracy while reducing false positives, demonstrating a practical path to scale prostate cancer detection models with larger unlabeled datasets. Limitations include reliance on structured reports and PI-RADS-based localization, suggesting future integration with unstructured-report processing and extension to other organ systems.
Abstract
Prostate cancer is one of the most prevalent malignancies in the world. While deep learning has potential to further improve computer-aided prostate cancer detection on MRI, its efficacy hinges on the exhaustive curation of manually annotated images. We propose a novel methodology of semisupervised learning (SSL) guided by automatically extracted clinical information, specifically the lesion locations in radiology reports, allowing for use of unannotated images to reduce the annotation burden. By leveraging lesion locations, we refined pseudo labels, which were then used to train our location-based SSL model. We show that our SSL method can improve prostate lesion detection by utilizing unannotated images, with more substantial impacts being observed when larger proportions of unannotated images are used.
