Efficient Automated Diagnosis of Retinopathy of Prematurity by Customize CNN Models
Farzan Saeedi, Sanaz Keshvari, Nasser Shoeibi
TL;DR
This study tackles automated Retinopathy of Prematurity (ROP) diagnosis under data-scarce conditions by coupling data augmentation with a customized CNN architecture and a voting system across multiple retinal views. Leveraging a MobileNetV2 backbone as a baseline, the authors design a task-specific CNN that uses 1×1 and 2×2 convolutions, Batch Normalization, and a final fully connected layer to extract ~160 features before binary classification, optimized with Binary CrossEntropy. Experimental results show that while pre-trained MobileNet benefits from normalization and augmentation, the customized CNN with voting substantially outperforms the baseline and can achieve near-perfect accuracy after fine-tuning, with favorable time-complexity trade-offs suitable for deployment on specialized hardware. The approach promises practical clinical utility by delivering accurate, efficient ROP screening and highlighting directions for extending the method to stage/zone classification and on-device deployment.
Abstract
This paper encompasses an in-depth examination of Retinopathy of Prematurity (ROP) diagnosis, employing advanced deep learning methodologies. Our focus centers on refining and evaluating CNN-based approaches for precise and efficient ROP detection. We navigate the complexities of dataset curation, preprocessing strategies, and model architecture, aligning with research objectives encompassing model effectiveness, computational cost analysis, and time complexity assessment. Results underscore the supremacy of tailored CNN models over pre-trained counterparts, evident in heightened accuracy and F1-scores. Implementation of a voting system further enhances performance. Additionally, our study reveals the potential of the proposed customized CNN model to alleviate computational burdens associated with deep neural networks. Furthermore, we showcase the feasibility of deploying these models within dedicated software and hardware configurations, highlighting their utility as valuable diagnostic aids in clinical settings. In summary, our discourse significantly contributes to ROP diagnosis, unveiling the efficacy of deep learning models in enhancing diagnostic precision and efficiency.
