U-Net in Medical Image Segmentation: A Review of Its Applications Across Modalities
Fnu Neha, Deepshikha Bhati, Deepak Kumar Shukla, Sonavi Makarand Dalvi, Nikolaos Mantzou, Safa Shubbar
TL;DR
Medical image segmentation across X-ray, MRI, CT, and Ultrasound is pivotal for diagnosis and treatment planning. The paper surveys U-Net and its variants (U-Net++ and U-Net 3+), analyzing their architectural differences, their applications to each modality, and the resulting effects on performance. It highlights limitations such as data scarcity, noise and artifacts, and computational complexity, and proposes directions such as model compression, synthetic data, multimodal transformers, and explainable AI. The review aims to guide researchers and clinicians in selecting appropriate architectures and datasets to improve clinical decision-making and patient outcomes.
Abstract
Medical imaging is essential in healthcare, providing key insights into patient anatomy and pathology that aid diagnosis and treatment. Non-invasive techniques such as X-ray, Magnetic Resonance Imaging (MRI), Computed Tomography (CT), and Ultrasound (US) capture detailed images of organs, tissues, and abnormalities. Effective analysis of these images requires precise segmentation to delineate regions of interest (ROIs), such as organs or lesions. Traditional segmentation methods, which rely on manual feature extraction, are labor-intensive, and their results vary across experts. Recent advances in Artificial Intelligence (AI) and Deep Learning (DL), particularly convolutional models such as U-Net and its variants (U-Net++ and U-Net 3+), have transformed medical image segmentation (MIS) by automating the process and improving accuracy. These models enable efficient, precise pixel-wise classification across imaging modalities, overcoming the limitations of manual segmentation. This review surveys medical imaging techniques, examines the U-Net architectures and their adaptations, and discusses their application across modalities. It also identifies common challenges in MIS and proposes potential solutions.
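
To make the notion of pixel-wise classification concrete, the following minimal sketch shows how per-class score maps, such as those a segmentation network might output, can be turned into a label mask by picking the highest-scoring class at each pixel. The function name `pixelwise_argmax`, the `[class][row][col]` layout, and the toy scores are illustrative assumptions, not part of the reviewed models.

```python
from typing import List

# Assumed layout (hypothetical): scores[class][row][col]
ScoreMaps = List[List[List[float]]]


def pixelwise_argmax(scores: ScoreMaps) -> List[List[int]]:
    """Assign each pixel the index of the class with the highest score."""
    num_classes = len(scores)
    height = len(scores[0])
    width = len(scores[0][0])
    mask = []
    for r in range(height):
        row = []
        for c in range(width):
            # Ties resolve to the lowest class index.
            best = max(range(num_classes), key=lambda k: scores[k][r][c])
            row.append(best)
        mask.append(row)
    return mask


# Toy 2x3 score maps: class 0 = background, class 1 = region of interest.
scores = [
    [[0.9, 0.2, 0.6], [0.8, 0.1, 0.7]],  # background scores
    [[0.1, 0.8, 0.4], [0.2, 0.9, 0.3]],  # ROI scores
]
mask = pixelwise_argmax(scores)
print(mask)  # [[0, 1, 0], [0, 1, 0]]
```

In this toy case the middle column is labeled as the ROI. In practice the score maps would come from a trained model and the argmax would be applied with an array library rather than Python loops.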
