Evaluation framework for Image Segmentation Algorithms
Tatiana Merkulova, Bharani Jayakumar
TL;DR
This paper addresses evaluating segmentation methods across naive, ML, and DL families with a focus on interactive segmentation. It introduces a framework that measures initial and refined performance using $IoU$ alongside computation and user interaction times, enabling fair comparisons between automated, interactive, and hybrid approaches. It shows that DL methods such as U-Net and Mask R-CNN achieve the highest accuracy at the cost of computational resources, while naive methods remain fast baselines and ML methods offer a middle ground. The findings inform practical method selection for different application needs and point to future directions, including real-time feedback and weakly/self-supervised learning to improve efficiency and robustness.
Abstract
This paper presents a comprehensive evaluation framework for image segmentation algorithms, encompassing naive methods, machine learning approaches, and deep learning techniques. We begin by introducing the fundamental concepts and importance of image segmentation, and the role of interactive segmentation in enhancing accuracy. A detailed background theory section explores various segmentation methods, including thresholding, edge detection, region growing, feature extraction, random forests, support vector machines, convolutional neural networks, U-Net, and Mask R-CNN. The implementation and experimental setup are thoroughly described, highlighting three primary approaches: algorithm assisting user, user assisting algorithm, and hybrid methods. Evaluation metrics such as Intersection over Union (IoU), computation time, and user interaction time are employed to measure performance. A comparative analysis presents detailed results, emphasizing the strengths, limitations, and trade-offs of each method. The paper concludes with insights into the practical applicability of these approaches across various scenarios and outlines future work, focusing on expanding datasets, developing more representative approaches, integrating real-time feedback, and exploring weakly supervised and self-supervised learning paradigms to enhance segmentation accuracy and efficiency. Keywords: Image Segmentation, Interactive Segmentation, Machine Learning, Deep Learning, Computer Vision
