Segment Anything Is Not Always Perfect: An Investigation of SAM on Different Real-world Applications
Wei Ji, Jingjing Li, Qi Bi, Tingwei Liu, Wenbo Li, Li Cheng
TL;DR
This paper evaluates the Segment Anything Model (SAM) across a broad set of real-world segmentation tasks—natural images, agriculture, manufacturing, remote sensing, and healthcare—using qualitative visual analyses and quantitative MAE-based metrics. It shows that SAM generalizes well for common scenes and straightforward prompts, but struggles with fine-grained details, low-contrast scenes, small or irregular targets, and professional medical data, leading to notable performance gaps with state-of-the-art methods in several domains. The authors discuss practical limitations, such as foreground bias and reliance on domain knowledge, and propose future directions including domain-specific datasets, expanded prompting, multi-modal inputs, video-capable variants, and semi-supervised strategies to adapt SAM to specialized tasks. Overall, the work provides a structured assessment of SAM’s applicability beyond natural images and outlines concrete paths to improve robustness and domain adaptation for practical segmentation challenges.
Abstract
Recently, Meta AI Research approaches a general, promptable Segment Anything Model (SAM) pre-trained on an unprecedentedly large segmentation dataset (SA-1B). Without a doubt, the emergence of SAM will yield significant benefits for a wide array of practical image segmentation applications. In this study, we conduct a series of intriguing investigations into the performance of SAM across various applications, particularly in the fields of natural images, agriculture, manufacturing, remote sensing, and healthcare. We analyze and discuss the benefits and limitations of SAM, while also presenting an outlook on its future development in segmentation tasks. By doing so, we aim to give a comprehensive understanding of SAM's practical applications. This work is expected to provide insights that facilitate future research activities toward generic segmentation. Source code is publicly available.
