SAM3-Adapter: Efficient Adaptation of Segment Anything 3 for Camouflage Object Segmentation, Shadow Detection, and Medical Image Segmentation

Tianrun Chen; Runlong Cao; Xinda Yu; Lanyun Zhu; Chaotao Ding; Deyi Ji; Cheng Chen; Qi Zhu; Chunyan Xu; Papa Mao; Ying Zang

TL;DR

This paper targets fine-grained segmentation tasks on which large foundation models struggle, pairing the Segment Anything 3 (SAM3) backbone with SAM3-Adapter, a lightweight, per-stage adapter system. Each adapter generates a task-specific prompt $P^i$ from its stage input $F_i$ (composed of signals such as $F_{pe}$ and $F_{hfc}$) via $P^i = {\rm MLP}_{up}(\text{GELU}({\rm MLP}_{tune}^i(F_i)))$ and injects it into the transformer layers to tailor segmentation to each domain. The approach reports state-of-the-art performance across camouflaged object detection ($S_\alpha$, $E_\phi$, MAE), shadow detection (BER), polyp segmentation ($mDice$, $mIoU$), and cell segmentation (F1), while staying efficient by freezing the SAM3 encoder and reusing a compact adapter. Experiments on COD10K, CAMO, CHAMELEON, ISTD, Kvasir-SEG, and NeurIPS 2022 Cell Segmentation show consistent gains and generalizability, supported by open-source code and data processing pipelines. The results suggest that SAM3, when combined with task-specific adapters, yields substantial, practical gains for domain-specific segmentation tasks.
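The prompt formula above can be illustrated with a minimal, dependency-free sketch. It is not the authors' implementation. Several details are assumptions made for illustration: each MLP is reduced to a single linear layer, ${\rm MLP}_{up}$ is shared across adapters (inferred only from its missing stage index), $F_i$ is formed as the sum $F_{pe} + F_{hfc}$, the prompt is injected by element-wise addition to each token, and all dimensions and names (`Adapter`, `init_linear`, `inject`) are hypothetical.

```python
import math
import random


def gelu(x):
    # Exact GELU using the Gaussian error function.
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))


def init_linear(in_dim, out_dim, rng):
    # Hypothetical helper: random weights (out_dim x in_dim) and zero bias.
    scale = 1.0 / math.sqrt(in_dim)
    weight = [[rng.uniform(-scale, scale) for _ in range(in_dim)]
              for _ in range(out_dim)]
    return weight, [0.0] * out_dim


def linear(x, weight, bias):
    # y = W x + b for a single feature vector x.
    return [sum(w * v for w, v in zip(row, x)) + b
            for row, b in zip(weight, bias)]


class Adapter:
    """One per-stage adapter: P^i = MLP_up(GELU(MLP_tune^i(F_i)))."""

    def __init__(self, feat_dim, hidden_dim, shared_up, rng):
        self.tune = init_linear(feat_dim, hidden_dim, rng)  # MLP_tune^i, per stage
        self.up = shared_up  # MLP_up, assumed shared across stages

    def prompt(self, f_i):
        hidden = [gelu(v) for v in linear(f_i, *self.tune)]
        return linear(hidden, *self.up)


def inject(tokens, prompt):
    # Assumed injection rule: add the prompt to every token embedding.
    return [[t + p for t, p in zip(token, prompt)] for token in tokens]


rng = random.Random(0)
feat_dim, hidden_dim, embed_dim = 8, 4, 16  # illustrative sizes only

shared_up = init_linear(hidden_dim, embed_dim, rng)
adapters = [Adapter(feat_dim, hidden_dim, shared_up, rng) for _ in range(3)]

f_pe = [rng.gauss(0.0, 1.0) for _ in range(feat_dim)]   # stand-in for F_pe
f_hfc = [rng.gauss(0.0, 1.0) for _ in range(feat_dim)]  # stand-in for F_hfc
f_i = [a + b for a, b in zip(f_pe, f_hfc)]              # assumed combination

prompts = [adapter.prompt(f_i) for adapter in adapters]

tokens = [[0.0] * embed_dim for _ in range(5)]  # stand-in for one layer's tokens
adapted = inject(tokens, prompts[0])
```

In this sketch only the small `tune` and `up` matrices would be trained, which mirrors the efficiency argument of keeping the SAM3 encoder frozen.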

Abstract

The rapid rise of large-scale foundation models has reshaped the landscape of image segmentation, with models such as Segment Anything achieving unprecedented versatility across diverse vision tasks. However, previous generations, including SAM and its successor, still struggle with fine-grained, low-level segmentation challenges such as camouflaged object detection, medical image segmentation, cell image segmentation, and shadow detection. To address these limitations, we originally proposed SAM-Adapter in 2023, demonstrating substantial gains on these difficult scenarios. With the emergence of Segment Anything 3 (SAM3), a more efficient and higher-performing evolution with a redesigned architecture and improved training pipeline, we revisit these long-standing challenges. In this work, we present SAM3-Adapter, the first adapter framework tailored for SAM3 that unlocks its full segmentation capability. SAM3-Adapter not only reduces computational overhead but also consistently surpasses both SAM- and SAM2-based solutions, establishing new state-of-the-art results across multiple downstream tasks, including medical imaging, camouflaged (concealed) object segmentation, and shadow detection. Built upon the modular and composable design philosophy of the original SAM-Adapter, SAM3-Adapter provides stronger generalizability, richer task adaptability, and significantly improved segmentation precision. Extensive experiments confirm that integrating SAM3 with our adapter yields superior accuracy, robustness, and efficiency compared to all prior SAM-based adaptations. We hope SAM3-Adapter can serve as a foundation for future research and practical segmentation applications. Code, pre-trained models, and data processing pipelines are available.
