SPEGNet: Synergistic Perception-Guided Network for Camouflaged Object Detection
Baber Jan, Saeed Anwar, Aiman H. El-Maleh, Abdul Jabbar Siddiqui, Abdul Bais
TL;DR
Camouflaged object detection (COD) is hindered by intrinsic similarity and edge disruption, which traditional architectures address via accumulating modules and reduced-resolution processing. SPEGNet introduces a synergistic design with Contextual Feature Integration, Edge Feature Extraction, and Progressive Edge-guided Decoder to fuse multi-scale context and boundary information in a single, cohesive framework. It delivers state-of-the-art performance on COD10K, NC4K, and CAMO with real-time inference, and demonstrates strong cross-domain transfer to medical imaging and agriculture without architectural changes. The work highlights a shift from modular accumulation toward principled integration of perception mechanisms, and discusses remaining challenges such as resolution-dependent detection boundaries and annotation quality in COD benchmarks.
Abstract
Camouflaged object detection segments objects with intrinsic similarity and edge disruption. Current detection methods rely on accumulated complex components. Each approach adds components such as boundary modules, attention mechanisms, and multi-scale processors independently. This accumulation creates a computational burden without proportional gains. To manage this complexity, they process at reduced resolutions, eliminating fine details essential for camouflage. We present SPEGNet, addressing fragmentation through a unified design. The architecture integrates multi-scale features via channel calibration and spatial enhancement. Boundaries emerge directly from context-rich representations, maintaining semantic-spatial alignment. Progressive refinement implements scale-adaptive edge modulation with peak influence at intermediate resolutions. This design strikes a balance between boundary precision and regional consistency. SPEGNet achieves 0.887 $S_α$ on CAMO, 0.890 on COD10K, and 0.895 on NC4K, with real-time inference speed. Our approach excels across scales, from tiny, intricate objects to large, pattern-similar ones, while handling occlusion and ambiguous boundaries. Code, model weights, and results are available on \href{https://github.com/Baber-Jan/SPEGNet}{https://github.com/Baber-Jan/SPEGNet}.
