Advances in Deep Concealed Scene Understanding
Deng-Ping Fan, Ge-Peng Ji, Peng Xu, Ming-Ming Cheng, Christos Sakaridis, Luc Van Gool
TL;DR
This work surveys deep learning-driven Concealed Scene Understanding (CSU), delineating its image- and video-level tasks (COS, COL, CIR, CIS, COC, VCOD, VCOS) and their formulations. It contributes the largest COS benchmark, introduces CDS2K, a dataset for concealed defect segmentation in industrial scenarios, and discusses open problems and directions, including domain adaptation and data-efficient learning. The paper provides extensive quantitative and qualitative benchmarks on COD10K, NC4K, CAMO, and CDS2K, highlighting the rise of transformer-based methods (e.g., CamoFormer, HitNet) for detecting camouflaged objects and the ongoing need for cross-domain generalization. The findings underscore the potential of semantic-level reasoning and vision-language integration to narrow the gap between human and machine understanding of concealed scenes, with practical impact in safety, industrial inspection, and medical imaging.
Abstract
Concealed scene understanding (CSU) is a hot computer vision topic that aims to perceive objects exhibiting camouflage. The current boom in techniques and applications warrants an up-to-date survey, which can help researchers better understand the CSU field as a whole, including both current achievements and remaining challenges. This paper makes four contributions: (1) For the first time, we present a comprehensive survey of deep learning techniques aimed at CSU, including a taxonomy, task-specific challenges, and ongoing developments. (2) To allow for an authoritative quantification of the state of the art, we offer the largest and latest benchmark for concealed object segmentation (COS). (3) To evaluate the generalizability of deep CSU in practical scenarios, we collect the largest concealed defect segmentation dataset, termed CDS2K, with hard cases from diverse industrial scenarios, and construct a comprehensive benchmark on it. (4) We discuss open problems and potential research directions for CSU. Our code and datasets are available at https://github.com/DengPingFan/CSU and will be updated continuously to track and summarize advances in this rapidly evolving field.
