CalibNet: Dual-branch Cross-modal Calibration for RGB-D Salient Instance Segmentation
Jialun Pei, Tao Jiang, He Tang, Nian Liu, Yueming Jin, Deng-Ping Fan, Pheng-Ann Heng
TL;DR
CalibNet advances RGB-D salient instance segmentation by introducing a dual-branch cross-modal calibration framework that tightly fuses depth and RGB information in both the kernel and mask branches. The Dynamic Interactive Kernel and Weight-Sharing Fusion modules, together with a Depth Similarity Assessment, enable instance-aware kernel generation and robust mask feature calibration, all trained with bipartite matching. The paper also contributes the DSIS dataset, providing a higher-quality, multi-category RGB-D SIS benchmark for generalization studies. Empirical results show state-of-the-art performance on COME15K and DSIS across multiple setups, with real-time inference and strong robustness to depth quality variations, highlighting the practical impact of cross-modal calibration in multi-modal segmentation tasks.
Abstract
We propose a novel approach for RGB-D salient instance segmentation using a dual-branch cross-modal feature calibration architecture called CalibNet. Our method simultaneously calibrates depth and RGB features in the kernel and mask branches to generate instance-aware kernels and mask features. CalibNet consists of three simple modules, a dynamic interactive kernel (DIK) and a weight-sharing fusion (WSF), which work together to generate effective instance-aware kernels and integrate cross-modal features. To improve the quality of depth features, we incorporate a depth similarity assessment (DSA) module prior to DIK and WSF. In addition, we further contribute a new DSIS dataset, which contains 1,940 images with elaborate instance-level annotations. Extensive experiments on three challenging benchmarks show that CalibNet yields a promising result, i.e., 58.0% AP with 320*480 input size on the COME15K-N test set, which significantly surpasses the alternative frameworks. Our code and dataset are available at: https://github.com/PJLallen/CalibNet.
