TDCNet: Transparent Objects Depth Completion with CNN-Transformer Dual-Branch Parallel Network

Xianghui Fan; Chao Ye; Anping Deng; Xiaotian Wu; Mengyang Pan; Hang Yang

TDCNet: Transparent Objects Depth Completion with CNN-Transformer Dual-Branch Parallel Network

Xianghui Fan, Chao Ye, Anping Deng, Xiaotian Wu, Mengyang Pan, Hang Yang

TL;DR

This paper tackles the challenging problem of depth completion for transparent objects, where traditional sensors struggle due to refraction and low texture. It introduces TDCNet, a CNN–Transformer parallel dual-branch encoder–decoder that separately processes the original depth map and RGB-D features, then fuses them via a Multiscale Feature Fusion Module to recover complete depth maps. A novel adaptive loss strategy modulates the influence of the smoothing term during training, improving convergence and reducing gradient conflicts. Experiments on TransCG, ClearGrasp, and Omniverse show state-of-the-art performance and strong cross-dataset generalization, with the approach preserving depth edges while filling missing regions, benefiting downstream robotic manipulation tasks.

Abstract

The sensing and manipulation of transparent objects present a critical challenge in industrial and laboratory robotics. Conventional sensors face challenges in obtaining the full depth of transparent objects due to the refraction and reflection of light on their surfaces and their lack of visible texture. Previous research has attempted to obtain complete depth maps of transparent objects from RGB and damaged depth maps (collected by depth sensor) using deep learning models. However, existing methods fail to fully utilize the original depth map, resulting in limited accuracy for deep completion. To solve this problem, we propose TDCNet, a novel dual-branch CNN-Transformer parallel network for transparent object depth completion. The proposed framework consists of two different branches: one extracts features from partial depth maps, while the other processes RGB-D images. Experimental results demonstrate that our model achieves state-of-the-art performance across multiple public datasets. Our code and the pre-trained model are publicly available at https://github.com/XianghuiFan/TDCNet.

TDCNet: Transparent Objects Depth Completion with CNN-Transformer Dual-Branch Parallel Network

TL;DR

Abstract

TDCNet: Transparent Objects Depth Completion with CNN-Transformer Dual-Branch Parallel Network

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)