Depth Map Denoising Network and Lightweight Fusion Network for Enhanced 3D Face Recognition
Ruizhuo Xu, Ke Wang, Chao Deng, Mei Wang, Xi Chen, Wenhui Huang, Junlan Feng, Weihong Deng
TL;DR
This work tackles the challenge of noisy, low-quality depth maps for 3D face recognition by introducing DMDNet, a depth map denoising network based on the Denoising Implicit Image Function (DIIF). DMDNet incorporates coordinate-aware latent codes and multi-scale decoding to produce cleaner depth faces while preserving identity, and is complemented by LDNFNet, a lightweight fusion network that jointly leverages depth and normal modalities through a multi-branch fusion architecture. Across multiple datasets, DMDNet achieves superior denoising performance and, when combined with LDNFNet, sets new state-of-the-art results on the Lock3DFace database, with demonstrated generalization to diverse depth sensors. The approach advances 3D FR by integrating implicit representations for denoising and a compact, effective fusion mechanism for multimodal cues, offering practical impact for affordable, robust 3D facial recognition applications.
Abstract
With the increasing availability of consumer depth sensors, 3D face recognition (FR) has attracted more and more attention. However, the data acquired by these sensors are often coarse and noisy, making them impractical to use directly. In this paper, we introduce an innovative Depth map denoising network (DMDNet) based on the Denoising Implicit Image Function (DIIF) to reduce noise and enhance the quality of facial depth images for low-quality 3D FR. After generating clean depth faces using DMDNet, we further design a powerful recognition network called Lightweight Depth and Normal Fusion network (LDNFNet), which incorporates a multi-branch fusion block to learn unique and complementary features between different modalities such as depth and normal images. Comprehensive experiments conducted on four distinct low-quality databases demonstrate the effectiveness and robustness of our proposed methods. Furthermore, when combining DMDNet and LDNFNet, we achieve state-of-the-art results on the Lock3DFace database.
