VGG-Tex: A Vivid Geometry-Guided Facial Texture Estimation Model for High Fidelity Monocular 3D Face Reconstruction
Haoyu Wu, Ziqiao Peng, Xukun Zhou, Yunfei Cheng, Jun He, Hongyan Liu, Zhaoxin Fan
TL;DR
This work tackles the long-standing gap in monocular 3D face reconstruction where texture quality lags behind geometric accuracy. It introduces VGG-Tex, a geometry-guided texture estimation framework that leverages FLAME-based priors through a dual-branch network (FAEM and CGTG), complemented by a Visibility-Enhanced Texture Completion module and a Texture-Guided Geometry Refinement training stage. The approach yields high-fidelity UV textures and competitive geometry on standard benchmarks, outperforming prior texture-focused methods while maintaining robust geometry reconstruction. These advances enable more realistic renderings for applications in AR/VR, animation, and telepresence by producing more faithful facial textures under varied poses and occlusions.
Abstract
3D face reconstruction from monocular images has promoted the development of various applications such as augmented reality. Though existing methods have made remarkable progress, most of them emphasize geometric reconstruction, while overlooking the importance of texture prediction. To address this issue, we propose VGG-Tex, a novel Vivid Geometry-Guided Facial Texture Estimation model designed for High Fidelity Monocular 3D Face Reconstruction. The core of this approach is leveraging 3D parametric priors to enhance the outcomes of 2D UV texture estimation. Specifically, VGG-Tex includes a Facial Attributes Encoding Module, a Geometry-Guided Texture Generator, and a Visibility-Enhanced Texture Completion Module. These components are responsible for extracting parametric priors, generating initial textures, and refining texture details, respectively. Based on the geometry-texture complementarity principle, VGG-Tex also introduces a Texture-guided Geometry Refinement Module to further balance the overall fidelity of the reconstructed 3D faces, along with corresponding losses. Comprehensive experiments demonstrate that our method significantly improves texture reconstruction performance compared to existing state-of-the-art methods.
