Synergy between 3DMM and 3D Landmarks for Accurate 3D Facial Geometry
Cho-Ying Wu, Qiangeng Xu, Ulrich Neumann
TL;DR
This work addresses the ill-posed problem of recovering complete 3D facial geometry from monocular images by introducing a bidirectional synergy between 3DMM parameters and 3D landmarks. The approach, SynergyNet, uses a two-stage pipeline with MAFA landmark refinement and a landmark-to-3DMM module to establish a representation cycle that alternates between predicting 3DMM parameters from images and regressing 3DMM parameters from refined landmarks. Key contributions include the multi-attribute feature aggregation for landmark refinement, the reverse representation direction, and a self-supervised consistency loss that enhances information flow, yielding state-of-the-art results for facial alignment, face orientation estimation, and 3D face modeling on AFLW and Florence benchmarks. The method relies on simple, fast network blocks to achieve high throughput (≈2600fps for landmarks and ≈2300fps for dense 3D faces) and demonstrates robustness across large pose variations, with supplementary texture synthesis to generate more realistic textures.
Abstract
This work studies learning from a synergy process of 3D Morphable Models (3DMM) and 3D facial landmarks to predict complete 3D facial geometry, including 3D alignment, face orientation, and 3D face modeling. Our synergy process leverages a representation cycle for 3DMM parameters and 3D landmarks. 3D landmarks can be extracted and refined from face meshes built by 3DMM parameters. We next reverse the representation direction and show that predicting 3DMM parameters from sparse 3D landmarks improves the information flow. Together we create a synergy process that utilizes the relation between 3D landmarks and 3DMM parameters, and they collaboratively contribute to better performance. We extensively validate our contribution on full tasks of facial geometry prediction and show our superior and robust performance on these tasks for various scenarios. Particularly, we adopt only simple and widely-used network operations to attain fast and accurate facial geometry prediction. Codes and data: https://choyingw.github.io/works/SynergyNet/
