Portrait4D: Learning One-Shot 4D Head Avatar Synthesis using Synthetic Data
Yu Deng, Duomin Wang, Xiaohang Ren, Xingyu Chen, Baoyuan Wang
TL;DR
Portrait4D presents a data-driven solution to one-shot 4D head avatar synthesis by decoupling data generation from real-image reconstruction. It first learns GenHead, a part-wise, shape-conditioned 4D head generator trained on monocular images to produce large-scale synthetic multi-view, full-motion data, then trains a transformer-based animatable triplane reconstructor Psi on this synthetic data to reconstruct 4D heads from real images with disentangled learning to improve generalization. The key contributions are the GenHead architecture with a part-wise deformation field and FLAME-based morphing, the synthetic-data-driven 4D head reconstruction pipeline, and the disentangled training strategy that isolates reconstruction from reenactment. Experiments show state-of-the-art fidelity, 3D consistency, and motion control, enabling fast, photorealistic head avatars with foreground-background separation for applications in video, VR, and telepresence. The approach highlights the potential of synthetic supervision to scale 4D head synthesis while acknowledging current limitations and ethical considerations.
Abstract
Existing one-shot 4D head synthesis methods usually learn from monocular videos with the aid of 3DMM reconstruction, yet the latter is evenly challenging which restricts them from reasonable 4D head synthesis. We present a method to learn one-shot 4D head synthesis via large-scale synthetic data. The key is to first learn a part-wise 4D generative model from monocular images via adversarial learning, to synthesize multi-view images of diverse identities and full motions as training data; then leverage a transformer-based animatable triplane reconstructor to learn 4D head reconstruction using the synthetic data. A novel learning strategy is enforced to enhance the generalizability to real images by disentangling the learning process of 3D reconstruction and reenactment. Experiments demonstrate our superiority over the prior art.
