SUPER: Selfie Undistortion and Head Pose Editing with Identity Preservation
Polina Karpikova, Andrei Spiridonov, Anna Vorontsova, Anastasia Yaschenko, Ekaterina Radionova, Igor Medvedev, Alexander Limonov
TL;DR
This work addresses perspective distortion and head‑pose misalignment in close‑up selfies by introducing SUPER, a hybrid pipeline that jointly optimizes a 3D GAN latent code $w$ and camera parameters $c$ via 3D GAN inversion. It uses depth‑based 3D warping to render a novel view and a visibility‑based blending strategy to seamlessly combine warped texture with GAN‑generated content, thereby preserving identity. A depth‑induced mesh and an encoder‑driven initialization (via TriPlaneNet and Deep3DFaceRecon) enable fast, stable optimization with a final EG3D render producing both an image and a depth map. Experiments on CMDP and the authors’ HeRo dataset demonstrate state‑of‑the‑art performance in both face undistortion and head pose editing, enabling photorealistic selfie editing with improved detail and identity preservation.
Abstract
Self-portraits captured from a short distance might look unnatural or even unattractive due to heavy distortions making facial features malformed, and ill-placed head poses. In this paper, we propose SUPER, a novel method of eliminating distortions and adjusting head pose in a close-up face crop. We perform 3D GAN inversion for a facial image by optimizing camera parameters and face latent code, which gives a generated image. Besides, we estimate depth from the obtained latent code, create a depth-induced 3D mesh, and render it with updated camera parameters to obtain a warped portrait. Finally, we apply the visibility-based blending so that visible regions are reprojected, and occluded parts are restored with a generative model. Experiments on face undistortion benchmarks and on our self-collected Head Rotation dataset (HeRo), show that SUPER outperforms previous approaches both qualitatively and quantitatively, opening new possibilities for photorealistic selfie editing.
