Total Selfie: Generating Full-Body Selfies
Bowei Chen, Brian Curless, Ira Kemelmacher-Shlizerman, Steven M. Seitz
TL;DR
The paper addresses the challenge of producing full-body selfies from arm-length photographs, which suffer from limited field of view and perspective distortion. It introduces Total Selfie, a diffusion-based framework that generates a full-body image in a target pose from four input selfies (face, upper body, lower body, shoes) and a background, guided by an automatically selected reference pose. The approach comprises a selfie-conditioned inpainting model trained on a synthetic four-selfie-to-full-body dataset, followed by per-capture fine-tuning, face undistortion, target-pose selection, and pose-guided generation with a ControlNet, plus appearance refinement to preserve identity and clothing. Experimental results on twelve individuals across diverse scenes show convincing, high-fidelity outputs with improved realism versus baselines and ablations, demonstrating the method’s practical potential for realistic background composition and pose transfer.
Abstract
We present a method to generate full-body selfies from photographs originally taken at arms length. Because self-captured photos are typically taken close up, they have limited field of view and exaggerated perspective that distorts facial shapes. We instead seek to generate the photo some one else would take of you from a few feet away. Our approach takes as input four selfies of your face and body, a background image, and generates a full-body selfie in a desired target pose. We introduce a novel diffusion-based approach to combine all of this information into high-quality, well-composed photos of you with the desired pose and background.
