Deepfake for the Good: Generating Avatars through Face-Swapping with Implicit Deepfake Generation
Georgii Stanishevskii, Jakub Steczkiewicz, Tomasz Szczepanik, Sławomir Tadeja, Jacek Tabor, Przemysław Spurek
TL;DR
ImplicitDeepfake presents a practical pipeline to generate plausible 3D avatars from a single image by applying 2D deepfake or diffusion edits to training views and then training either Neural Radiance Fields or Gaussian Splatting. The method leverages NeRF for accurate volumetric rendering and GS for faster, sharper renders, with diffusion-based editing enabling text-conditioned avatar modification. Empirical results show GS often yields crisper visual quality and robustness to viewpoint variations, while NeRF remains capable but can blur under certain 2D inconsistencies; dynamic avatars are demonstrated via NerFace integration. The work enables next-generation avatar creation for virtual environments and gaming, while underscoring important societal and ethical considerations surrounding deepfake technologies.
Abstract
Numerous emerging deep-learning techniques have had a substantial impact on computer graphics. Among the most promising breakthroughs are the rise of Neural Radiance Fields (NeRFs) and Gaussian Splatting (GS). NeRFs encode the object's shape and color in neural network weights using a handful of images with known camera positions to generate novel views. In contrast, GS provides accelerated training and inference without a decrease in rendering quality by encoding the object's characteristics in a collection of Gaussian distributions. These two techniques have found many use cases in spatial computing and other domains. On the other hand, the emergence of deepfake methods has sparked considerable controversy. Deepfakes refers to artificial intelligence-generated videos that closely mimic authentic footage. Using generative models, they can modify facial features, enabling the creation of altered identities or expressions that exhibit a remarkably realistic appearance to a real person. Despite these controversies, deepfake can offer a next-generation solution for avatar creation and gaming when of desirable quality. To that end, we show how to combine all these emerging technologies to obtain a more plausible outcome. Our ImplicitDeepfake uses the classical deepfake algorithm to modify all training images separately and then train NeRF and GS on modified faces. Such simple strategies can produce plausible 3D deepfake-based avatars.
