LiveNeRF: Efficient Face Replacement Through Neural Radiance Fields Integration
Tung Vu, Hai Nguyen, Cong Tran
TL;DR
LiveNeRF proposes a unified, real-time framework that integrates face replacement directly into NeRF-based rendering to synthesize audio-driven talking heads from a single reference image. By combining a tri-plane hash representation, region- and eyeblink-aware conditioning, and an integrated face replacement module, the approach achieves competitive visual fidelity (PSNR ≈ 33 dB, LPIPS ≈ 0.031) at 33 FPS, while enabling zero-shot deployment without subject-specific training. The paper provides a theoretical complexity analysis and extensive empirical results demonstrating real-time performance, cross-subject robustness, and favorable comparisons to diffusion-based and Gaussian Splatting methods. This integration reduces computational overhead and enables practical deployment in live streaming, telepresence, and interactive media, albeit with acknowledged ethical considerations and potential misuse that must be mitigated with provenance verification and detection. Overall, LiveNeRF advances real-time, identity-preserving neural rendering by unifying motion synthesis and photorealistic rendering in a single, efficient framework with practical impact for interactive applications.
Abstract
Face replacement technology enables significant advancements in entertainment, education, and communication applications, including dubbing, virtual avatars, and cross-cultural content adaptation. Our LiveNeRF framework addresses critical limitations of existing methods by achieving real-time performance (33 FPS) with superior visual quality, enabling practical deployment in live streaming, video conferencing, and interactive media. The technology particularly benefits content creators, educators, and individuals with speech impairments through accessible avatar communication. While acknowledging potential misuse in unauthorized deepfake creation, we advocate for responsible deployment with user consent verification and integration with detection systems to ensure positive societal impact while minimizing risks.
