AirNeRF: 3D Reconstruction of Human with Drone and NeRF for Future Communication Systems
Alexey Kotcov, Maria Dronova, Vladislav Cheremnykh, Sausar Karaf, Dzmitry Tsetserukou
TL;DR
AirNeRF tackles the challenge of fast, autonomous 3D human avatar reconstruction for VR and telepresence by combining drone-based data capture, NeRF-based view synthesis, semantic segmentation, and automatic rigging to produce lightweight, rigged avatars. The pipeline leverages Nerfacto for efficient NeRF training, achieving about 6 minutes of training on 300 images and generating FBX avatars around 3 MB with 22 joints and 36k–51k triangles, suitable for modern AR/VR engines. The approach demonstrates data-efficient, photogrammetry-free 3D human reconstruction in unconstrained settings and points toward future enhancements with Gaussian Splatting for even richer geometry representations. This work enables accessible, high-fidelity digital humans for VR/AR, Metaverse, and industrial applications such as BIM and telepresence.
Abstract
In the rapidly evolving landscape of digital content creation, the demand for fast, convenient, and autonomous methods of crafting detailed 3D reconstructions of humans has grown significantly. Addressing this pressing need, our AirNeRF system presents an innovative pathway to the creation of a realistic 3D human avatar. Our approach leverages Neural Radiance Fields (NeRF) with an automated drone-based video capturing method. The acquired data provides a swift and precise way to create high-quality human body reconstructions following several stages of our system. The rigged mesh derived from our system proves to be an excellent foundation for free-view synthesis of dynamic humans, particularly well-suited for the immersive experiences within gaming and virtual reality.
