Digital Twin Generation from Visual Data: A Survey
Andrew Melnik, Benjamin Alt, Giang Nguyen, Artur Wilkowski, Maciej Stefańczyk, Qirui Wu, Sinan Harms, Helge Rhodin, Manolis Savva, Michael Beetz
TL;DR
This survey analyzes how to generate immersive indoor Digital Twins from visual data, covering geometry representations (Mesh, CAD, 3DGS) and the end-to-end pipelines from video to 3D reconstructions. It surveys traditional 2D-to-3D workflows (COLMAP SfM, SLAM) and modern 3DGS-based methods, including single-image and sparse-view strategies, with emphasis on handling input variety (monocular, non-calibrated, fisheye, LiDAR/RGB-D) and diffusion-guided domain-free scene generation. A core thread is the temporal dimension, comparing implicit neural, voxel-based, and planar-factorized approaches to model dynamic scenes, alongside regularization and efficiency considerations for robotics applications. The paper also covers lighting, reflections, articulated objects, and physics integration, highlighting scene description formats (URDF, USD) and simulators (MuJoCo, PyBullet, Gazebo, Unreal) for physics-based DTs. Collectively, it identifies practical pathways and research directions for scalable, physically plausible, and semantically rich digital twins driven by visual data, with future opportunities in per-room lighting, diffusion-guided 3D generation, and integrated physics simulation.
Abstract
This survey explores recent developments in generating digital twins from videos. Such digital twins can be used for robotics application, media content creation, or design and construction works. We analyze various approaches, including 3D Gaussian Splatting, generative in-painting, semantic segmentation, and foundation models highlighting their advantages and limitations. Additionally, we discuss challenges such as occlusions, lighting variations, and scalability, as well as potential future research directions. This survey aims to provide a comprehensive overview of state-of-the-art methodologies and their implications for real-world applications. Awesome list: https://github.com/ndrwmlnk/awesome-digital-twins
