Differentiable Rendering: A Survey
Hiroharu Kato, Deniz Beker, Mihai Morariu, Takahiro Ando, Toru Matsuoka, Wadim Kehl, Adrien Gaidon
TL;DR
This survey maps the landscape of differentiable rendering by categorizing methods according to underlying 3D representations (mesh, voxel, point cloud, implicit), detailing analytical and approximated gradient strategies, and highlighting global illumination approaches. It surveys applications from single-view object and human reconstruction to adversarial scenarios and data labeling, and reviews major libraries (TensorFlow Graphics, Kaolin, PyTorch3D, Mitsuba 2) that enable DR research. The paper also discusses evaluation challenges, open problems, and practical limitations in speed, realism, and integration with learning-based methods, arguing that combining inductive 3D priors with differentiable rendering holds promise for scalable, 3D-aware vision systems. Overall, DR is presented as a rapidly evolving field poised to reduce 3D data requirements while enabling robust 3D understanding from 2D observations, with real-time and embedded deployments on the horizon.
Abstract
Deep neural networks (DNNs) have shown remarkable performance improvements on vision-related tasks such as object detection or image segmentation. Despite their success, they generally lack the understanding of 3D objects which form the image, as it is not always possible to collect 3D information about the scene or to easily annotate it. Differentiable rendering is a novel field which allows the gradients of 3D objects to be calculated and propagated through images. It also reduces the requirement of 3D data collection and annotation, while enabling higher success rate in various applications. This paper reviews existing literature and discusses the current state of differentiable rendering, its applications and open research problems.
