Modular Primitives for High-Performance Differentiable Rendering
Samuli Laine, Janne Hellsten, Tero Karras, Yeongho Seol, Jaakko Lehtinen, Timo Aila
TL;DR
The paper presents a high-performance, differentiable renderer built around four modular primitives that exploit deferred shading and hardware rasterization to produce accurate gradients at megapixel resolutions. By integrating with automatic differentiation frameworks via tensor-based inputs and OpenGL-CUDA interop, it achieves fast forward and backward passes, while analytic antialiasing and mipmapped texture filtering provide robust gradient signals for texture and geometry optimization. It demonstrates significant speedups over prior differentiable renderers and validates the approach on facial performance capture, achieving strong geometric correspondence in multi-view, high-resolution data. The work offers a practical, extensible framework for inverse rendering and generative modeling that can be extended to richer appearance models and global illumination in future work.
Abstract
We present a modular differentiable renderer design that yields performance superior to previous methods by leveraging existing, highly optimized hardware graphics pipelines. Our design supports all crucial operations in a modern graphics pipeline: rasterizing large numbers of triangles, attribute interpolation, filtered texture lookups, as well as user-programmable shading and geometry processing, all in high resolutions. Our modular primitives allow custom, high-performance graphics pipelines to be built directly within automatic differentiation frameworks such as PyTorch or TensorFlow. As a motivating application, we formulate facial performance capture as an inverse rendering problem and show that it can be solved efficiently using our tools. Our results indicate that this simple and straightforward approach achieves excellent geometric correspondence between rendered results and reference imagery.
