Locally Orderless Images for Optimization in Differentiable Rendering
Ishit Mehta, Manmohan Chandraker, Ravi Ramamoorthi
TL;DR
The paper tackles gradient-sparsity challenges in differentiable rendering by introducing Locally Orderless Images (LOIs), a three-scale image representation built on inner scale $\sigma$, tonal scale $\beta$, and extent scale $\alpha$ to preserve local intensity distributions. It formulates an inverse rendering objective that matches rendered and reference histograms across scales using the Wasserstein distance, enabling robust optimization with standard RGB gradients. The approach is compatible with multiple differentiable renderers (vectorization, path tracing, rasterization) and complements variational optimization, yielding improved parameter recovery in challenging tasks such as shadows, caustics, and high-dimensional scene settings, including real data. LOIs offer a practical, scalable alternative to unreliable image pyramids, enhancing convergence and reliability in inverse rendering applications.
Abstract
Problems in differentiable rendering often involve optimizing scene parameters that cause motion in image space. The gradients for such parameters tend to be sparse, leading to poor convergence. While existing methods address this sparsity through proxy gradients such as topological derivatives or lagrangian derivatives, they make simplifying assumptions about rendering. Multi-resolution image pyramids offer an alternative approach but prove unreliable in practice. We introduce a method that uses locally orderless images, where each pixel maps to a histogram of intensities that preserves local variations in appearance. Using an inverse rendering objective that minimizes histogram distance, our method extends support for sparsely defined image gradients and recovers optimal parameters. We validate our method on various inverse problems using both synthetic and real data.
