Event-based Camera Simulation using Monte Carlo Path Tracing with Adaptive Denoising
Yuta Tsuji, Tatsuya Yatagawa, Hiroyuki Kubo, Shigeo Morishima
TL;DR
This work addresses rendering event-based video from noisy frames produced by physics-based Monte Carlo path tracing, where direct frame-by-frame denoising is prohibitively expensive. The authors propose a thresholded weighted local regression that detects brightness changes and triggers events only where likely, using a residual-based criterion with $C^2$ threshold, yielding robust performance on noisy data. They demonstrate that the method achieves comparable or better event-detection quality to a heavier WLR+ESIM baseline while reducing the number of regression evaluations to about $28\%$ of pixels and reducing runtime (e.g., from ~4.5 to ~3 minutes) on 240-frame sequences. This enables efficient simulation of event-based vision in physically-based rendering pipelines and can be extended with non-linear regression or deeper denoising for further improvements.
Abstract
This paper presents an algorithm to obtain an event-based video from noisy frames given by physics-based Monte Carlo path tracing over a synthetic 3D scene. Given the nature of dynamic vision sensor (DVS), rendering event-based video can be viewed as a process of detecting the changes from noisy brightness values. We extend a denoising method based on a weighted local regression (WLR) to detect the brightness changes rather than applying denoising to every pixel. Specifically, we derive a threshold to determine the likelihood of event occurrence and reduce the number of times to perform the regression. Our method is robust to noisy video frames obtained from a few path-traced samples. Despite its efficiency, our method performs comparably to or even better than an approach that exhaustively denoises every frame.
