Learnable Fractional Reaction-Diffusion Dynamics for Under-Display ToF Imaging and Beyond
Xin Qiao, Matteo Poggi, Xing Wei, Pengchao Deng, Yanhui Zhou, Stefano Mattoccia
TL;DR
UD-ToF imaging under TOLED displays suffers from signal attenuation, MPI, and temporal noise. The authors marry neural networks with a time-fractional reaction-diffusion model and a continuous convolution operator in $LFRD^2$, enabling learnable fractional orders and memory-aware depth refinement. The approach achieves state-of-the-art results on UD-ToF benchmarks and improves non-UD tasks like ToF denoising and depth super-resolution, with ablations validating the contributions of fractional dynamics and continuous convolution. This physics-informed, interpretable diffusion framework offers a practical, efficient path to high-quality depth in challenging display-integrated imaging scenarios.
Abstract
Under-display ToF imaging aims to achieve accurate depth sensing through a ToF camera placed beneath a screen panel. However, transparent OLED (TOLED) layers introduce severe degradations-such as signal attenuation, multi-path interference (MPI), and temporal noise-that significantly compromise depth quality. To alleviate this drawback, we propose Learnable Fractional Reaction-Diffusion Dynamics (LFRD2), a hybrid framework that combines the expressive power of neural networks with the interpretability of physical modeling. Specifically, we implement a time-fractional reaction-diffusion module that enables iterative depth refinement with dynamically generated differential orders, capturing long-term dependencies. In addition, we introduce an efficient continuous convolution operator via coefficient prediction and repeated differentiation to further improve restoration quality. Experiments on four benchmark datasets demonstrate the effectiveness of our approach. The code is publicly available at https://github.com/wudiqx106/LFRD2.
