RealLiFe: Real-Time Light Field Reconstruction via Hierarchical Sparse Gradient Descent
Yijie Deng, Lei Han, Tianpeng Lin, Lin Li, Jinzhi Zhang, Lu Fang
TL;DR
RealLiFe addresses the challenge of real-time light-field reconstruction from sparse views for XR displays. It introduces Hierarchical Sparse Gradient Descent (HSGD), a coarse-to-fine optimization that sparsifies MPI gradients to focus computation on the most informative planes, coupled with an occlusion-aware refinement. The method achieves MPI generation at around 35 FPS and novel-view rendering at about 700 FPS, offering roughly 100x speedups over offline approaches while maintaining or surpassing online-method visual quality (≈2 dB PSNR improvement on several datasets). The combination of plane-sweep-based initial MPI generation, sparse-gradient refinement, and an occlusion-aware module enables robust, real-time light-field reconstruction suitable for naked-eye 3D displays and XR pipelines.
Abstract
With the rise of Extended Reality (XR) technology, there is a growing need for real-time light field generation from sparse view inputs. Existing methods can be classified into offline techniques, which can generate high-quality novel views but at the cost of long inference/training time, and online methods, which either lack generalizability or produce unsatisfactory results. However, we have observed that the intrinsic sparse manifold of Multi-plane Images (MPI) enables a significant acceleration of light field generation while maintaining rendering quality. Based on this insight, we introduce EffLiFe, a novel light field optimization method, which leverages the proposed Hierarchical Sparse Gradient Descent (HSGD) to produce high-quality light fields from sparse view images in real time. Technically, the coarse MPI of a scene is first generated using a 3D CNN, and it is further sparsely optimized by focusing only on important MPI gradients in a few iterations. Nevertheless, relying solely on optimization can lead to artifacts at occlusion boundaries. Therefore, we propose an occlusion-aware iterative refinement module that removes visual artifacts in occluded regions by iteratively filtering the input. Extensive experiments demonstrate that our method achieves comparable visual quality while being 100x faster on average than state-of-the-art offline methods and delivering better performance (about 2 dB higher in PSNR) compared to other online approaches.
