Preconditioners for the Stochastic Training of Neural Fields
Shin-Fang Chng, Hemanth Saratchandran, Simon Lucey
TL;DR
Neural fields can be trained more quickly in stochastic settings by applying curvature-aware diagonal preconditioners. The work shows that Adam effectively behaves as a diagonal Gauss-Newton preconditioner, and that activating networks with sine, Gaussian, or wavelet functions enables ESGD and related preconditioners to substantially accelerate training compared to Adam, across image reconstruction, 3D occupancy, and NeRF tasks. Theoretical results connect Hessian-vector structure to activation type, and empirical results confirm activation-dependent benefits, with limitations for ReLU-PE where Adam remains superior. This provides a practical, activation-aware framework for speeding up neural field optimization in large-scale stochastic regimes.
Abstract
Neural fields encode continuous multidimensional signals as neural networks, enabling diverse applications in computer vision, robotics, and geometry. While Adam is effective for stochastic optimization, it often requires long training times. To address this, we explore alternative optimization techniques to accelerate training without sacrificing accuracy. Traditional second-order methods like L-BFGS are unsuitable for stochastic settings. We propose a theoretical framework for training neural fields with curvature-aware diagonal preconditioners, demonstrating their effectiveness across tasks such as image reconstruction, shape modeling, and Neural Radiance Fields (NeRF).
