Random Weight Factorization Improves the Training of Continuous Neural Representations
Sifan Wang, Hanwen Wang, Jacob H. Seidman, Paris Perdikaris
TL;DR
This work addresses the difficulty of training coordinate-based neural representations due to spectral bias and poor initializations. It introduces random weight factorization, rewriting weights as $\mathbf{w}^{(k,l)} = s^{(k,l)} \mathbf{v}^{(k,l)}$ and $\mathbf{W}^{(l)} = \operatorname{diag}(\mathbf{s}^{(l)}) \mathbf{V}^{(l)}$, which induces per-neuron adaptive learning rates and reshapes the loss landscape to facilitate optimization. The approach yields consistent, robust improvements across image regression, 3D shape representation, CT, inverse rendering, PINNs, and operator learning (e.g., DeepONet), while incurring minimal overhead. This suggests a versatile, drop-in enhancement for continuous neural representations with broad practical impact in vision, graphics, and scientific computing.
Abstract
Continuous neural representations have recently emerged as a powerful and flexible alternative to classical discretized representations of signals. However, training them to capture fine details in multi-scale signals is difficult and computationally expensive. Here we propose random weight factorization as a simple drop-in replacement for parameterizing and initializing conventional linear layers in coordinate-based multi-layer perceptrons (MLPs) that significantly accelerates and improves their training. We show how this factorization alters the underlying loss landscape and effectively enables each neuron in the network to learn using its own self-adaptive learning rate. This not only helps with mitigating spectral bias, but also allows networks to quickly recover from poor initializations and reach better local minima. We demonstrate how random weight factorization can be leveraged to improve the training of neural representations on a variety of tasks, including image regression, shape representation, computed tomography, inverse rendering, solving partial differential equations, and learning operators between function spaces.
