Fast Training of Sinusoidal Neural Fields via Scaling Initialization
Taesun Yeom, Sangyoon Lee, Jaeho Lee
TL;DR
This work tackles the slow training of sinusoidal neural fields (SNFs) by introducing weight scaling (WS), a simple initialization that multiplies non-final layer weights by a factor $\alpha \ge 1$. Through Fourier/Bessel analyses and empirical neural tangent kernel (eNTK) studies, WS is shown to increase higher-frequency content and yield a better-conditioned optimization path, enabling up to $10\times$ faster convergence across diverse data domains while preserving generalization at moderate $\alpha$. The authors compare WS against standard SNF initialization and multiple baselines, demonstrating robust speedups and practical guidance for selecting $\alpha$ based on workload structure. The results advocate rethinking neural-field initialization as a critical lever for efficiency, with implications for broader NF architectures and activation families. Limitations include the focus on periodic activations and the absence of formal convergence guarantees, pointing to future work on theoretical analysis and extension to other NF paradigms.
Abstract
Neural fields are an emerging paradigm that represent data as continuous functions parameterized by neural networks. Despite many advantages, neural fields often have a high training cost, which prevents a broader adoption. In this paper, we focus on a popular family of neural fields, called sinusoidal neural fields (SNFs), and study how it should be initialized to maximize the training speed. We find that the standard initialization scheme for SNFs -- designed based on the signal propagation principle -- is suboptimal. In particular, we show that by simply multiplying each weight (except for the last layer) by a constant, we can accelerate SNF training by 10$\times$. This method, coined $\textit{weight scaling}$, consistently provides a significant speedup over various data domains, allowing the SNFs to train faster than more recently proposed architectures. To understand why the weight scaling works well, we conduct extensive theoretical and empirical analyses which reveal that the weight scaling not only resolves the spectral bias quite effectively but also enjoys a well-conditioned optimization trajectory.
