A new initialisation to Control Gradients in Sinusoidal Neural network

Andrea Combette; Antoine Venaille; Nelly Pustelnik

A new initialisation to Control Gradients in Sinusoidal Neural network

Andrea Combette, Antoine Venaille, Nelly Pustelnik

TL;DR

The paper addresses spectral instability and gradient problems in deep sinusoidal networks used for implicit neural representations. It derives a closed-form initialization by enforcing a fixed-point pre-activation variance and unit gradient flow (sigma_g=1 and sigma_a=0), linking these choices to NTK dynamics and Fourier spectrum behavior. The proposed scheme stabilizes training with depth, reduces spurious high-frequency content, and improves generalization in function fitting, image/video reconstruction, and physics-informed tasks. Overall, the work connects initialization, training dynamics, and spectral properties in sine-activated networks, with broad implications beyond INR contexts.

Abstract

Proper initialisation strategy is of primary importance to mitigate gradient explosion or vanishing when training neural networks. Yet, the impact of initialisation parameters still lacks a precise theoretical understanding for several well-established architectures. Here, we propose a new initialisation for networks with sinusoidal activation functions such as \texttt{SIREN}, focusing on gradients control, their scaling with network depth, their impact on training and on generalization. To achieve this, we identify a closed-form expression for the initialisation of the parameters, differing from the original \texttt{SIREN} scheme. This expression is derived from fixed points obtained through the convergence of pre-activation distribution and the variance of Jacobian sequences. Controlling both gradients and targeting vanishing pre-activation helps preventing the emergence of inappropriate frequencies during estimation, thereby improving generalization. We further show that this initialisation strongly influences training dynamics through the Neural Tangent Kernel framework (NTK). Finally, we benchmark \texttt{SIREN} with the proposed initialisation against the original scheme and other baselines on function fitting and image reconstruction. The new initialisation consistently outperforms state-of-the-art methods across a wide range of reconstruction tasks, including those involving physics-informed neural networks.

A new initialisation to Control Gradients in Sinusoidal Neural network

TL;DR

Abstract

A new initialisation to Control Gradients in Sinusoidal Neural network

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (21)

Theorems & Definitions (15)