Conditional updates of neural network weights for increased out of training performance
Jan Saynisch-Wagner, Saran Rajendran Sari
TL;DR
The paper tackles the problem of neural networks failing outside their training distribution in climate and geoscience contexts. It introduces a weight-prediction framework that finetunes a parent network on training data, regresses weight anomalies on informative predictors, and extrapolates to application data to generate child networks. Across three climate-inspired use cases—temporal tipping of the AMOC, spatial density estimation, and cross-domain wind-velocity uncertainty—the approach generally improves out-of-distribution performance, with notable gains in tipping scenarios and deep-ocean extrapolation, and meaningful but variable gains in cross-domain uncertainty. The work emphasizes that this extrapolation via weight-regression offers a promising path to making neural networks adaptive to unseen regimes, while also acknowledging stochasticity and stability challenges that warrant further research and ensemble strategies.
Abstract
This study proposes a method to enhance neural network performance when training data and application data are not very similar, e.g., out of distribution problems, as well as pattern and regime shifts. The method consists of three main steps: 1) Retrain the neural network towards reasonable subsets of the training data set and note down the resulting weight anomalies. 2) Choose reasonable predictors and derive a regression between the predictors and the weight anomalies. 3) Extrapolate the weights, and thereby the neural network, to the application data. We show and discuss this method in three use cases from the climate sciences, which include successful temporal, spatial and cross-domain extrapolations of neural networks.
