A Hybrid Neural Network-Finite Element Method for the Viscous-Plastic Sea-Ice Model
Nils Margenberg, Carolin Mehlmann
TL;DR
The paper tackles the computational bottleneck of solving the viscous-plastic sea-ice VP model by coupling a coarse finite element discretization with patch-local neural network corrections learned from high-resolution data. The Hybrid NN-FEM framework enriches the coarse solution in a refined auxiliary space, applying localized, batched neural corrections after a Newton-Krylov solve to accelerate convergence and better capture linear kinematic features without a full fine-grid solve. Extensive experiments show that, on an 8 km grid, the method achieves accuracy and deformation statistics close to a 4 km reference, with Newton iterations and runtime substantially reduced and neural overhead remaining negligible. The work demonstrates robust, scalable, and locality-driven learning-based enhancements for multiscale climate simulations, with ablations highlighting the importance of both state- and residual-information in the network inputs. Overall, NN-FEM offers a practical path to faster, high-fidelity sea-ice modeling on coarse grids, enabling more efficient climate simulations while preserving key deformation patterns.
Abstract
We present an efficient hybrid Neural Network-Finite Element Method (NN-FEM) for solving the viscous-plastic (VP) sea-ice model. The VP model is widely used in climate simulations to represent large-scale sea-ice dynamics. However, the strong nonlinearity introduced by the material law makes VP solvers computationally expensive, with the cost per degree of freedom increasing rapidly under mesh refinement. High spatial resolution is particularly required to capture narrow deformation bands known as linear kinematic features in viscous-plastic models. To improve computational efficiency in simulating such fine-scale deformation features, we propose to enrich coarse-mesh finite element approximations with fine-scale corrections predicted by neural networks trained with high-resolution simulations. The neural network operates locally on small patches of grid elements, which is efficient due to its relatively small size and parallel applicability across grid patches. An advantage of this local approach is that it generalizes well to different right-hand sides and computational domains, since the network operates on small subregions rather than learning details tied to a specific choice of boundary conditions, forcing, or geometry. The numerical examples quantify the runtime and evaluate the error for this hybrid approach with respect to the simulation of sea-ice deformations. Applying the learned network correction enables coarser-grid simulations to achieve qualitatively similar accuracy at approximately 11 times lower computational cost relative to the high-resolution reference simulations. Moreover, the learned correction accelerates the Newton solver by up to 10% compared to runs without the correction at the same mesh resolution.
