Input convex neural networks: universal approximation theorem and implementation for isotropic polyconvex hyperelastic energies
Gian-Luca Geuken, Patrick Kurzeja, David Wiedemann, Jörn Mosler
TL;DR
The paper tackles learning isotropic hyperelastic energies under stringent physical and mathematical constraints, notably frame-indifference and polyconvexity, while retaining universal approximation capabilities. It introduces the Convex Signed Singular Value Neural Network (CSSV-NN), which operates on signed singular values and enforces Pi_3-invariance by averaging over permutations, with an ICNN-based convex architecture ensuring polyconvexity. A universal approximation theorem is proved for frame-indifferent, isotropic polyconvex energies, built on convex-ICNN results extended to singular-value representations. Numerical results demonstrate CSSV-NN’s superior accuracy for both polyconvex and non-polyconvex energies and its ability to approximate polyconvex hulls, including regimes with linear energy growth, underscoring its potential as a physics-informed surrogate for data-driven constitutive modeling.
Abstract
This paper presents a novel framework of neural networks for isotropic hyperelasticity that enforces necessary physical and mathematical constraints while simultaneously satisfying the universal approximation theorem. The two key ingredients are an input convex network architecture and a formulation in the elementary polynomials of the signed singular values of the deformation gradient. In line with previously published networks, it can rigorously capture frame-indifference and polyconvexity - as well as further constraints like balance of angular momentum and growth conditions. However and in contrast to previous networks, a universal approximation theorem for the proposed approach is proven. To be more explicit, the proposed network can approximate any frame-indifferent, isotropic polyconvex energy (provided the network is large enough). This is possible by working with a sufficient and necessary criterion for frame-indifferent, isotropic polyconvex functions. Comparative studies with existing approaches identify the advantages of the proposed method, particularly in approximating non-polyconvex energies as well as computing polyconvex hulls.
