Viscoelasticty with physics-augmented neural networks: Model formulation and training methods without prescribed internal variables
Max Rosenkranz, Karl A. Kalina, Jörg Brummund, WaiChing Sun, Markus Kästner
TL;DR
This work tackles data-driven viscoelastic constitutive modeling by embedding physics into neural networks through generalized standard materials. It develops a dual-potential QQ model with a convex free-energy ψ and a convex-dissipation φ learned via fully input convex (FICNN) and partially input convex (PICNN) networks, ensuring thermodynamic consistency and isotropy. A key contribution is a recurrent-cell training approach (LSTM-based) that generates internal variables on the fly, enabling calibration from stress–strain paths alone and avoiding explicit internal-variable labels. Across invariant-based formulations, it demonstrates superior extrapolation and data efficiency relative to coordinate-based inputs, and shows that an auxiliary RNN offers the best balance of accuracy and scalability for large datasets. The method broadens the applicability of NN-based constitutive modeling to viscoelastic materials and potentially to elastoplastic and multiscale problems, with clear pathways for extending to more complex internal-variable sets or finite-strain regimes.
Abstract
We present an approach for the data-driven modeling of nonlinear viscoelastic materials at small strains which is based on physics-augmented neural networks (NNs) and requires only stress and strain paths for training. The model is built on the concept of generalized standard materials and is therefore thermodynamically consistent by construction. It consists of a free energy and a dissipation potential, which can be either expressed by the components of their tensor arguments or by a suitable set of invariants. The two potentials are described by fully/partially input convex neural networks. For training of the NN model by paths of stress and strain, an efficient and flexible training method based on a recurrent cell, particularly a long short-term memory cell, is developed to automatically generate the internal variable(s) during the training process. The proposed method is benchmarked and thoroughly compared with existing approaches. These include a method that obtains the internal variable by integrating the evolution equation over the entire sequence, while the other method uses an an auxiliary feedforward neural network for the internal variable(s). Databases for training are generated by using a conventional nonlinear viscoelastic reference model, where 3D and 2D plane strain data with either ideal or noisy stresses are generated. The coordinate-based and the invariant-based formulation are compared and the advantages of the latter are demonstrated. Afterwards, the invariant-based model is calibrated by applying the three training methods using ideal or noisy stress data. All methods yield good results, but differ in computation time and usability for large data sets. The presented training method based on a recurrent cell turns out to be particularly robust and widely applicable and thus represents a promising approach for the calibration of other types of models as well.
