Reinforcement learning-based adaptive time-integration for nonsmooth dynamics
David Michael Riley, Alexandros Stathas, Diego Gutiérrez-Oribio, Ioannis Stefanou
TL;DR
The paper addresses the computational cost of integrating nonsmooth dynamical systems by learning adaptive time-stepping policies with reinforcement learning. It uses Truncated Quantile Critics (TQC), an algorithm for continuous action spaces, to select time steps, couples the learned policy with a variational-inequality framework for constraint enforcement, and uses Bathe time-stepping to preserve stability. In three case studies (sliding-mode control, a Chua-like electrical circuit, and a frictional seismic fault), it demonstrates substantial speedups, up to an order of magnitude in some scenarios, while maintaining acceptable accuracy and generalizing across discretizations; transfer learning further improves performance. The work establishes a general, data-driven alternative to heuristic and PI-based time stepping for complex nonsmooth systems, enabling faster parametric studies and potentially real-time simulations.
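To make the idea of a continuous action selecting a time step concrete, the following is a minimal sketch, not the authors' implementation. It assumes the agent emits an action in [-1, 1] that is mapped to a time step on a logarithmic scale, and that the reward trades progress per step against an error estimate. The bounds DT_MIN and DT_MAX, the tolerance, the penalty weight, and both function names are hypothetical and chosen only for illustration.

```python
import math

# Hypothetical bounds on the admissible time step (illustrative values only).
DT_MIN, DT_MAX = 1e-4, 1e-1


def action_to_dt(action: float) -> float:
    """Map a continuous action in [-1, 1] to a time step on a log scale.

    Actions outside [-1, 1] are clipped, so the step always stays
    inside [DT_MIN, DT_MAX].
    """
    a = max(-1.0, min(1.0, action))
    log_min, log_max = math.log(DT_MIN), math.log(DT_MAX)
    return math.exp(log_min + 0.5 * (a + 1.0) * (log_max - log_min))


def reward(dt: float, err: float, tol: float = 1e-3, penalty: float = 10.0) -> float:
    """Assumed reward: favour large steps, penalise error above the tolerance."""
    progress = dt / DT_MAX                  # in (0, 1]; larger steps mean fewer solves
    excess = max(0.0, err / tol - 1.0)      # zero while the error stays within tol
    return progress - penalty * excess
```

In an RL loop, an agent such as TQC would observe the solver state, emit the action, advance the integrator by action_to_dt(action), and receive reward(dt, err) computed from whatever error measure the training setup provides.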
Abstract
Numerical time integration is fundamental to the simulation of initial and boundary value problems. Traditionally, time-integration schemes rely on adaptive time-stepping to balance computational speed against accuracy. Although these adaptive methods are derived from the order of accuracy of the chosen integrator, they also depend on heuristics to determine suitable time steps. In this work, we take an alternative approach based on Reinforcement Learning (RL) to select the time step for any time-integration method, balancing computational speed and accuracy. To explore the potential of our RL-based adaptive time-stepping approach, we choose a challenging model problem involving set-valued frictional instabilities at various spatiotemporal scales. This problem tests the robustness of our strategy on nonsmooth problems, which are a demanding scenario for numerical integration. Specifically, we apply RL to the simulation of a seismic fault with Coulomb friction. Our findings indicate that RL can learn an optimal strategy for time integration, achieving up to a fourfold speed-up. Our RL-based adaptive integrator offers a new approach to time integration for various other problems in mechanics.
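For contrast with the RL approach, the kind of order-based adaptive rule the abstract refers to can be sketched as follows. This is the standard elementary step-size controller found in textbooks, not code from the paper: given a local error estimate err for a method of order p, the next step is scaled by (tol / err)^(1 / (p + 1)). The safety factor and the clamping bounds are the heuristic part; the values used here are common illustrative choices, and the function name is hypothetical.

```python
def propose_step(dt: float, err: float, tol: float, order: int,
                 safety: float = 0.9,
                 min_factor: float = 0.2,
                 max_factor: float = 5.0) -> float:
    """Elementary error-based step-size controller.

    dt     : step that produced the error estimate
    err    : local error estimate for that step (non-negative)
    tol    : target local error
    order  : order of accuracy p of the integrator
    The growth/shrink factor is clamped to [min_factor, max_factor]
    so that a single bad estimate cannot change the step too abruptly.
    """
    if err == 0.0:
        # No measurable error: grow by the maximum allowed factor.
        return dt * max_factor
    factor = safety * (tol / err) ** (1.0 / (order + 1))
    return dt * min(max_factor, max(min_factor, factor))
```

For example, with a first-order method, dt = 0.1, and an error estimate four times smaller than the tolerance, the raw factor is 0.9 * 4^(1/2) = 1.8, so the proposed step is 0.18. The safety factor and clamps are exactly the kind of hand-tuned parameters that a learned policy would replace.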
