Table of Contents
Fetching ...

Automated construction of effective potential via algorithmic implicit bias

Xingjie Helen Li, Molei Tao

TL;DR

The paper addresses learning effective potentials at user-selected scales for multiscale objectives by exploiting the implicit bias of large-step gradient descent, which trades small-scale structure for stochasticity. It introduces a self-learning framework that simulates a damped mechanical system at a chosen step size, estimates the unresolved-scale covariance to impose a fluctuation–dissipation balance, and recovers an effective potential $U_k$ suitable for surrogate Hamiltonian or Langevin models. The approach is validated through one- and two-dimensional numerical experiments, including anisotropic noise calibration and multi-well potentials, demonstrating accurate reproduction of equilibrium statistics, mean paths, and auto-correlation functions under the surrogate dynamics. This provides a scalable method to construct reduced-order, scale-aware models with practical impact in accelerating simulations and improving interpretability in multiscale physical and data-driven problems.

Abstract

We introduce a novel approach for decomposing and learning every scale of a given multiscale objective function in $\mathbb{R}^d$, where $d\ge 1$. This approach leverages a recently demonstrated implicit bias of the optimization method of gradient descent by Kong and Tao, which enables the automatic generation of data that nearly follow Gibbs distribution with an effective potential at any desired scale. One application of this automated effective potential modeling is to construct reduced-order models. For instance, a deterministic surrogate Hamiltonian model can be developed to substantially soften the stiffness that bottlenecks the simulation, while maintaining the accuracy of phase portraits at the scale of interest. Similarly, a stochastic surrogate model can be constructed at a desired scale, such that both its equilibrium and out-of-equilibrium behaviors (characterized by auto-correlation function and mean path) align with those of a damped mechanical system with the original multiscale function being its potential. The robustness and efficiency of our proposed approach in multi-dimensional scenarios have been demonstrated through a series of numerical experiments. A by-product of our development is a method for anisotropic noise estimation and calibration. More precisely, Langevin model of stochastic mechanical systems may not have isotropic noise in practice, and we provide a systematic algorithm to quantify its covariance matrix without directly measuring the noise. In this case, the system may not admit closed form expression of its invariant distribution either, but with this tool, we can design friction matrix appropriately to calibrate the system so that its invariant distribution has a closed form expression of Gibbs.

Automated construction of effective potential via algorithmic implicit bias

TL;DR

The paper addresses learning effective potentials at user-selected scales for multiscale objectives by exploiting the implicit bias of large-step gradient descent, which trades small-scale structure for stochasticity. It introduces a self-learning framework that simulates a damped mechanical system at a chosen step size, estimates the unresolved-scale covariance to impose a fluctuation–dissipation balance, and recovers an effective potential suitable for surrogate Hamiltonian or Langevin models. The approach is validated through one- and two-dimensional numerical experiments, including anisotropic noise calibration and multi-well potentials, demonstrating accurate reproduction of equilibrium statistics, mean paths, and auto-correlation functions under the surrogate dynamics. This provides a scalable method to construct reduced-order, scale-aware models with practical impact in accelerating simulations and improving interpretability in multiscale physical and data-driven problems.

Abstract

We introduce a novel approach for decomposing and learning every scale of a given multiscale objective function in , where . This approach leverages a recently demonstrated implicit bias of the optimization method of gradient descent by Kong and Tao, which enables the automatic generation of data that nearly follow Gibbs distribution with an effective potential at any desired scale. One application of this automated effective potential modeling is to construct reduced-order models. For instance, a deterministic surrogate Hamiltonian model can be developed to substantially soften the stiffness that bottlenecks the simulation, while maintaining the accuracy of phase portraits at the scale of interest. Similarly, a stochastic surrogate model can be constructed at a desired scale, such that both its equilibrium and out-of-equilibrium behaviors (characterized by auto-correlation function and mean path) align with those of a damped mechanical system with the original multiscale function being its potential. The robustness and efficiency of our proposed approach in multi-dimensional scenarios have been demonstrated through a series of numerical experiments. A by-product of our development is a method for anisotropic noise estimation and calibration. More precisely, Langevin model of stochastic mechanical systems may not have isotropic noise in practice, and we provide a systematic algorithm to quantify its covariance matrix without directly measuring the noise. In this case, the system may not admit closed form expression of its invariant distribution either, but with this tool, we can design friction matrix appropriately to calibrate the system so that its invariant distribution has a closed form expression of Gibbs.
Paper Structure (23 sections, 1 theorem, 42 equations, 6 figures, 1 algorithm)

This paper contains 23 sections, 1 theorem, 42 equations, 6 figures, 1 algorithm.

Key Result

Theorem 3.1

Consider a stochastic Langevin dynamics Langevin_sys2 with a positive scalar friction constant $\gamma$ and a constant diffusion coefficient matrix $\Sigma$, then $\Sigma$ satisfies where the expectation is taken with respect to the invariant distribution of $(\bm{q},\,\bm{p})$.

Figures (6)

  • Figure 1: The illustration of a one-dimensional potential function $V=V_0(q)+V_1(q)+V_2(q)$ with three scales: $V_0=\frac{q^2}{2}\sim O(1)$, $V_1=0.1\times \sin(\frac{q}{0.1})\sim O(0.1)$ and $V_2=0.01\times \sin(\frac{q}{0.01})\sim O(0.01)$.
  • Figure 6: Zoomed-in plot of $V(q)=\frac{(q-\pi/2)^2}{4}+\sum_{i=1}^{N}\frac{\cos(i^2\, q)}{i^2}$ with different values of $N$.
  • Figure 7: Learning potentials $U_k(q)$ at different scales. The exact potential is $V(q):=\frac{(q-\pi/2)^2}{4}+\sum_{i=1}^{20}\cos(i^2\times q)/i^2$.
  • Figure 8: Fig (a): Comparison on the surrogate Hamiltonian \ref{['surr_Hamiltonian']} via $U_0(q)$ with $\delta = 0.1309$ and via $V(q)$ with $h=1e-3$. Fig (b)-(d): Comparison on the surrogate Langevin \ref{['Surr_Langevin']} via $U_0(q)$ with $\delta = 0.1309$ and via $V(q)$\ref{['cos_pot']} for $N=20$ as well as the truncated version $N=2$ with $h = 1e-3$. The initial distributions for $q\sim \mathcal{U}(-1/2,\,1/2)+0.38$ and $p\sim \mathcal{U}(-1/2,\,1/2)$.
  • Figure 9: Learning results of macroscopic $U_0$. The exact potential is given in \ref{['2d_quadratic']} and the step size is $\delta = 0.05$. Left: the green data is generated using scalar friction $\gamma I$ only; and the red data is generated by the two-stage Algorithm \ref{['Algorithm_summary']}. Right: zoomed-in view of the two-stage learning results. In the legend, ULD denotes the under damping-loaded dynamics \ref{['Newton_sys']}.
  • ...and 1 more figures

Theorems & Definitions (5)

  • Remark 3.1
  • Remark 3.2
  • Theorem 3.1
  • proof
  • Remark 3.3