$α$-divergence Improves the Entropy Production Estimation via Machine Learning
Euijoon Kwon, Yongjoo Baek
TL;DR
Problem: estimating trajectory-level entropy production (EP) from stochastic trajectories. Approach: introduce the $α$-NEEP, replacing the KL-based loss of the original NEEP with a variational $α$-divergence loss $L_α$, forming a family parameterized by $α$. Key finding: for $-1 < α ≤ 0$, and especially at $α = -0.5$, the estimator is more robust to strong nonequilibrium driving and sampling noise. Rationale: a simple Gaussian model provides analytic insight showing why $α = -0.5$ minimizes bias in the loss landscape and gradient fluctuations. Impact: broadens applicability of EP estimation in nonequilibrium thermodynamics and guides loss design for trajectory-level stochastic thermodynamic quantities.
Abstract
Recent years have seen a surge of interest in the algorithmic estimation of stochastic entropy production (EP) from trajectory data via machine learning. A crucial element of such algorithms is the identification of a loss function whose minimization guarantees the accurate EP estimation. In this study, we show that there exists a host of loss functions, namely those implementing a variational representation of the $α$-divergence, which can be used for the EP estimation. By fixing $α$ to a value between $-1$ and $0$, the $α$-NEEP (Neural Estimator for Entropy Production) exhibits a much more robust performance against strong nonequilibrium driving or slow dynamics, which adversely affects the existing method based on the Kullback-Leibler divergence ($α= 0$). In particular, the choice of $α= -0.5$ tends to yield the optimal results. To corroborate our findings, we present an exactly solvable simplification of the EP estimation problem, whose loss function landscape and stochastic properties give deeper intuition into the robustness of the $α$-NEEP.
