MEP-Net: Generating Solutions to Scientific Problems with Limited Knowledge by Maximum Entropy Principle
Wuyue Yang, Liangrong Peng, Guojie Li, Liu Hong
TL;DR
MEP-Net tackles the challenge of reconstructing probability distributions from limited information by fusing the Maximum Entropy Principle with neural networks. It trains with a two term objective that combines a constraint loss enforcing moment data with a fixed point entropy surrogate to regularize updates, steering the learned distribution toward the maximum entropy solution $p^*(x) ∝ exp(∑_i λ_i f_i(x)) / Z$. Validated on problems including unimodal and multimodal distributions, high dimensional Gaussian mixtures, time dependent Gaussians, the Schloegl chemical master equation, diffusion in confined domains, and the Allen-Cahn gradient flow, achieving high accuracy and stability. The results show that MEP-Net yields physically meaningful distributions in data scarce regimes and opens paths for inverse problems and hybrid modeling.
Abstract
Maximum entropy principle (MEP) offers an effective and unbiased approach to inferring unknown probability distributions when faced with incomplete information, while neural networks provide the flexibility to learn complex distributions from data. This paper proposes a novel neural network architecture, the MEP-Net, which combines the MEP with neural networks to generate probability distributions from moment constraints. We also provide a comprehensive overview of the fundamentals of the maximum entropy principle, its mathematical formulations, and a rigorous justification for its applicability for non-equilibrium systems based on the large deviations principle. Through fruitful numerical experiments, we demonstrate that the MEP-Net can be particularly useful in modeling the evolution of probability distributions in biochemical reaction networks and in generating complex distributions from data.
