Data-driven Learning of Interaction Laws in Multispecies Particle Systems with Gaussian Processes: Convergence Theory and Applications
Jinchao Feng, Charles Kulick, Sui Tang
TL;DR
The paper develops a data-driven Gaussian process framework to learn interaction kernels in multispecies particle systems, extending prior single-species theory to two-type agent dynamics with intra- and inter-species kernels $\{\phi^{pq}\}$. It establishes a rigorous statistical theory, including identifiability via a coercivity condition, a kernel-Ridge regression interpretation, and non-asymptotic error bounds for reconstruction and uncertainty quantification. The approach is validated through numerical experiments on two-species aggregation and predator–prey models, showing accurate kernel recovery and reliable trajectory predictions, with data efficiency and transferability to larger systems. This work advances multiscale modeling by providing a principled, uncertainty-aware method to infer microscopic interaction laws from macroscopic trajectory data and connects Bayesian GP learning with inverse-problem theory.
Abstract
We develop a Gaussian process framework for learning interaction kernels in multi-species interacting particle systems from trajectory data. Such systems provide a canonical setting for multiscale modeling, where simple microscopic interaction rules generate complex macroscopic behaviors. While our earlier work established a Gaussian process approach and convergence theory for single-species systems, and later extended to second-order models with alignment and energy-type interactions, the multi-species setting introduces new challenges: heterogeneous populations interact both within and across species, the number of unknown kernels grows, and asymmetric interactions such as predator-prey dynamics must be accommodated. We formulate the learning problem in a nonparametric Bayesian setting and establish rigorous statistical guarantees. Our analysis shows recoverability of the interaction kernels, provides quantitative error bounds, and proves statistical optimality of posterior estimators, thereby unifying and generalizing previous single-species theory. Numerical experiments confirm the theoretical predictions and demonstrate the effectiveness of the proposed approach, highlighting its advantages over existing kernel-based methods. This work contributes a complete statistical framework for data-driven inference of interaction laws in multi-species systems, advancing the broader multiscale modeling program of connecting microscopic particle dynamics with emergent macroscopic behavior.
