Mixtures of ensembles: System separation and identification via optimal transport
Filip Elvander, Isabel Haasler
TL;DR
The paper tackles the challenge of identifying and separating multiple heterogeneous subpopulations from aggregate observations while inferring each subpopulation's dynamics. It introduces an optimal transport-based framework that jointly decomposes the population into $K$ ensembles and identifies their dynamical systems via a bi-convex optimization solved by block coordinate descent, with convergence guarantees. Empirical results show the method attains close-to-oracle performance, maintaining high ensemble classification accuracy even under substantial noise. This approach enables robust inference in domains where only aggregate, not individual, trajectories are observable and can impact fields ranging from crowd dynamics to single-cell biology.
Abstract
Crowd dynamics and many large biological systems can be described as populations of agents or particles, which can only be observed on aggregate population level. Identifying the dynamics of agents is crucial for understanding these large systems. However, the population of agents is typically not homogeneous, and thus the aggregate observations consist of the superposition of multiple ensembles each governed by individual dynamics. In this work, we propose an optimal transport framework to jointly separate the population into several ensembles and identify each ensemble's dynamical system, based on aggregate observations of the population. We propose a bi-convex optimization problem, which we solve using a block coordinate descent with convergence guarantees. In numerical experiments, we demonstrate that the proposed approach exhibits close-to-oracle performance also in noisy settings, yielding accurate estimates of both the ensembles and the parameters governing their dynamics.
