Transfer Operators from Batches of Unpaired Points via Entropic Transport Kernels
Florian Beier, Hancheng Bi, Clément Sarrazin, Bernhard Schmitzer, Gabriele Steidl
TL;DR
This work addresses learning the joint distribution of $(X,Y)$ from batches where within-batch pairings are unknown. It derives a maximum-likelihood permutation functional and an efficient approximate surrogate, and proves Γ-convergence so the true density $p$ is recoverable as the number of batches grows. A nonparametric hypothesis class built from entropic optimal transport kernels enforces probabilistic transfer-density constraints and stabilizes the learning of transfer operators from data, with a discrete problem solved by a modified EMML algorithm that preserves convergence. Numerical experiments on dynamical-systems scenarios and particle colocalization demonstrate the method’s ability to infer smoothed transfer operators and reveal coherent structures from unpaired observations. The approach provides a scalable framework for transferring structural information in systems where pairings are imperfect or unavailable, with potential applications in physics, imaging, and data-driven dynamical analysis.
Abstract
In this paper, we are concerned with estimating the joint probability of random variables $X$ and $Y$, given $N$ independent observation blocks $(\boldsymbol{x}^i,\boldsymbol{y}^i)$, $i=1,\ldots,N$, each of $M$ samples $(\boldsymbol{x}^i,\boldsymbol{y}^i) = \bigl((x^i_j, y^i_{σ^i(j)}) \bigr)_{j=1}^M$, where $σ^i$ denotes an unknown permutation of i.i.d. sampled pairs $(x^i_j,y_j^i)$, $j=1,\ldots,M$. This means that the internal ordering of the $M$ samples within an observation block is not known. We derive a maximum-likelihood inference functional, propose a computationally tractable approximation and analyze their properties. In particular, we prove a $Γ$-convergence result showing that we can recover the true density from empirical approximations as the number $N$ of blocks goes to infinity. Using entropic optimal transport kernels, we model a class of hypothesis spaces of density functions over which the inference functional can be minimized. This hypothesis class is particularly suited for approximate inference of transfer operators from data. We solve the resulting discrete minimization problem by a modification of the EMML algorithm to take addional transition probability constraints into account and prove the convergence of this algorithm. Proof-of-concept examples demonstrate the potential of our method.
