Concentration of Empirical First-Passage Times
Rick Bebon, Aljaz Godec
TL;DR
This work develops a non-asymptotic framework to quantify uncertainty in empirical first-passage times for reversible Markov processes. By leveraging a spectral decomposition of the absorbing generator, it yields concentration inequalities that bound deviations of the empirical mean $\overline{\tau}_n$ from the true mean $\langle\tau\rangle$ for any sample size, enabling model-free confidence intervals via $\mathcal{U}_n^\pm$. It further provides two-sided bounds on extreme deviations, $\langle m_n^\pm \rangle$, to capture multi-time-scale dynamics where the mean is not representative. The results apply to Markov jump networks and diffusion in confining potentials, and offer practical guidance for experimental design through minimal sample-size prescriptions, with extensions to beyond-mean statistics and multiple searchers.
Abstract
First-passage properties are central to the kinetics of target-search processes. Theoretical approaches so far primarily focused on predicting first-passage statistics for a given process or model. In practice, however, one faces the reverse problem of inferring first-passage statistics from, typically sub-sampled, experimental or simulation data. Obtaining trustworthy estimates from under-sampled data and unknown underlying dynamics remains a daunting task, and the assessment of the uncertainty is imperative. In this chapter, we highlight recent progress in understanding and controlling finite-sample effects in empirical first-passage times of reversible Markov processes. Precisely, we present concentration inequalities bounding from above the deviations of the sample mean for any sample size from the true mean first-passage time and construct non-asymptotic confidence intervals. Moreover, we present two-sided bounds on the range of fluctuations, i.e, deviations of the expected maximum and minimum from the mean in any given sample, which control uncertainty even in situations where the mean is a priori not a sufficient statistic.
