Table of Contents
Fetching ...

Estimates of transport distance in the central limit theorem

Andrei Yu. Zaitsev

TL;DR

The paper strengthens Gaussian-approximation results for sums of independent, bounded $d$-dimensional random vectors by proving a transport-distance bound $\rho(F,Φ)\le c$ with an exponential-cost coupling, extending beyond the classic $W_1$ distance. It introduces and leverages the multivariate class $\mathcal{A}_d(τ)$ (and related $\widetilde{\mathcal{A}}_d(τ)$) tied to cumulant growth and MGFs, and develops a framework based on Cramér transforms and Mills’ ratio to achieve strong Gaussian approximations. The results unify and generalize one-dimensional bounds, connect to Bernstein-type tail control, and discuss potential multivariate extensions, including links to Poisson and infinitely divisible laws. Overall, this work provides a robust transport-geometry perspective on CLT accuracy with sharp, τ-dependent constants.

Abstract

Let $X_1,\ldots,X_n$ be $d$-dimensional independent random vectors bounded with probability one. For simplicity, we assume that they have zero mean values: \begin{equation} \mathbf{P}\{\|X_{j}\|\leτ\}=1,\quad\mathbf{E}\,X_{j}=0,\quad j=1,\ldots, n.\nonumber \end{equation} We study the distribution behavior of the sum $S=X_{1}+\cdots+X_{n}$ as a function of the bounding value $τ$. From the non-uniform Bikelis estimate in the one-dimensional central limit theorem it follows that $$ W_1(F,Φ_σ)\le cτ. $$ with an absolute constant $c$, where $W_1$ is the Kantorovich--Rubinstein--Wasserstein transport distance, $F$ is the distribution of the sum $S$, and $Φ_σ$ is the corresponding normal distribution. The main result of the paper is significantly stronger and more precise. It is claimed that $$ ρ(F,Φ_σ) =\inf\int\exp(|x-y|/cτ)\,dπ(x,y)\le c, $$ where the infimum is taken over all bivariate probability distributions $π$ with marginal distributions $F$ and $Φ_σ$. The result has also been generalized to distributions with sufficiently slowly growing cumulants from the class $\mathcal{A}_{1}(τ)$, introduced in the author's 1986 paper. The possibility of generalizing the result to the multivariate case is discussed.

Estimates of transport distance in the central limit theorem

TL;DR

The paper strengthens Gaussian-approximation results for sums of independent, bounded -dimensional random vectors by proving a transport-distance bound with an exponential-cost coupling, extending beyond the classic distance. It introduces and leverages the multivariate class (and related ) tied to cumulant growth and MGFs, and develops a framework based on Cramér transforms and Mills’ ratio to achieve strong Gaussian approximations. The results unify and generalize one-dimensional bounds, connect to Bernstein-type tail control, and discuss potential multivariate extensions, including links to Poisson and infinitely divisible laws. Overall, this work provides a robust transport-geometry perspective on CLT accuracy with sharp, τ-dependent constants.

Abstract

Let be -dimensional independent random vectors bounded with probability one. For simplicity, we assume that they have zero mean values: \begin{equation} \mathbf{P}\{\|X_{j}\|\leτ\}=1,\quad\mathbf{E}\,X_{j}=0,\quad j=1,\ldots, n.\nonumber \end{equation} We study the distribution behavior of the sum as a function of the bounding value . From the non-uniform Bikelis estimate in the one-dimensional central limit theorem it follows that with an absolute constant , where is the Kantorovich--Rubinstein--Wasserstein transport distance, is the distribution of the sum , and is the corresponding normal distribution. The main result of the paper is significantly stronger and more precise. It is claimed that where the infimum is taken over all bivariate probability distributions with marginal distributions and . The result has also been generalized to distributions with sufficiently slowly growing cumulants from the class , introduced in the author's 1986 paper. The possibility of generalizing the result to the multivariate case is discussed.

Paper Structure

This paper contains 3 sections, 8 theorems, 94 equations.

Key Result

Theorem 1

Let $F=\mathcal{L}(\xi)\in\mathcal{A}_{1}(\tau )$, $\tau>0$, $\mathbf{E}\,\xi =0$. Then there exists an absolute constant $c_1$ such that where $\Phi=\Phi_F$ is the corresponding normal distribution, and the infimum is taken over all two-dimensional probability distributions $\pi$ with marginal distributions $F$ and $\Phi$.

Theorems & Definitions (8)

  • Theorem 1
  • Theorem 2
  • Lemma 1
  • Lemma 2
  • Lemma 3
  • Lemma 4
  • Lemma 5
  • Lemma 6