Kantorovich Regression Analysis of Random Distributions with Mixed Predictors

Kaheon Kim; Changbo Zhu

Kantorovich Regression Analysis of Random Distributions with Mixed Predictors

Kaheon Kim, Changbo Zhu

Abstract

We study regression problems with distribution-valued responses and mixed distributional and Euclidean predictors. In quadratic cost, the negative gradient of the Kantorovich potential represents, at each source location, the displacement to its matched location under the optimal transport map. By constructing potentials from the Wasserstein barycenter to individual distributions, the proposed Kantorovich regression model approximates the response displacement field as a sum of predictor displacement fields, each adjusted by a functional parameter. Owing to the linear structure, Euclidean predictors can enter as scaling coefficients of $c$-concave parameter potentials. We characterize functional parameter classes ensuring intrinsicness of the model, establish asymptotic theory through uniform convergence of the empirical Wasserstein loss, and derive Gâteaux derivatives leading to first-order optimization algorithms. Real data applications include a mixed-predictor analysis of housing price distributions and an analysis of two-dimensional temperature distributions, demonstrating the flexibility and interpretability of the proposed framework.

Kantorovich Regression Analysis of Random Distributions with Mixed Predictors

Abstract

-concave parameter potentials. We characterize functional parameter classes ensuring intrinsicness of the model, establish asymptotic theory through uniform convergence of the empirical Wasserstein loss, and derive Gâteaux derivatives leading to first-order optimization algorithms. Real data applications include a mixed-predictor analysis of housing price distributions and an analysis of two-dimensional temperature distributions, demonstrating the flexibility and interpretability of the proposed framework.

Paper Structure (15 sections, 11 theorems, 56 equations, 11 figures)

This paper contains 15 sections, 11 theorems, 56 equations, 11 figures.

Introduction
Preliminary
Optimal transport
Wasserstein barycenter
Kantorovich Regression
Single distributional predictor
Multiple distributional predictors
Mixed predictor
Asymptotic Theory
Computational Algorithms
Numerical Studies
1D synthetic data
1D real data
2D real data
Conclusion

Key Result

Lemma 3.1

Given a $c$-concave function $\phi \in \mathcal{C}^2(\Omega)$ such that $\nabla \phi(x) \neq 0$ for any $x \in \Omega$ and $g\in \mathcal{C}^1(\mathbb{R})$, we let $R(x) := x - g (x) \nabla \phi (x)$. If $R$ is cyclically monotone, then $g$ is constant on each connected component of each level set $

Figures (11)

Figure 1: Illustration of the adjustment on $\phi$ by $f$. Column 1 (bottom row) shows the Kantorovich potential $\phi:[0, 1]^2 \rightarrow \mathbb{R}$ as a heat map, together with the gradient field $\nabla \phi$ drawn as arrows. From column 2 onward, the top row displays the derivatives $f'$ of the functional parameter $f$, while the bottom row shows $f\circledcirc\phi$ as heat maps with the corresponding gradient field $\nabla (f\circledcirc\phi)$ (arrows).
Figure 2: 1D illustration of KR model with non-decreasing concave functional parameter.
Figure 3: 1D illustration of KR with non-increasing convex functional parameter.
Figure 4: 2D illustration of KR with non-decreasing concave functional parameter.
Figure 5: 2D illustration of KR with non-increasing convex functional parameter.
...and 6 more figures

Theorems & Definitions (13)

Lemma 3.1
Theorem 3.2
Corollary 3.3
Remark 3.4
Remark 3.5
Corollary 3.6
Theorem 3.7
Corollary 3.8
Theorem 4.3
Corollary 4.4
...and 3 more

Kantorovich Regression Analysis of Random Distributions with Mixed Predictors

Abstract

Kantorovich Regression Analysis of Random Distributions with Mixed Predictors

Authors

Abstract

Table of Contents

Key Result

Figures (11)

Theorems & Definitions (13)