Deep Fréchet Regression

Su I Iao; Yidong Zhou; Hans-Georg Müller

Deep Fréchet Regression

Su I Iao, Yidong Zhou, Hans-Georg Müller

TL;DR

This paper develops Deep Fréchet Regression (DFR) for regressing metric-space–valued responses on high-dimensional Euclidean predictors. The framework combines deep neural networks to learn low-dimensional manifold representations with local Fréchet regression to map back to the original metric space, using ISOMAP for manifold estimation and an errors-in-variables treatment. The authors establish convergence rates for the DNN component under dependent sub-Gaussian noise with bias, extend local Fréchet regression to multivariate predictors with predictor errors, and derive the overall rate for the DFR estimator. They demonstrate by simulations and real-data applications (NYC taxi networks and age-at-death distributions) that DFR outperforms existing methods, with robustness to manifold approximation and practical interpretability. The work contributes a versatile, scalable approach for non-Euclidean responses, combining neural and geometric tools with rigorous statistical theory.

Abstract

Advancements in modern science have led to the increasing availability of non-Euclidean data in metric spaces. This paper addresses the challenge of modeling relationships between non-Euclidean responses and multivariate Euclidean predictors. We propose a flexible regression model capable of handling high-dimensional predictors without imposing parametric assumptions. Two primary challenges are addressed: the curse of dimensionality in nonparametric regression and the absence of linear structure in general metric spaces. The former is tackled using deep neural networks, while for the latter we demonstrate the feasibility of mapping the metric space where responses reside to a low-dimensional Euclidean space using manifold learning. We introduce a reverse mapping approach, employing local Fréchet regression, to map the low-dimensional manifold representations back to objects in the original metric space. We develop a theoretical framework, investigating the convergence rate of deep neural networks under dependent sub-Gaussian noise with bias. The convergence rate of the proposed regression model is then obtained by expanding the scope of local Fréchet regression to accommodate multivariate predictors in the presence of errors in predictors. Simulations and case studies show that the proposed model outperforms existing methods for non-Euclidean responses, focusing on the special cases of probability distributions and networks.

Deep Fréchet Regression

TL;DR

Abstract

Paper Structure (26 sections, 14 theorems, 161 equations, 6 figures, 9 tables, 1 algorithm)

This paper contains 26 sections, 14 theorems, 161 equations, 6 figures, 9 tables, 1 algorithm.

Introduction
Preliminaries
Notation
Deep neural networks
Local Fréchet regression
Methodology
Asymptotic Properties
Implementation and Simulations
Implementation details
Simulations for probability distributions
Data Application
Discussion
Covering Number
ISOMAP
Verification of Model Assumptions
...and 11 more sections

Key Result

Theorem 1

Consider the nonparametric regression model eq:our DNN. Under itm:d1--itm:d2, for any $i=1,\ldots,n$ and $j = 1,\ldots,r$, there exists an estimator $\hat{g}_j$ in eq:DNN Ln such that and where $\bm{X}$ is a new realization of the predictor that is independent of the sample $\{(\bm{X}_i, Y_i)\}_{i=1}^n$ and $\kappa_n$ is defined in eq:kappan. It immediately follows that and where $\hat{\bm{Z}}

Figures (6)

Figure 1: Schematic diagram for the deep Fréchet regression $m= v \circ \bm{g}_0$, where $\bm{Z}^0=\bm{\psi}\{E_\oplus(Y|\bm{X})\}$ denotes the low-dimensional representation of the regression function $E_\oplus(Y|\bm{X})$, $v$ the local Fréchet regression and $\bm{g}_0$ deep neural networks.
Figure 2: (a) Two-dimensional representations of probability distributions $\{Y_i\}_{i=1}^n$ using ISOMAP, along with their true parameters (mean and standard deviation) for sample size $n=500$. (b) Boxplot of mean squared prediction errors for $Q=500$ Monte Carlo runs, comparing deep Fréchet regression (DFR), global Fréchet regression (GFR) mull:19:6 and sufficient dimension reduction (SDR) zhan:21:1.
Figure 3: Two-dimensional representations of 1092 taxi networks using MDS with respect to the Frobenius metric.
Figure 4: Predicted networks represented as heatmaps at different levels of total precipitation on Monday to Thursday, Friday or Saturday, and Sunday or holiday. The left, middle, and right columns show, respectively, the predicted networks at $0$, $2$, and $6$ inches of total precipitation. The three rows depict the predicted networks on Monday to Thursday, Friday or Saturday, and Sunday or holiday, respectively. The value in the top right corner of each panel represents the total ridership of the entire transport network.
Figure S1: Boxplot of mean squared prediction errors for $Q=500$ Monte Carlo runs using deep Fréchet regression (DFR), global Fréchet regression (GFR) mull:22:11 and sufficient dimension reduction (SDR) zhan:21:1 for network responses.
...and 1 more figures

Theorems & Definitions (31)

Theorem 1
Example 1
Example 2
Example 3
Proposition 1
Proposition 2
Theorem 2
Definition S1: Covering number
Proposition S1
proof
...and 21 more

Deep Fréchet Regression

TL;DR

Abstract

Deep Fréchet Regression

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (6)

Theorems & Definitions (31)