Table of Contents
Fetching ...

Extreme-PLS with missing data under weak dependence

Stéphane Girard, Cambyse Pakzad

TL;DR

This work develops Extreme-PLS (EPLS) for dimension reduction in the tails of a response with high-dimensional covariates under missing data and weak temporal dependence. It combines a single-index inverse regression model with a MAR mechanism on the covariates and an $\alpha$-mixing framework to derive consistent, and in some regimes asymptotically Gaussian, estimators for the tail projection direction $\beta$. The paper establishes a rigorous asymptotic theory for the EPLS estimator based on tail-moment estimators, accompanied by extensive simulations across eleven dependence schemes (including ARMA, GARCH, and ESTAR) and a real-data NOAA application showing EPLS effectively recovers tail directions. The results demonstrate robustness to heavy tails, missing data, and serial dependence, highlighting practical applicability to environmental and financial time series where extremes are most informative.

Abstract

This paper develops a theoretical framework for Extreme Partial Least Squares (EPLS) dimension reduction in the presence of missing data and weak temporal dependence. Building upon the recent EPLS methodology for modeling extremal dependence between a response variable and high-dimensional covariates, we extend the approach to more realistic data settings where both serial correlation and missing-ness occur. Specifically, we consider a single-index inverse regression model under heavy-tailed conditions and introduce a Missing-at-Random (MAR) mechanism acting on the covariates, whose probability depends on the extremeness of the response. The asymptotic behavior of the proposed estimator is established within an alpha-mixing framework, leading to consistency results under regularly varying tails. Extensive Monte-Carlo experiments covering eleven dependence schemes (including ARMA, GARCH, and nonlinear ESTAR processes) demonstrate that the method performs robustly across a wide range of heavy-tailed and dependent scenarios, even when substantial portions of data are missing. A real-world application to environmental data further confirms the method's capacity to recover meaningful tail directions.

Extreme-PLS with missing data under weak dependence

TL;DR

This work develops Extreme-PLS (EPLS) for dimension reduction in the tails of a response with high-dimensional covariates under missing data and weak temporal dependence. It combines a single-index inverse regression model with a MAR mechanism on the covariates and an -mixing framework to derive consistent, and in some regimes asymptotically Gaussian, estimators for the tail projection direction . The paper establishes a rigorous asymptotic theory for the EPLS estimator based on tail-moment estimators, accompanied by extensive simulations across eleven dependence schemes (including ARMA, GARCH, and ESTAR) and a real-data NOAA application showing EPLS effectively recovers tail directions. The results demonstrate robustness to heavy tails, missing data, and serial dependence, highlighting practical applicability to environmental and financial time series where extremes are most informative.

Abstract

This paper develops a theoretical framework for Extreme Partial Least Squares (EPLS) dimension reduction in the presence of missing data and weak temporal dependence. Building upon the recent EPLS methodology for modeling extremal dependence between a response variable and high-dimensional covariates, we extend the approach to more realistic data settings where both serial correlation and missing-ness occur. Specifically, we consider a single-index inverse regression model under heavy-tailed conditions and introduce a Missing-at-Random (MAR) mechanism acting on the covariates, whose probability depends on the extremeness of the response. The asymptotic behavior of the proposed estimator is established within an alpha-mixing framework, leading to consistency results under regularly varying tails. Extensive Monte-Carlo experiments covering eleven dependence schemes (including ARMA, GARCH, and nonlinear ESTAR processes) demonstrate that the method performs robustly across a wide range of heavy-tailed and dependent scenarios, even when substantial portions of data are missing. A real-world application to environmental data further confirms the method's capacity to recover meaningful tail directions.

Paper Structure

This paper contains 27 sections, 6 theorems, 132 equations, 14 figures, 1 table.

Key Result

Proposition 1

Suppose that $\mathbb{E}(\|X\|\mathds{1}_{\{Y\geq y\}})<\infty$, $\mathbb{E}(|Y|\mathds{1}_{\{Y\geq y\}})<\infty$ and $\mathbb{E}(\|XY\|\mathds{1}_{\{Y\geq y\}})<\infty$ for all $y>0$. Then, the unique solution of the optimization problem (covar) is given for all $y>0$ by:

Figures (14)

  • Figure 1: Simulation results on the inverse model without serial dependence (i.i.d. case), i.e.,$(\phi_{\rm resp},\theta_{\rm resp})=(0,0)$ and $(\omega_{\rm noise},\alpha_{\rm noise},\beta_{\rm noise})=(1,0,0)$.
  • Figure 2: Simulation results with serial dependence 'classic ARMA-resp. + classic GARCH-noise', i.e.,$(\phi_{\rm resp},\theta_{\rm resp})=(0.8,-0.3)$ and $(\omega_{\rm noise},\alpha_{\rm noise},\beta_{\rm noise})=(0.05,0.1,0.85)$.
  • Figure 3: Simulation results on the inverse model with serial dependence of type 'classic ARMA-resp. + IGARCH-like noise', i.e.,$(\phi_{\rm resp},\theta_{\rm resp})=(0.8,-0.3)$ and $(\omega_{\rm noise},\alpha_{\rm noise},\beta_{\rm noise})=(0.05,0.05,0.94)$.
  • Figure 4: Simulation results on the inverse model with serial dependence of type 'pathological ARMA-resp. + classic GARCH noise', i.e.,$(\phi_{\rm resp},\theta_{\rm resp})= (0.99,-0.98)$ and $(\omega_{\rm noise},\alpha_{\rm noise},\beta_{\rm noise})=(0.05,0.1,0.85)$.
  • Figure 5: Simulation results on the inverse model with serial dependence of type 'pathological ARMA-resp. + IGARCH-like noise', i.e.,$(\phi_{\rm resp},\theta_{\rm resp})= (0.99,-0.98)$ and $(\omega_{\rm noise},\alpha_{\rm noise},\beta_{\rm noise})=(0.05,0.05,0.94)$.
  • ...and 9 more figures

Theorems & Definitions (11)

  • Proposition 1
  • Proposition 2
  • Theorem 1
  • Lemma 1
  • proof : Proof
  • Lemma 2
  • proof : Proof
  • Lemma 3
  • proof : Proof
  • proof : Proof of Proposition \ref{['prop-loi-jointe-alpha-mixing']}
  • ...and 1 more