Enhancing Sufficient Dimension Reduction via Hellinger Correlation
Seungbeom Hong, Ilmun Kim, Jun Song
TL;DR
This paper addresses sufficient dimension reduction for single-index models by exploiting the Hellinger correlation as a dependence-based objective. By maximizing $H(\alpha^\top X, Y)$ over the unit sphere, it identifies the central subspace $\mathrm{Span}(\eta_0)$ with consistency guarantees, without requiring strong independence assumptions. The approach is implemented via a two-stage optimization that leverages initial values from standard SDR methods and refined by a downhill simplex search, with complexity $O(n^2 p k)$ per iteration. Empirical results across simulated and real data demonstrate substantial improvements over traditional SDR methods and related dependence-based approaches, underscoring the method's robustness to nonlinearity and heavy tails. The work offers a principled, theoretically justified tool for dimension reduction in high-dimensional supervised learning contexts and lays groundwork for extensions to multi-index models and classification.
Abstract
In this work, we develop a new theory and method for sufficient dimension reduction (SDR) in single-index models, where SDR is a sub-field of supervised dimension reduction based on conditional independence. Our work is primarily motivated by the recent introduction of the Hellinger correlation as a dependency measure. Utilizing this measure, we develop a method capable of effectively detecting the dimension reduction subspace, complete with theoretical justification. Through extensive numerical experiments, we demonstrate that our proposed method significantly enhances and outperforms existing SDR methods. This improvement is largely attributed to our proposed method's deeper understanding of data dependencies and the refinement of existing SDR techniques.
