Vector Quantile Regression on Manifolds
Marco Pegoraro, Sanketh Vedula, Aviv A. Rosenberg, Irene Tallini, Emanuele Rodolà, Alex M. Bronstein
TL;DR
This work extends quantile regression to data on non-Euclidean manifolds by formulating conditional vector quantile functions on manifolds (M-CVQFs) via a dual, OT-based approach. It defines the manifold vector quantile function (M-VQF) as a map $Q_{\boldsymbol{Y}}(\boldsymbol{u})=\exp_{\boldsymbol{u}}[-\nabla_{\boldsymbol{u}}\varphi(\boldsymbol{u})]$ and extends it to conditional settings (M-VQR) by learning $c$-concave potentials parameterized with partially input $c$-concave networks. The method enables conditional sampling, likelihood estimation, and confidence-set construction on spheres and tori, demonstrated with synthetic and real datasets, and achieved scalable performance relative to prior spherical approaches. The approach broadens the applicability of QR to domains where data live on curved spaces, offering a principled and scalable way to capture complex conditional distributions on manifolds with potential impact in fields such as climate science, biology, and structural biology.
Abstract
Quantile regression (QR) is a statistical tool for distribution-free estimation of conditional quantiles of a target variable given explanatory features. QR is limited by the assumption that the target distribution is univariate and defined on an Euclidean domain. Although the notion of quantiles was recently extended to multi-variate distributions, QR for multi-variate distributions on manifolds remains underexplored, even though many important applications inherently involve data distributed on, e.g., spheres (climate and geological phenomena), and tori (dihedral angles in proteins). By leveraging optimal transport theory and c-concave functions, we meaningfully define conditional vector quantile functions of high-dimensional variables on manifolds (M-CVQFs). Our approach allows for quantile estimation, regression, and computation of conditional confidence sets and likelihoods. We demonstrate the approach's efficacy and provide insights regarding the meaning of non-Euclidean quantiles through synthetic and real data experiments.
