Transfer Learning for High-dimensional Quantile Regression with Distribution Shift
Ruiqi Bai, Yijiao Zhang, Hanbo Yang, Zhongyi Zhu
TL;DR
This work addresses transferring knowledge to high-dimensional quantile regression in the presence of distribution shift across multiple sources. It introduces a transferable set that jointly accounts for parameter and residual shift, and develops a constrained L1-minimization framework to estimate the target quantile coefficients and source contrasts while mitigating covariate shift. A Neyman orthogonality–based debiased inference procedure leverages informative sources to achieve sqrt($n_{\mathcal C}$)–normality, with variance that reflects residual shift via the objective quantile density. Theoretical non-asymptotic error bounds, a detection-consistent transferable-set screening method, and empirical evidence from simulations and GTEx data demonstrate improved prediction accuracy and sharper inference while avoiding negative transfer under distribution shift.
Abstract
Information from related source studies can often enhance the findings of a target study. However, the distribution shift between target and source studies can severely impact the efficiency of knowledge transfer. In the high-dimensional regression setting, existing transfer approaches mainly focus on the parameter shift. In this paper, we focus on the high-dimensional quantile regression with knowledge transfer under three types of distribution shift: parameter shift, covariate shift, and residual shift. We propose a novel transferable set and a new transfer framework to address the above three discrepancies. Non-asymptotic estimation error bounds and source detection consistency are established to validate the availability and superiority of our method in the presence of distribution shift. Additionally, an orthogonal debiased approach is proposed for statistical inference with knowledge transfer, leading to sharper asymptotic results. Extensive simulation results as well as real data applications further demonstrate the effectiveness of our proposed procedure.
