Distributionally Robust Safe Screening
Hiroyuki Hanada, Satoshi Akahane, Tatsuya Aoyama, Tomonari Tanaka, Yoshito Okura, Yu Inatsu, Noriaki Hashimoto, Taro Murayama, Lee Hanju, Shinya Kojima, Ichiro Takeuchi
TL;DR
This work tackles the problem of identifying redundant samples and features in supervised learning under covariate shift with unknown test distributions. It introduces Distributionally Robust Safe Screening (DRSS), a framework that extends safe screening to weighted empirical risk minimization where weights may change within a predefined set, and provides tight guarantees via duality-gap bounds. The approach yields DRSS rules for both samples (DRSsS) and features (DRSfS), and develops concrete algorithms for typical ML setups (e.g., L1/L2-regularized SVMs) including a method to maximize convex quadratic forms over a hyperball. It also extends to deep learning by applying screening to the last layer, and validates the method through numerical experiments on synthetic and LIBSVM datasets as well as a DL example, demonstrating robust screening under weight perturbations. Overall, DRSS offers a practical tool to reduce computation and storage while maintaining performance under distributional uncertainty, with broad applicability from classical convex models to deep learning settings.
Abstract
In this study, we propose a method Distributionally Robust Safe Screening (DRSS), for identifying unnecessary samples and features within a DR covariate shift setting. This method effectively combines DR learning, a paradigm aimed at enhancing model robustness against variations in data distribution, with safe screening (SS), a sparse optimization technique designed to identify irrelevant samples and features prior to model training. The core concept of the DRSS method involves reformulating the DR covariate-shift problem as a weighted empirical risk minimization problem, where the weights are subject to uncertainty within a predetermined range. By extending the SS technique to accommodate this weight uncertainty, the DRSS method is capable of reliably identifying unnecessary samples and features under any future distribution within a specified range. We provide a theoretical guarantee of the DRSS method and validate its performance through numerical experiments on both synthetic and real-world datasets.
