Distributed Estimation and Inference for Semi-parametric Binary Response Models
Xi Chen, Wenbo Jing, Weidong Liu, Yichen Zhang
TL;DR
This paper addresses estimation and inference for a semiparametric binary response model in a distributed data setting without assuming a noise distribution, focusing on the maximum score framework which is nondifferentiable and nonregular.It introduces two distributed approaches: Avg-SMSE, a one-shot divide-and-conquer method that smooths the objective to achieve faster rates under relaxed machine-count constraints, and mSMSE, a multiround, iterative smoothing procedure that achieves the optimal nonparametric rate and quadratic convergence with explicit inference procedures.The authors establish asymptotic normality and bias-corrected confidence intervals for the estimators, extend the methods to data heterogeneity (covariate and coefficient shift), and adapt the framework to high-dimensional sparse settings via a Dantzig-selector-like update, supported by extensive simulations.Overall, the smoothing-based divide-and-conquer framework enables scalable, statistically efficient distributed estimation and inference for nondifferentiable semiparametric models, with practical implications for large-scale private or decentralized data analysis.
Abstract
The development of modern technology has enabled data collection of unprecedented size, which poses new challenges to many statistical estimation and inference problems. This paper studies the maximum score estimator of a semi-parametric binary choice model under a distributed computing environment without pre-specifying the noise distribution. An intuitive divide-and-conquer estimator is computationally expensive and restricted by a non-regular constraint on the number of machines, due to the highly non-smooth nature of the objective function. We propose (1) a one-shot divide-and-conquer estimator after smoothing the objective to relax the constraint, and (2) a multi-round estimator to completely remove the constraint via iterative smoothing. We specify an adaptive choice of kernel smoother with a sequentially shrinking bandwidth to achieve the superlinear improvement of the optimization error over the multiple iterations. The improved statistical accuracy per iteration is derived, and a quadratic convergence up to the optimal statistical error rate is established. We further provide two generalizations to handle the heterogeneity of datasets and high-dimensional problems where the parameter of interest is sparse.
