Flexible Imputation of Incomplete Network Data

Ge Sun; Weisheng Zhang

Flexible Imputation of Incomplete Network Data

Ge Sun, Weisheng Zhang

Abstract

Sampled network data are common in empirical research because collecting full network information is costly, but using sampled networks can lead to biased estimates. We propose a nonparametric imputation method for sampled networks and show that empirical analysis based on imputed networks yields consistent parameter estimates. Our approach imputes missing network links by combining a projection onto covariates with a local two-way fixed-effects regression, which avoids parametric assumptions, does not rely on low-rank restrictions, and flexibly accommodates both observed covariates and unobserved heterogeneity. We establish entrywise convergence rates for the imputed matrix and prove the consistency of GMM estimators based on the imputed network. We further derive the convergence rate of the corresponding estimator in the linear-in-means peer-effects model. Simulations show strong performance of our method both in terms of imputation accuracy and in downstream empirical analysis. We illustrate our method with an application to the microfinance network data of Banerjee et al. (2013).

Flexible Imputation of Incomplete Network Data

Abstract

Flexible Imputation of Incomplete Network Data

Abstract

Paper Structure

Table of Contents

Key Result

Figures (4)

Theorems & Definitions (20)