Automatic Variance Adjustment for Small Area Estimation
Jon Wakefield, Jitong Jiang, Yunhan Wu
TL;DR
This work tackles the instability and non-existence of variance estimates in small area estimation under DHS-like stratified two-stage sampling in LMICs. It introduces Automatic Variance Adjustment via phantom prior augmentation, extending from simple random sampling to complex survey designs and exponential-family settings, and implements it in surveyPrev to enable automated, design-consistent variance fixes. Through simulations modeled on Zambia DHS data and a real application to wasting in children, the approach improves interval coverage and precision in Admin-2 estimates while enabling reliable ranking and spatial borrowing via BYM2 structures. The method offers a practical, scalable solution for producing trustworthy prevalence maps and domain-level inferences in data-sparse settings, with broad applicability across LMIC health indicators.
Abstract
Small area estimation (SAE) is a common endeavor and is used in a variety of disciplines. In low- and middle-income countries (LMICs), in which household surveys provide the most reliable and timely source of data, SAE is vital for highlighting disparities in health and demographic indicators. Weighted estimators are ideal for inference, but for fine geographical partitions in which there are insufficient data, SAE models are required. The most common approach is Fay-Herriot area-level modeling in which the data requirements are a weighted estimate and an associated variance estimate. The latter can be undefined or unstable when data are sparse and so we propose a principled modification which is based on augmenting the available data with a prior sample from a hypothetical survey. This adjustment is generally available, respects the design and is simple to implement. We examine the empirical properties of the adjustment through simulation and illustrate its use with wasting data from a 2018 Zambian Demographic and Health Survey. The modification is implemented as an automatic remedy in the R package surveyPrev, which provides a comprehensive suite of tools for conducing SAE in LMICs.
