Active Measurement: Efficient Estimation at Scale
Max Hamilton, Jinlin Lai, Wenlong Zhao, Subhransu Maji, Daniel Sheldon
TL;DR
Active measurement tackles the challenge of precise scientific counting by integrating AI predictions with iterative human labeling through adaptive importance sampling. The core idea is to maintain an unbiased Monte Carlo estimator of the total measurement while progressively refining the AI predictor and the sampling distribution as labels accrue, with principled weighting, variance estimation, and confidence intervals. The method demonstrably reduces estimation error across diverse domains, including bird counts in radar data and malaria cell counting, and provides calibrated uncertainty better than existing baselines. Practically, this framework enables accurate, scalable measurements with limited labeling effort and robust uncertainty quantification, with potential for broader adoption in remote sensing, microscopy, and ecological monitoring.
Abstract
AI has the potential to transform scientific discovery by analyzing vast datasets with little human effort. However, current workflows often do not provide the accuracy or statistical guarantees that are needed. We introduce active measurement, a human-in-the-loop AI framework for scientific measurement. An AI model is used to predict measurements for individual units, which are then sampled for human labeling using importance sampling. With each new set of human labels, the AI model is improved and an unbiased Monte Carlo estimate of the total measurement is refined. Active measurement can provide precise estimates even with an imperfect AI model, and requires little human effort when the AI model is very accurate. We derive novel estimators, weighting schemes, and confidence intervals, and show that active measurement reduces estimation error compared to alternatives in several measurement tasks.
