Visual Validation versus Visual Estimation: A Study on the Average Value in Scatterplots
Daniel Braun, Ashley Suh, Remco Chang, Michael Gleicher, Tatiana von Landesberger
TL;DR
This study probes how people visually validate a simple statistical model (the mean) in scatterplots and how this compares to visual estimation. Through two between-subject online studies (volunteers and crowdsourced participants), the authors find that validation is less accurate than estimation but both are unbiased, and that the threshold for acceptance aligns with the boundary of the $95\%$ confidence interval. The work demonstrates that viewers’ perceived confidence intervals reflect a conventional statistical standard and offers a baseline for understanding visual validation, with implications for visualization design and further research across data representations. Overall, the findings reveal that people are consistent in validation yet conservative in deviation tolerance, informing how visualizations should present or precompute model fits. The results motivate extending the analysis beyond averages to broader models and settings in data visualization and decision support.
Abstract
We investigate the ability of individuals to visually validate statistical models in terms of their fit to the data. While visual model estimation has been studied extensively, visual model validation remains under-investigated. It is unknown how well people are able to visually validate models, and how their performance compares to visual and computational estimation. As a starting point, we conducted a study across two populations (crowdsourced and volunteers). Participants had to both visually estimate (i.e, draw) and visually validate (i.e., accept or reject) the frequently studied model of averages. Across both populations, the level of accuracy of the models that were considered valid was lower than the accuracy of the estimated models. We find that participants' validation and estimation were unbiased. Moreover, their natural critical point between accepting and rejecting a given mean value is close to the boundary of its 95% confidence interval, indicating that the visually perceived confidence interval corresponds to a common statistical standard. Our work contributes to the understanding of visual model validation and opens new research opportunities.
