Trust Your Gut: Comparing Human and Machine Inference from Noisy Visualizations
Ratanond Koonchanok, Michael E. Papka, Khairi Reda
TL;DR
The paper investigates how people infer parameters of known data-generating processes from noisy bivariate visualizations and compares human inferences to Bayesian benchmarks. It introduces a graphical elicitation method to externalize priors and posteriors on the parameters $\mu$ and $\sigma$, and tests three visualization types across varying sample sizes and extremeness in two experiments. Results show that humans are generally less accurate than Bayesian agents, but can outperform them in extreme-sample conditions, especially with icon array displays, and are more resilient to spurious data at the cost of higher variability and overconfidence. The findings motivate human-machine collaboration in visual analytics, suggesting tool designs that leverage analyst intuition alongside formal statistical models to improve inference and decision-making.
Abstract
People commonly utilize visualizations not only to examine a given dataset, but also to draw generalizable conclusions about the underlying models or phenomena. Prior research has compared human visual inference to that of an optimal Bayesian agent, with deviations from rational analysis viewed as problematic. However, human reliance on non-normative heuristics may prove advantageous in certain circumstances. We investigate scenarios where human intuition might surpass idealized statistical rationality. In two experiments, we examine individuals' accuracy in characterizing the parameters of known data-generating models from bivariate visualizations. Our findings indicate that, although participants generally exhibited lower accuracy compared to statistical models, they frequently outperformed Bayesian agents, particularly when faced with extreme samples. Participants appeared to rely on their internal models to filter out noisy visualizations, thus improving their resilience against spurious data. However, participants displayed overconfidence and struggled with uncertainty estimation. They also exhibited higher variance than statistical machines. Our findings suggest that analyst gut reactions to visualizations may provide an advantage, even when departing from rationality. These results carry implications for designing visual analytics tools, offering new perspectives on how to integrate statistical models and analyst intuition for improved inference and decision-making. The data and materials for this paper are available at https://osf.io/qmfv6
