Interpretable contour level selection for heat maps for gridded data
Tarn Duong
TL;DR
Density contour levels offer a probabilistically meaningful way to visualize gridded data via heat maps, but standard density contour estimation requires point data. The authors develop a grid-based approximation that computes $p_j = \delta \hat f(\mathbf g_j)$ on a grid $G$ to obtain $\hat R_\tau$ and $\hat f_\tau$, and extend the method to non-density grid functions using $g^+$ and $g^-$. Through synthetic mixtures and real gridded datasets (e.g., Paris population density and Western Australia temperature anomalies), they show that density-contour visualizations provide superior interpretability and robustness compared to naive quantile, equal-length, and natural contours, with lower symmetric-difference errors. The approach is implemented in open-source R packages, enabling wider adoption for confidential or high-volume gridded data visualization and decision support.
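As a rough illustration of the grid-based approximation, the following Python sketch computes the cell probabilities $p_j = \delta \hat f(\mathbf g_j)$ and accumulates them from the highest-density cell downwards until the mass reaches $\tau$. It assumes that $\delta$ is the common grid cell area and that $\hat R_\tau$ is the smallest set of highest-density cells holding probability at least $\tau$; neither convention is spelled out in this summary. The function name `density_contour_level` is hypothetical and is not taken from the authors' R packages.

```python
def density_contour_level(density, cell_area, tau):
    """Sketch of a grid-based density contour level (assumed convention).

    density   : flat list of non-negative density estimates, one per grid cell
    cell_area : common area (or volume) delta of each grid cell
    tau       : target probability in (0, 1]

    Returns (level, region), where region lists the indices of the cells
    whose density is at least the returned level.
    """
    if not 0.0 < tau <= 1.0:
        raise ValueError("tau must lie in (0, 1]")

    # Cell probabilities p_j = delta * f(g_j).
    probs = [cell_area * f for f in density]
    total = sum(probs)
    if total <= 0.0:
        raise ValueError("the grid must carry positive total mass")

    # Visit cells from highest to lowest density.
    order = sorted(range(len(density)), key=lambda j: density[j], reverse=True)

    cumulative = 0.0
    level = density[order[-1]]
    for j in order:
        # Renormalise so that the cell probabilities sum to one.
        cumulative += probs[j] / total
        # Small tolerance guards against floating-point round-off.
        if cumulative >= tau - 1e-12:
            level = density[j]
            break

    # Approximate region: every cell at or above the level (ties included).
    region = [j for j, f in enumerate(density) if f >= level]
    return level, region


# Toy grid of four cells, each of area 0.25, with total mass exactly 1.
grid = [0.5, 2.0, 1.0, 0.5]
for tau in (0.5, 0.75, 0.9):
    print(tau, density_contour_level(grid, 0.25, tau))
```

Because tied cells are all included in the region, its total probability can exceed $\tau$ on coarse grids; a finer grid reduces this overshoot.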
Abstract
Gridded data formats, where the observed multivariate data are aggregated into grid cells, ensure confidentiality and reduce storage requirements, with the trade-off that access to the underlying point data is lost. Heat maps are a highly pertinent visualisation for gridded data, and heat maps with a small number of well-selected contour levels offer improved interpretability over continuous contour levels. There are many possible contour level choices. Amongst them, density contour levels are highly suitable in many cases. Current methods for computing density contour levels require access to the observed point data, so they are not applicable to gridded data. To remedy this, we introduce an approximation of density contour levels for gridded data. We then compare our proposed method to existing contour level selection methods, and conclude that our proposal provides improved interpretability for synthetic and experimental gridded data.
