EmphasisChecker: A Tool for Guiding Chart and Caption Emphasis
Dae Hyun Kim, Seulgi Choi, Juho Kim, Vidya Setlur, Maneesh Agrawala
TL;DR
This work tackles the problem that readers' takeaways from charts and captions degrade when emphasis is misaligned. It introduces EmphasisChecker, a tool that concurrently highlights visually prominent features in time-series charts and the features emphasized in captions, using a time-series $\\varepsilon$-persistence detector based on the Ramer-Douglas-Peucker algorithm and a text-reference extractor built on Stanford CoreNLP/SUTIME and BERT embeddings. Across real-world data and a dedicated user study, the authors demonstrate that alignment between chart and caption emphasis is common but imperfect, with professional authors showing a 65% match rate and Tableau Public captions being mostly basic. The results indicate EmphasisChecker is useful and easy to use for authoring aligned chart-caption pairs, with potential to guide revisions, reduce misinterpretations, and inform broader extensions to other chart types and accessibility needs.
Abstract
Recent work has shown that when both the chart and caption emphasize the same aspects of the data, readers tend to remember the doubly-emphasized features as takeaways; when there is a mismatch, readers rely on the chart to form takeaways and can miss information in the caption text. Through a survey of 280 chart-caption pairs in real-world sources (e.g., news media, poll reports, government reports, academic articles, and Tableau Public), we find that captions often do not emphasize the same information in practice, which could limit how effectively readers take away the authors' intended messages. Motivated by the survey findings, we present EmphasisChecker, an interactive tool that highlights visually prominent chart features as well as the features emphasized by the caption text along with any mismatches in the emphasis. The tool implements a time-series prominent feature detector based on the Ramer-Douglas-Peucker algorithm and a text reference extractor that identifies time references and data descriptions in the caption and matches them with chart data. This information enables authors to compare features emphasized by these two modalities, quickly see mismatches, and make necessary revisions. A user study confirms that our tool is both useful and easy to use when authoring charts and captions.
