Narrative Player: Reviving Data Narratives with Visuals
Zekai Shao, Leixian Shen, Haotian Li, Yi Shan, Huamin Qu, Yun Wang, Siming Chen
TL;DR
Narrative Player tackles the challenge of transforming data narratives into engaging, data-driven videos by automatically extracting contextualized data facts from paragraphs and data tables and then mapping these facts to coherent visualization sequences optimized for consistency and context. The system comprises two modules: Narrative Analysis, which uses LLMs and sentence embeddings to parse long narratives into data facts, and Visual Generation, which formulates a context-aware visualization sequence via an optimization objective that balances transition costs, visual focus, and primary visualization activation, and then renders a data video with transitions and audio narration. A formal data fact representation as a 6-tuple ($fact := \{ type, parameters, measures, context, breakdowns, focus \}$) and a global objective $\mathcal{F} = \omega_1\mathcal{T} + \omega_2\mathcal{B} + \omega_3\mathcal{P}$ guide the mapping and sequencing of visuals, with $\mathcal{T}$ encoding transition costs, $\mathcal{B}$ a visual focus bonus, and $\mathcal{P}$ a primary-visual activation term. The approach is evaluated via user studies and expert studies, showing that automatically generated data videos are well-received and comparable to human-made videos, and ablations demonstrate the contributions of both narration analysis and visual generation modules. The work advances data storytelling by enabling end-to-end generation of consistent, contextualized, animated data narratives, with practical implications for reading experience, comprehension, and engagement in data-rich documents.
Abstract
Data-rich documents are commonly found across various fields such as business, finance, and science. However, a general limitation of these documents for reading is their reliance on text to convey data and facts. Visual representation of text aids in providing a satisfactory reading experience in comprehension and engagement. However, existing work emphasizes presenting the insights of local text context, rather than fully conveying data stories within the whole paragraphs and engaging readers. To provide readers with satisfactory data stories, this paper presents Narrative Player, a novel method that automatically revives data narratives with consistent and contextualized visuals. Specifically, it accepts a paragraph and corresponding data table as input and leverages LLMs to characterize the clauses and extract contextualized data facts. Subsequently, the facts are transformed into a coherent visualization sequence with a carefully designed optimization-based approach. Animations are also assigned between adjacent visualizations to enable seamless transitions. Finally, the visualization sequence, transition animations, and audio narration generated by text-to-speech technologies are rendered into a data video. The evaluation results showed that the automatic-generated data videos were well-received by participants and experts for enhancing reading.
