FactFlow: Automatic Fact Sheet Generation and Customization from Tabular Dataset via AI Chain Design & Implementation
Minh Duc Vu, Jieshan Chen, Zhenchang Xing, Qinghua Lu, Xiwei Xu, Qian Fu
TL;DR
This paper tackles the challenge of generating informative fact sheets from tabular data for non-experts by introducing FactFlow, an AI-chain system that assembles five specialized AI workers to produce interconnected facts, visuals, and narrative structure. It defines a taxonomy for configuring AI workers, develops a dataset representation and anonymization pipeline, and provides a user-friendly NL-based customization interface. The approach is validated on real-world datasets and through an 18-participant user study showing superior content quality, visual appeal, and customization usability compared with two baselines. The work demonstrates a practical, scalable framework for automated data storytelling that can be deployed as a web platform to support diverse audiences.
Abstract
With the proliferation of data across various domains, there is a critical demand for tools that enable non-experts to derive meaningful insights without deep data analysis skills. To address this need, existing automatic fact sheet generation tools offer heuristic-based solutions to extract facts and generate stories. However, they grasp the semantics of the data inadequately, and they struggle to generate narratives that fully capture those semantics or to align the fact sheet with specific user needs. To address these shortcomings, this paper introduces FactFlow, a novel tool designed for the automatic generation and customisation of fact sheets. FactFlow applies the concept of collaborative AI workers to transform raw tabular datasets into comprehensive, visually compelling fact sheets. We define an effective taxonomy to profile AI workers for specialised tasks. Furthermore, FactFlow empowers users to refine these fact sheets through intuitive natural language commands, ensuring the final outputs align closely with individual preferences and requirements. Our user evaluation with 18 participants confirms that FactFlow not only surpasses state-of-the-art baselines in automated fact sheet production but also provides a positive user experience during customization tasks.
