VisPile: A Visual Analytics System for Analyzing Multiple Text Documents With Large Language Models and Knowledge Graphs
Adam Coscia, Alex Endert
TL;DR
VisPile addresses the challenge of sensemaking over large text corpora by integrating large language models and knowledge graphs into a visual analytics workflow. The system supports open-ended document retrieval, pile-based grouping, and evidence validation by linking LLM outputs with KG facts, enabling analysts to compare generated and ground-truth information. The authors present design goals, an open-source implementation, and formative domain expert feedback from six intelligence professionals, highlighting benefits of LLM-KG synergy and the importance of provenance. Limitations include reliance on a single LLM and a KRONOS dataset; future work includes exploring longer documents, diverse LLMs, and improved KG alignment and trust mechanisms. The work demonstrates a practical path toward faster sensemaking with AI-augmented visual analytics in intelligence contexts.
Abstract
Intelligence analysts perform sensemaking over collections of documents using various visual and analytic techniques to gain insights from large amounts of text. As data scales grow, our work explores how to leverage two AI technologies, large language models (LLMs) and knowledge graphs (KGs), in a visual text analysis tool, enhancing sensemaking and helping analysts keep pace. Collaborating with intelligence community experts, we developed a visual analytics system called VisPile. VisPile integrates an LLM and a KG into various UI functions that assist analysts in grouping documents into piles, performing sensemaking tasks like summarization and relationship mapping on piles, and validating LLM- and KG-generated evidence. Our paper describes the tool, as well as feedback received from six professional intelligence analysts that used VisPile to analyze a text document corpus.
