Supporting Business Document Workflows via Collection-Centric Information Foraging with Large Language Models
Raymond Fok, Nedim Lipka, Tong Sun, Alexa Siu
TL;DR
Marco tackles the burden of information foraging over large business document collections by introducing a collection-centric, mixed-initiative workspace that integrates three views—Document View, Notebook View, Table View—and actions (Search, Ask, Summarize) powered by LLMs. The approach reduces cognitive load and speeds sensemaking, validated by a controlled usability study (n=16) showing 16% faster task completion with Marco and reduced effort, without sacrificing accuracy; a design probe with domain experts (n=7) highlights real-world workflow benefits and trust considerations. The system architecture preprocesses PDFs into a collection context, uses per-document extraction driving a Table View, and employs multi-phase prompting (including GPT-4 for collection-level QA) to synthesize across documents. The work contributes design goals for workplace AI-assisted document sensemaking, demonstrates a practical prototype, and discusses implications for reliability, trust, and future multimodal extensions.
Abstract
Knowledge workers often need to extract and analyze information from a collection of documents to solve complex information tasks in the workplace, e.g., hiring managers reviewing resumes or analysts assessing risk in contracts. However, foraging for relevant information can become tedious and repetitive over many documents and criteria of interest. We introduce Marco, a mixed-initiative workspace supporting sensemaking over diverse business document collections. Through collection-centric assistance, Marco reduces the cognitive costs of extracting and structuring information, allowing users to prioritize comparative synthesis and decision making processes. Users interactively communicate their information needs to an AI assistant using natural language and compose schemas that provide an overview of a document collection. Findings from a usability study (n=16) demonstrate that when using Marco, users complete sensemaking tasks 16% more quickly, with less effort, and without diminishing accuracy. A design probe with seven domain experts identifies how Marco can benefit various real-world workflows.
