Orca: Browsing at Scale Through User-Driven and AI-Facilitated Orchestration Across Malleable Webpages
Peiling Jiang, Haijun Xia
TL;DR
Orca tackles the challenge of information foraging on the modern, distributed Web by redefining the browser as a malleable workspace and webpages as malleable materials. It blends user-driven interaction with AI-facilitated orchestration to enable viewing, navigating, organizing, extracting, operating, and synthesizing across many pages at scale. The paper introduces a spatial Web Canvas, extraction and synthesis mechanisms, and parallel agent-based automation, supported by a modular feedforward prompting system. An initial laboratory evaluation indicates enhanced exploration, flexible task organization, and increased user trust in results, while highlighting potential cognitive load and coordination challenges for multi-agent workflows. Overall, Orca points to a promising direction for future browser designs that tightly integrate AI to augment, rather than replace, human sensemaking across the Web.
Abstract
Web-based activities are fundamentally distributed across webpages. However, conventional browsers with stacks of tabs fail to support operating and synthesizing large volumes of information across pages. While recent AI systems enable fully automated web browsing and information synthesis, they often diminish user agency and hinder contextual understanding. Therefore, we explore how AI could instead augment users' interactions with content across webpages and mitigate cognitive and manual efforts. Through literature on information tasks and web browsing challenges, and an iterative design process, we present a rich set of novel interactions with our prototype web browser, Orca. Leveraging AI, Orca supports user-driven exploration, operation, organization, and synthesis of web content at scale. To enable browsing at scale, webpages are treated as malleable materials that humans and AI can collaboratively manipulate and compose into a malleable, dynamic, and browser-level workspace. Our evaluation revealed an increased "appetite" for information foraging, enhanced user control, and more flexibility in sensemaking across a broader information landscape on the web.
