Table-as-Search: Formulate Long-Horizon Agentic Information Seeking as Table Completion
Tian Lan, Felix Henry, Bin Zhu, Qianghuai Jia, Junyang Ren, Qihang Pu, Haijun Li, Longyue Wang, Zhao Xu, Weihua Luo
TL;DR
TaS introduces Table-as-Search, a structured planning framework that reformulates long-horizon agentic InfoSeeking as Table Completion. By offloading search results to an external database and maintaining a structured table with rows for candidates and columns for constraints and information, TaS enables robust, scalable, and efficient management of complex, multi-hop information tasks. Across Deep, Wide, and DeepWide benchmarks, TaS consistently outperforms unstructured ReAct baselines and even commercial systems, while demonstrating improved robustness, higher precision-recall, and flexible integration of specialized sub-agents. The work highlights the architectural benefits of explicit state tracking and planning, offering a practical path toward industrial-scale, long-horizon information-seeking systems, with public code and datasets released for reproducibility.
Abstract
Current Information Seeking (InfoSeeking) agents struggle to maintain focus and coherence during long-horizon exploration, as tracking search states, including planning procedure and massive search results, within one plain-text context is inherently fragile. To address this, we introduce \textbf{Table-as-Search (TaS)}, a structured planning framework that reformulates the InfoSeeking task as a Table Completion task. TaS maps each query into a structured table schema maintained in an external database, where rows represent search candidates and columns denote constraints or required information. This table precisely manages the search states: filled cells strictly record the history and search results, while empty cells serve as an explicit search plan. Crucially, TaS unifies three distinct InfoSeeking tasks: Deep Search, Wide Search, and the challenging DeepWide Search. Extensive experiments demonstrate that TaS significantly outperforms numerous state-of-the-art baselines across three kinds of benchmarks, including multi-agent framework and commercial systems. Furthermore, our analysis validates the TaS's superior robustness in long-horizon InfoSeeking, alongside its efficiency, scalability and flexibility. Code and datasets are publicly released at https://github.com/AIDC-AI/Marco-Search-Agent.
