ATLAS: Constraints-Aware Multi-Agent Collaboration for Real-World Travel Planning
Jihye Choi, Jinsung Yoon, Jiefeng Chen, Somesh Jha, Tomas Pfister
TL;DR
The paper tackles constraint-aware real-world travel planning where explicit, implicit, and evolving constraints challenge grounding in LLM-based systems. It introduces ATLAS, a robust multi-agent framework that decouples constraint construction, constraint-aware answering, and information-gap resolution via a Planner-Checker loop augmented by an adaptive interleaved search, all within a dynamic CSP formalism. Empirical results on TravelPlanner and live-search benchmarks show ATLAS achieving superior final pass rates and substantially reducing hallucinations, outperforming strong baselines across single-turn, multi-turn, and live settings. The work demonstrates the practical viability of constraint-grounded, live-information travel planning and suggests wide applicability to open-world planning tasks beyond sandbox environments.
Abstract
While Large Language Models (LLMs) have shown remarkable advancements in reasoning and tool use, they often fail to generate optimal, grounded solutions under complex constraints. Real-world travel planning exemplifies these challenges, evaluating agents' abilities to handle constraints that are explicit, implicit, and even evolving based on interactions with dynamic environments and user needs. In this paper, we present ATLAS, a general multi-agent framework designed to effectively handle such complex nature of constraints awareness in real-world travel planning tasks. ATLAS introduces a principled approach to address the fundamental challenges of constraint-aware planning through dedicated mechanisms for dynamic constraint management, iterative plan critique, and adaptive interleaved search. ATLAS demonstrates state-of-the-art performance on the TravelPlanner benchmark, improving the final pass rate from 23.3% to 44.4% over its best alternative. More importantly, our work is the first to demonstrate quantitative effectiveness on real-world travel planning tasks with live information search and multi-turn feedback. In this realistic setting, ATLAS showcases its superior overall planning performance, achieving an 84% final pass rate which significantly outperforms baselines including ReAct (59%) and a monolithic agent (27%).
