TraveLLM: Could you plan my new public transit route in face of a network disruption?
Bowen Fang, Zixiao Yang, Xuan Di
TL;DR
TraveLLM tackles disruption-aware public transit routing by leveraging LLMs to interpret natural language queries and multimodal map data to generate practical travel plans under dynamic disruptions. The authors propose a two-stage architecture (planner and summary) with prompt engineering, and evaluate GPT-4, Claude 3, and Gemini across a benchmark of disruption scenarios, measuring with $M_{conn}$, $M_{avoid}$, $M_{time}$, and $M_{transfers}$. Results show GPT-4 achieves strong connectivity and avoidance alongside competitive timing, with map imagery and a separate summary agent further improving plan quality and output reliability ($M_{format}$). The work demonstrates a viable path toward more flexible, user-specific navigation systems that integrate real-time disruption information and multimodal data, laying a foundation for future enhancements in visual reasoning and real-world deployment.
Abstract
Existing navigation systems often fail during urban disruptions, struggling to incorporate real-time events and complex user constraints, such as avoiding specific areas. We address this gap with TraveLLM, a system using Large Language Models (LLMs) for disruption-aware public transit routing. We leverage LLMs' reasoning capabilities to directly process multimodal user queries combining natural language requests (origin, destination, preferences, disruption info) with map data (e.g., subway, bus, bike-share). To evaluate this approach, we design challenging test scenarios reflecting real-world disruptions like weather events, emergencies, and dynamic service availability. We benchmark the performance of state-of-the-art LLMs, including GPT-4, Claude 3, and Gemini, on generating accurate travel plans. Our experiments demonstrate that LLMs, notably GPT-4, can effectively generate viable and context-aware navigation plans under these demanding conditions. These findings suggest a promising direction for using LLMs to build more flexible and intelligent navigation systems capable of handling dynamic disruptions and diverse user needs.
