Table of Contents
Fetching ...

Thinking While Driving: A Concurrent Framework for Real-Time, LLM-Based Adaptive Routing

Xiaopei Tan, Muyang Fan

TL;DR

<3-5 sentence high-level summary>

Abstract

We present Thinking While Driving, a concurrent routing framework that integrates LLMs into a graph-based traffic environment. Unlike approaches that require agents to stop and deliberate, our system enables LLM-based route planning while agents are moving, significantly reducing intersection wait times. Under high traffic, agents average just 0.75 seconds of decision latency. To coordinate many agents in real-time, we implement a non-blocking asynchronous architecture using Unity coroutines and a dedicated request manager. The environment is a weighted undirected graph with live congestion metrics, updated continuously by the agents to enable shared perception. Our results show LLM-driven agents can dynamically adapt to traffic, reroute around congestion, and exhibit behaviors beyond static pathfinding, all while maintaining real-time performance. This work provides a reproducible framework for future research in adaptive routing and multi-agent cooperation.

Thinking While Driving: A Concurrent Framework for Real-Time, LLM-Based Adaptive Routing

TL;DR

<3-5 sentence high-level summary>

Abstract

We present Thinking While Driving, a concurrent routing framework that integrates LLMs into a graph-based traffic environment. Unlike approaches that require agents to stop and deliberate, our system enables LLM-based route planning while agents are moving, significantly reducing intersection wait times. Under high traffic, agents average just 0.75 seconds of decision latency. To coordinate many agents in real-time, we implement a non-blocking asynchronous architecture using Unity coroutines and a dedicated request manager. The environment is a weighted undirected graph with live congestion metrics, updated continuously by the agents to enable shared perception. Our results show LLM-driven agents can dynamically adapt to traffic, reroute around congestion, and exhibit behaviors beyond static pathfinding, all while maintaining real-time performance. This work provides a reproducible framework for future research in adaptive routing and multi-agent cooperation.

Paper Structure

This paper contains 35 sections, 4 figures, 2 tables.

Figures (4)

  • Figure 1: Overview of the concurrent traffic simulation architecture. Agents interact with the physical environment while asynchronously querying the Cognitive Layer for routing decisions.
  • Figure 2: Comparison of Sequential (Stop-and-Think) vs. Concurrent (Thinking While Driving) reasoning models. The concurrent model overlaps inference time with travel time.
  • Figure 3: Impact of the Thinking While Driving framework on Intersection Wait Times. The concurrent architecture effectively eliminates the reasoning delay.
  • Figure 4: Average Journey Time comparison across Low and High Density scenarios. LLM agents outperform A* in high-density settings due to adaptive rerouting.