Table of Contents
Fetching ...

Planning Safety Trajectories with Dual-Phase, Physics-Informed, and Transportation Knowledge-Driven Large Language Models

Rui Gan, Pei Li, Keke Long, Bocheng An, Junwei You, Keshu Wu, Bin Ran

TL;DR

LetsPi addresses safe trajectory planning under uncertainty by integrating LLM-based reasoning with a physics-informed social force model. The dual-phase architecture separates deep reasoning (Memory Collection) from real-time planning (Fast Inference), leveraging a memory bank and scenario matching to ensure safety and efficiency. Physics-informed prompting, surrogate safety measures, and a reflection module reduce hallucinations and improve interpretability. Experiments on the HighD highway dataset show LetsPi outperforms baselines on safety metrics while maintaining real-time capabilities.

Abstract

Foundation models have demonstrated strong reasoning and generalization capabilities in driving-related tasks, including scene understanding, planning, and control. However, they still face challenges in hallucinations, uncertainty, and long inference latency. While existing foundation models have general knowledge of avoiding collisions, they often lack transportation-specific safety knowledge. To overcome these limitations, we introduce LetsPi, a physics-informed, dual-phase, knowledge-driven framework for safe, human-like trajectory planning. To prevent hallucinations and minimize uncertainty, this hybrid framework integrates Large Language Model (LLM) reasoning with physics-informed social force dynamics. LetsPi leverages the LLM to analyze driving scenes and historical information, providing appropriate parameters and target destinations (goals) for the social force model, which then generates the future trajectory. Moreover, the dual-phase architecture balances reasoning and computational efficiency through its Memory Collection phase and Fast Inference phase. The Memory Collection phase leverages the physics-informed LLM to process and refine planning results through reasoning, reflection, and memory modules, storing safe, high-quality driving experiences in a memory bank. Surrogate safety measures and physics-informed prompt techniques are introduced to enhance the LLM's knowledge of transportation safety and physical force, respectively. The Fast Inference phase extracts similar driving experiences as few-shot examples for new scenarios, while simplifying input-output requirements to enable rapid trajectory planning without compromising safety. Extensive experiments using the HighD dataset demonstrate that LetsPi outperforms baseline models across five safety metrics.See PDF for project Github link.

Planning Safety Trajectories with Dual-Phase, Physics-Informed, and Transportation Knowledge-Driven Large Language Models

TL;DR

LetsPi addresses safe trajectory planning under uncertainty by integrating LLM-based reasoning with a physics-informed social force model. The dual-phase architecture separates deep reasoning (Memory Collection) from real-time planning (Fast Inference), leveraging a memory bank and scenario matching to ensure safety and efficiency. Physics-informed prompting, surrogate safety measures, and a reflection module reduce hallucinations and improve interpretability. Experiments on the HighD highway dataset show LetsPi outperforms baselines on safety metrics while maintaining real-time capabilities.

Abstract

Foundation models have demonstrated strong reasoning and generalization capabilities in driving-related tasks, including scene understanding, planning, and control. However, they still face challenges in hallucinations, uncertainty, and long inference latency. While existing foundation models have general knowledge of avoiding collisions, they often lack transportation-specific safety knowledge. To overcome these limitations, we introduce LetsPi, a physics-informed, dual-phase, knowledge-driven framework for safe, human-like trajectory planning. To prevent hallucinations and minimize uncertainty, this hybrid framework integrates Large Language Model (LLM) reasoning with physics-informed social force dynamics. LetsPi leverages the LLM to analyze driving scenes and historical information, providing appropriate parameters and target destinations (goals) for the social force model, which then generates the future trajectory. Moreover, the dual-phase architecture balances reasoning and computational efficiency through its Memory Collection phase and Fast Inference phase. The Memory Collection phase leverages the physics-informed LLM to process and refine planning results through reasoning, reflection, and memory modules, storing safe, high-quality driving experiences in a memory bank. Surrogate safety measures and physics-informed prompt techniques are introduced to enhance the LLM's knowledge of transportation safety and physical force, respectively. The Fast Inference phase extracts similar driving experiences as few-shot examples for new scenarios, while simplifying input-output requirements to enable rapid trajectory planning without compromising safety. Extensive experiments using the HighD dataset demonstrate that LetsPi outperforms baseline models across five safety metrics.See PDF for project Github link.

Paper Structure

This paper contains 21 sections, 11 equations, 5 figures, 5 tables.

Figures (5)

  • Figure 1: LetsPi, a dual-phase, physics-informed LLM architecture for safe trajectory planning. The memory collection phase builds the knowledge database through in-depth reasoning and reflection using multiple safety metrics. The fast inference phase distills knowledge from the memory database, enabling timely trajectory planning.
  • Figure 2: a) Physics-informed Prompt Architecture. Essential components include base prompt construction, social force integration, scenario description, and output instruction. Two optional prompts are used for memory enhancement and refinement. b) Fast Prompt Architecture. Without reflection and safety refinement, Fast Prompt comprises simplified base prompt construction, social force integration, scenario description components and minimalistic output requirement with dedicated examples from memory.
  • Figure 3: Reflection Module workflow.
  • Figure 4: The visualization of a two-lane driving scenario. Elements include a trajectory diagram, a time-space diagram, a safety analysis report, and the response from the LLM. The trajectory diagram visualizes the past, predicted, and ground-truth trajectories of the ego-vehicle and neighboring vehicles. The time-space visualizes the relative position between the ego-vehicle and neighboring vehicles. The safety analysis report and reflection response provide detailed information on how the LLM refines its decisions based on safety criteria.
  • Figure 5: The visualization of a three-lane driving scenario. Elements include a trajectory diagram, a time-space diagram, a safety analysis report, and the response from the LLM. The trajectory diagram visualizes the past, predicted, and ground-truth trajectories of the ego-vehicle and neighboring vehicles. The time-space visualizes the relative position between the ego-vehicle and neighboring vehicles. The safety analysis report and reflection response provide detailed information on how the LLM refines its decisions based on safety criteria.