PRIMEDrive-CoT: A Precognitive Chain-of-Thought Framework for Uncertainty-Aware Object Interaction in Driving Scene Scenario

Sriram Mandalika; Lalitha V; Athira Nambiar

PRIMEDrive-CoT: A Precognitive Chain-of-Thought Framework for Uncertainty-Aware Object Interaction in Driving Scene Scenario

Sriram Mandalika, Lalitha V, Athira Nambiar

TL;DR

PRIMEDrive-CoT addresses uncertainty in driving scene understanding by integrating Bayesian Graph Neural Networks for object interactions with precognitive chain-of-thought reasoning and Grad-CAM explanations. The framework fuses LiDAR and multi-view RGB data for robust 3D detection, quantifies uncertainty via entropy and orientation deviation, and computes proximity-based risk to prioritize hazards. It also provides interpretable CoT narratives and visual explanations, with a human-in-the-loop mechanism to refine predictions. On the DriveCoT benchmark, PRIMEDrive-CoT outperforms state-of-the-art CoT and risk-aware methods, demonstrating improved robustness in occluded or complex scenarios and enabling safer, more transparent autonomous driving decisions.

Abstract

Driving scene understanding is a critical real-world problem that involves interpreting and associating various elements of a driving environment, such as vehicles, pedestrians, and traffic signals. Despite advancements in autonomous driving, traditional pipelines rely on deterministic models that fail to capture the probabilistic nature and inherent uncertainty of real-world driving. To address this, we propose PRIMEDrive-CoT, a novel uncertainty-aware model for object interaction and Chain-of-Thought (CoT) reasoning in driving scenarios. In particular, our approach combines LiDAR-based 3D object detection with multi-view RGB references to ensure interpretable and reliable scene understanding. Uncertainty and risk assessment, along with object interactions, are modelled using Bayesian Graph Neural Networks (BGNNs) for probabilistic reasoning under ambiguous conditions. Interpretable decisions are facilitated through CoT reasoning, leveraging object dynamics and contextual cues, while Grad-CAM visualizations highlight attention regions. Extensive evaluations on the DriveCoT dataset demonstrate that PRIMEDrive-CoT outperforms state-of-the-art CoT and risk-aware models.

PRIMEDrive-CoT: A Precognitive Chain-of-Thought Framework for Uncertainty-Aware Object Interaction in Driving Scene Scenario

TL;DR

Abstract

PRIMEDrive-CoT: A Precognitive Chain-of-Thought Framework for Uncertainty-Aware Object Interaction in Driving Scene Scenario

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)