Prompt-Driven Domain Adaptation for End-to-End Autonomous Driving via In-Context RL

Aleesha Khurram; Amir Moeini; Shangtong Zhang; Rohan Chandra

Prompt-Driven Domain Adaptation for End-to-End Autonomous Driving via In-Context RL

Aleesha Khurram, Amir Moeini, Shangtong Zhang, Rohan Chandra

TL;DR

The paper tackles distribution shifts in end-to-end autonomous driving under adverse weather and dense traffic. It introduces In-Context Reinforcement Learning (ICRL), a prompt-driven, inference-time adaptation framework embedded in a LimSim++-based simulation stack, enabling a driving policy trained in clear weather to adapt without parameter updates. Experimental results in CARLATown05 and Town06 show ICRL outperforms perception- and planning-based prompt baselines in safety, efficiency, and comfort, particularly as weather becomes more inclement or traffic denser, with a focus on safe lane changes and robust junction handling. The work highlights the potential of ICRL as a general, data-efficient layer for few-shot domain adaptation in safety-critical robotics, while outlining theoretical questions and future work toward broader applicability and formal guarantees.

Abstract

Despite significant progress and advances in autonomous driving, many end-to-end systems still struggle with domain adaptation (DA), such as transferring a policy trained under clear weather to adverse weather conditions. Typical DA strategies in the literature include collecting additional data in the target domain or re-training the model, or both. Both these strategies quickly become impractical as we increase scale and complexity of driving. These limitations have encouraged investigation into few-shot and zero-shot prompt-driven DA at inference time involving LLMs and VLMs. These methods work by adding a few state-action trajectories during inference to the prompt (similar to in-context learning). However, there are two limitations of such an approach: $(i)$ prompt-driven DA methods are currently restricted to perception tasks such as detection and segmentation and $(ii)$ they require expert few-shot data. In this work, we present a new approach to inference-time few-shot prompt-driven DA for closed-loop autonomous driving in adverse weather condition using in-context reinforcement learning (ICRL). Similar to other prompt-driven DA methods, our approach does not require any updates to the model parameters nor does it require additional data collection in adversarial weather regime. Furthermore, our approach advances the state-of-the-art in prompt-driven DA by extending to closed driving using general trajectories observed during inference. Our experiments using the CARLA simulator show that ICRL results in safer, more efficient, and more comfortable driving policies in the target domain compared to state-of-the-art prompt-driven DA baselines.

Prompt-Driven Domain Adaptation for End-to-End Autonomous Driving via In-Context RL

TL;DR

Abstract

Prompt-Driven Domain Adaptation for End-to-End Autonomous Driving via In-Context RL

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)