SAFe-Copilot: Unified Shared Autonomy Framework

Phat Nguyen; Erfan Aasi; Shiva Sreeram; Guy Rosman; Andrew Silva; Sertac Karaman; Daniela Rus

SAFe-Copilot: Unified Shared Autonomy Framework

Phat Nguyen, Erfan Aasi, Shiva Sreeram, Guy Rosman, Andrew Silva, Sertac Karaman, Daniela Rus

TL;DR

The paper tackles the brittleness of autonomous driving in rare or ambiguous scenarios by introducing SAFe-Copilot, a semantic arbitration framework that fuses human input and autonomous plans at a high level using Vision Language Models. It formalizes three modules—Abstraction for high-level plan/state conversion, Uncertainty for detecting unreliable autonomy via an uncertainty score $u_t$, and Reasoning for VLM-based decision making and grounding—whose integration enables proactive fusion or supervisory input depending on confidence. Empirical results in CARLA/Bench2Drive show substantial safety and performance gains: mock-human experiments achieve perfect recall with high accuracy, a human survey reports 92% agreement with arbitration outcomes, and Bench2Drive shows reduced collision rates and improved route completion. Overall, the work demonstrates that semantic, language-based arbitration preserves human intent while leveraging autonomous planning to improve safety and effectiveness in complex driving scenarios.

Abstract

Autonomous driving systems remain brittle in rare, ambiguous, and out-of-distribution scenarios, where human driver succeed through contextual reasoning. Shared autonomy has emerged as a promising approach to mitigate such failures by incorporating human input when autonomy is uncertain. However, most existing methods restrict arbitration to low-level trajectories, which represent only geometric paths and therefore fail to preserve the underlying driving intent. We propose a unified shared autonomy framework that integrates human input and autonomous planners at a higher level of abstraction. Our method leverages Vision Language Models (VLMs) to infer driver intent from multi-modal cues -- such as driver actions and environmental context -- and to synthesize coherent strategies that mediate between human and autonomous control. We first study the framework in a mock-human setting, where it achieves perfect recall alongside high accuracy and precision. A human-subject survey further shows strong alignment, with participants agreeing with arbitration outcomes in 92% of cases. Finally, evaluation on the Bench2Drive benchmark demonstrates a substantial reduction in collision rate and improvement in overall performance compared to pure autonomy. Arbitration at the level of semantic, language-based representations emerges as a design principle for shared autonomy, enabling systems to exercise common-sense reasoning and maintain continuity with human intent.

SAFe-Copilot: Unified Shared Autonomy Framework

TL;DR

Abstract

SAFe-Copilot: Unified Shared Autonomy Framework

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (7)