Learning-based Autonomous Oversteer Control and Collision Avoidance

Seokjun Lee; Seung-Hyun Kong

TL;DR

The paper addresses safe autonomous driving under oversteer with simultaneous obstacle avoidance by introducing Q-Compared Soft Actor-Critic (QC-SAC), a Hybrid Learning (HL) algorithm that learns from suboptimal demonstrations while adapting rapidly to new conditions. QC-SAC integrates three core ideas: a Q-Compared Objective (QCO) for selective use of demonstrations, a Q-Network from Demonstration (QNfD) that improves Q estimates with demonstration data, and Selective Demonstration Data Update (SDDU) combined with Focused Experience Replay (FER) to accelerate learning from new successes and keep the data relevant. The authors validate the approach on a novel benchmark inspired by real driver training and report near-optimal policies with a significantly higher success rate than Imitation Learning (IL), Reinforcement Learning (RL), and HL baselines. The work provides a practical end-to-end framework for simultaneous oversteer control and collision avoidance, with the potential to improve safety in real-world autonomous driving under challenging road conditions.
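The idea of using demonstrations selectively can be illustrated with a small sketch. This summary does not give the exact QCO formulation, so the following is only an assumption about how such a rule might look: a demonstration action contributes to the imitation loss only when the critic rates it higher than the policy's own action in the same state. The function names `q_compared_mask` and `selective_imitation_loss`, the `margin` parameter, and the squared-error loss are hypothetical and are not taken from the paper.

```python
from typing import List, Sequence


def q_compared_mask(q_demo: Sequence[float],
                    q_policy: Sequence[float],
                    margin: float = 0.0) -> List[bool]:
    """Mark the demonstration samples whose Q-value exceeds the policy's.

    Assumption: q_demo[i] is Q(s_i, a_demo_i) and q_policy[i] is
    Q(s_i, pi(s_i)) from the same critic. `margin` is a hypothetical knob.
    """
    return [qd > qp + margin for qd, qp in zip(q_demo, q_policy)]


def selective_imitation_loss(demo_actions: Sequence[float],
                             policy_actions: Sequence[float],
                             q_demo: Sequence[float],
                             q_policy: Sequence[float],
                             margin: float = 0.0) -> float:
    """Mean squared imitation error over the selected demonstrations only.

    Demonstrations the critic judges no better than the policy's action
    are ignored, so suboptimal demonstrations do not pull the policy back.
    """
    mask = q_compared_mask(q_demo, q_policy, margin)
    errors = [
        (a_demo - a_pi) ** 2
        for a_demo, a_pi, keep in zip(demo_actions, policy_actions, mask)
        if keep
    ]
    # With no demonstration selected, the imitation term vanishes.
    return sum(errors) / len(errors) if errors else 0.0


# Toy usage: only the first demonstration is rated above the policy.
mask = q_compared_mask([1.0, 0.2, 0.5], [0.5, 0.4, 0.5])
loss = selective_imitation_loss([0.3, -0.1, 0.0], [0.1, 0.2, 0.0],
                                [1.0, 0.2, 0.5], [0.5, 0.4, 0.5])
```

In a full hybrid-learning setup, a term like this would presumably be added to the usual actor objective; how QC-SAC actually combines the two is described in the paper itself, not here.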

Abstract

Oversteer, wherein a vehicle's rear tires lose traction and induce unintentional excessive yaw, poses critical safety challenges. Failing to control oversteer often leads to severe traffic accidents. Although recent autonomous driving efforts have attempted to handle oversteer through stabilizing maneuvers, the majority rely on expert-defined trajectories or assume obstacle-free environments, limiting real-world applicability. This paper introduces a novel end-to-end (E2E) autonomous driving approach that tackles oversteer control and collision avoidance simultaneously. Existing E2E techniques, including Imitation Learning (IL), Reinforcement Learning (RL), and Hybrid Learning (HL), generally require near-optimal demonstrations or extensive experience. Yet even skilled human drivers struggle to provide perfect demonstrations under oversteer, and high transition variance hinders accumulating sufficient data. Hence, we present Q-Compared Soft Actor-Critic (QC-SAC), a new HL algorithm that effectively learns from suboptimal demonstration data and adapts rapidly to new conditions. To evaluate QC-SAC, we introduce a benchmark inspired by real-world driver training: a vehicle encounters sudden oversteer on a slippery surface and must avoid randomly placed obstacles ahead. Experimental results show QC-SAC attains near-optimal driving policies, significantly surpassing state-of-the-art IL, RL, and HL baselines. Our method demonstrates the world's first safe autonomous oversteer control with obstacle avoidance.
