ParkingE2E: Camera-based End-to-end Parking Network, from Images to Planning
Changze Li, Ziheng Ji, Zhe Chen, Tong Qin, Ming Yang
TL;DR
This work tackles autonomous parking by introducing a camera-based end-to-end neural planner that converts surround-view RGB images into Bird's Eye View features and fuses them with a target slot through a target query. An autoregressive transformer decoder then predicts future waypoints as serialized tokens, which are executed by a cascaded PID controller for lateral and longitudinal motion. Real-vehicle experiments across four garages show the method achieves high parking success and robust performance in varied scenarios, with ablations confirming the effectiveness of BEV fusion and target-query attention over baselines. While promising, the approach acknowledges a gap to rule-based methods and outlines future directions including reinforcement learning, detailed negative sampling, and advanced simulators to improve robustness and generalization.
Abstract
Autonomous parking is a crucial task in the intelligent driving field. Traditional parking algorithms are usually implemented using rule-based schemes. However, these methods are less effective in complex parking scenarios due to the intricate design of the algorithms. In contrast, neural-network-based methods tend to be more intuitive and versatile than the rule-based methods. By collecting a large number of expert parking trajectory data and emulating human strategy via learning-based methods, the parking task can be effectively addressed. In this paper, we employ imitation learning to perform end-to-end planning from RGB images to path planning by imitating human driving trajectories. The proposed end-to-end approach utilizes a target query encoder to fuse images and target features, and a transformer-based decoder to autoregressively predict future waypoints. We conducted extensive experiments in real-world scenarios, and the results demonstrate that the proposed method achieved an average parking success rate of 87.8% across four different real-world garages. Real-vehicle experiments further validate the feasibility and effectiveness of the method proposed in this paper.
