DeepIPC: Deeply Integrated Perception and Control for an Autonomous Vehicle in Real Environments
Oskar Natan, Jun Miura
TL;DR
DeepIPC addresses the challenge of integrating perception and control for autonomous driving in real environments by combining an RGBD-based perception front end with a BEV semantic map and a dual-branch controller that predicts three future waypoints and controls via a PID-MLP fusion. The model is trained with a multi-task imitation learning objective and evaluated both offline and online against baselines, demonstrating superior drivability and multi-task efficiency while maintaining a lean architecture. Key contributions include widening the perception ROI, incorporating wheel-speed inputs, using two route points for robustness, and introducing a robust agent-takeover policy, all evaluated with a novel drivability metric. The work suggests that end-to-end perception-control systems with BEV representations can offer practical improvements for real-world autonomous navigation and lays groundwork for further enhancements with additional sensing like LiDAR.
Abstract
In this work, we introduce DeepIPC, a novel end-to-end model tailored for autonomous driving, which seamlessly integrates perception and control tasks. Unlike traditional models that handle these tasks separately, DeepIPC innovatively combines a perception module, which processes RGBD images for semantic segmentation and generates bird's eye view (BEV) mappings, with a controller module that utilizes these insights along with GNSS and angular speed measurements to accurately predict navigational waypoints. This integration allows DeepIPC to efficiently translate complex environmental data into actionable driving commands. Our comprehensive evaluation demonstrates DeepIPC's superior performance in terms of drivability and multi-task efficiency across diverse real-world scenarios, setting a new benchmark for end-to-end autonomous driving systems with a leaner model architecture. The experimental results underscore DeepIPC's potential to significantly enhance autonomous vehicular navigation, promising a step forward in the development of autonomous driving technologies. For further insights and replication, we will make our code and datasets available at https://github.com/oskarnatan/DeepIPC.
