All-Optical Segmentation via Diffractive Neural Networks for Autonomous Driving
Yingjie Li, Daniel Robinson, Cunxi Yu
TL;DR
This work tackles the energy and latency challenges of perception in autonomous driving by introducing all-optical RGB image processing via free-space Diffractive Optical Neural Networks (DONNs). The authors design a three-channel DONN architecture with optical skip connections to perform semantic segmentation and lane detection, and validate the approach on CityScapes, indoor-track lane data, and CARLA simulations, including generalization tests across lighting and maps. Key contributions include a differentiable numerical model for RGB DONNs, a training framework that optimizes phase-modulation weights, and extensive experiments showing competitive segmentation performance (IoU ≈ 0.71 on CityScapes with 12 layers) and robust lane-detection (IoU ≈ 0.80 indoors, with generalization in CARLA). The results demonstrate the practicality and potential of all-optical processing for real-time perception in autonomous driving, while also outlining hardware and binarization challenges that need to be addressed for deployment.
Abstract
Semantic segmentation and lane detection are crucial tasks in autonomous driving systems. Conventional approaches predominantly rely on deep neural networks (DNNs), which incur high energy costs due to extensive analog-to-digital conversions and large-scale image computations required for low-latency, real-time responses. Diffractive optical neural networks (DONNs) have shown promising advantages over conventional DNNs on digital or optoelectronic computing platforms in energy efficiency. By performing all-optical image processing via light diffraction at the speed of light, DONNs save computation energy costs while reducing the overhead associated with analog-to-digital conversions by all-optical encoding and computing. In this work, we propose a novel all-optical computing framework for RGB image segmentation and lane detection in autonomous driving applications. Our experimental results demonstrate the effectiveness of the DONN system for image segmentation on the CityScapes dataset. Additionally, we conduct case studies on lane detection using a customized indoor track dataset and simulated driving scenarios in CARLA, where we further evaluate the model's generalizability under diverse environmental conditions.
