Decision-making and control with diffractive optical networks
Jumin Qiu, Shuyuan Xiao, Lujun Huang, Andrey Miroshnichenko, Dejian Zhang, Tingting Liu, Tianbao Yu
TL;DR
The paper tackles decision-making and control directly from high-dimensional sensory inputs using diffractive optical networks (DONs). It introduces a residual phase-profile DON trained with deep reinforcement learning from self-play, where a policy $π(a|s)$ is learned and then transferred to the optical hardware through backpropagation in the forward diffraction model $F(X)$, with a residual shortcut $F(αX)+(1-α)X$. Validated on Tic-Tac-Toe, Super Mario Bros., and Car Racing, including an experimental Tic-Tac-Toe demonstration with a DMD-SLM system, showing good agreement with simulations. The results suggest a promising all-optical AI path for real-time control in autonomous driving, robotics, and manufacturing, with metasurface implementations proposed for high-density integration.
Abstract
The ultimate goal of artificial intelligence is to mimic the human brain to perform decision-making and control directly from high-dimensional sensory input. Diffractive optical networks provide a promising solution for implementing artificial intelligence with high-speed and low-power consumption. Most of the reported diffractive optical networks focus on single or multiple tasks that do not involve environmental interaction, such as object recognition and image classification. In contrast, the networks capable of performing decision-making and control have not yet been developed to our knowledge. Here, we propose using deep reinforcement learning to implement diffractive optical networks that imitate human-level decision-making and control capability. Such networks taking advantage of a residual architecture, allow for finding optimal control policies through interaction with the environment and can be readily implemented with existing optical devices. The superior performance of these networks is verified by engaging three types of classic games, Tic-Tac-Toe, Super Mario Bros., and Car Racing. Finally, we present an experimental demonstration of playing Tic-Tac-Toe by leveraging diffractive optical networks based on a spatial light modulator. Our work represents a solid step forward in advancing diffractive optical networks, which promises a fundamental shift from the target-driven control of a pre-designed state for simple recognition or classification tasks to the high-level sensory capability of artificial intelligence. It may find exciting applications in autonomous driving, intelligent robots, and intelligent manufacturing.
