Engineering nonlinear activation functions for all-optical neural networks via quantum interference
Ruben Canora, Xinzhe Xu, Ziqi Niu, Hadiseh Alaeian, Shengwang Du
TL;DR
This work tackles the nonlinear activation bottleneck in all-optical neural networks by engineering quantum-interference–based nonlinearities in a three-level atomic medium. The authors develop both lifetime-broadened theory and Doppler-broadened experimental validation in rubidium vapor, achieving sigmoid- and ReLU-like activations at ultralow powers, down to about $17~\mu$W per neuron, with scalability under roughly $20$ W total. They introduce a two-field, two-channel activation scheme enabling true MIMO functionality and demonstrate all-optical gradients suitable for backpropagation. Together, the results point to a practical route for high-speed, energy-efficient optical AI hardware, with potential for on-chip integration and optical training to support networks with millions of neurons.
Abstract
All-optical neural networks (AONNs) promise transformative gains in speed and energy efficiency for artificial intelligence (AI) by leveraging the intrinsic parallelism and wave nature of light. However, their scalability has been fundamentally limited by the high power requirements of conventional nonlinear optical elements. Here, we present a low-power nonlinear activation scheme based on a three-level quantum system driven by dual laser fields. This platform introduces a two-channel nonlinear activation matrix with both self- and cross-nonlinearities, enabling true multi-input, multi-output optical processing. The system supports tunable activation behaviors, including sigmoid and ReLU functions, at ultralow power levels (17 uW per neuron). We validate our approach through theoretical modeling and experimental demonstration in rubidium vapor cells, showing the feasibility of scaling to deep AONNs with millions of neurons operating under 20 W of total optical power. Crucially, we also demonstrate the all-optical generation of gradient-like signals with backpropagation, paving the way for all optical training. These results mark a major advance toward scalable, high-speed, and energy-efficient optical AI hardware.
