Low-power scalable multilayer optoelectronic neural networks enabled with incoherent light
Alexander Song, Sai Nikhilesh Murty Kottapalli, Rahul Goyal, Bernhard Schölkopf, Peer Fischer
TL;DR
This work tackles the data movement and energy bottlenecks of optical neural accelerators by introducing a multilayer incoherent architecture that interleaves optical matrix-vector multiplications with optoelectronic nonlinear activations. Using 2D LED and photodiode arrays connected through local analog electronics and a single amplitude mask per optical interconnect, the system performs multiple MVMs in sequence with low I/O overhead. Experimentally, a three-layer network demonstrates MNIST digit classification near digital simulations (about 91–92% accuracy) and nonlinear spiral data separation with strong performance, while weight transfer from pretrained networks suggests practical applicability to large-scale models. The approach is scalable, energy-efficient, and amenable to deployment as an optical accelerator for inference, with potential TOPS/W gains as system size and speed increase.
Abstract
Optical approaches have made great strides towards the goal of high-speed, energy-efficient computing necessary for modern deep learning and AI applications. Read-in and read-out of data, however, limit the overall performance of existing approaches. This study introduces a multilayer optoelectronic computing framework that alternates between optical and optoelectronic layers to implement matrix-vector multiplications and rectified linear functions, respectively. Our framework is designed for real-time, parallelized operations, leveraging 2D arrays of LEDs and photodetectors connected via independent analog electronics. We experimentally demonstrate this approach using a system with a three-layer network with two hidden layers and operate it to recognize images from the MNIST database with a recognition accuracy of 92% and classify classes from a nonlinear spiral data with 86% accuracy. By implementing multiple layers of a deep neural network simultaneously, our approach significantly reduces the number of read-ins and read-outs required and paves the way for scalable optical accelerators requiring ultra low energy.
