Reinforcement Learning for Photonic Component Design
Donald Witt, Jeff Young, Lukas Chrostowski
TL;DR
The paper presents a fab-in-the-loop reinforcement learning framework that directly optimizes nanophotonic devices by incorporating real fabrication deviations into the learning loop, thereby overcoming the mismatch between simulations and manufactured geometries. A spectral predictor neural network guides a Deep Deterministic Policy Gradient agent to generate 12-parameter, parameterized grating coupler designs across wavelength bins, using measured data from fabricated chips to continually refine predictions. Applied to photonic crystal grating couplers on an air-clad SOI platform, the approach delivers a measured insertion loss of $3.24$ dB per coupler (vs $8.8$ dB for traditional designs) and broadband designs with $<10.2$ dB loss over a $150$ nm range, with most designs meeting stringent loss criteria. The method’s data efficiency, demonstrated by a single fabrication cycle generating 1250 designs and six iterative rounds, and its generalizability to other photonic components indicate a practical pathway to robust, fabrication-aware photonic design.
Abstract
We present a new fab-in-the-loop reinforcement learning algorithm for the design of nano-photonic components that accounts for the imperfections present in nanofabrication processes. As a demonstration of the potential of this technique, we apply it to the design of photonic crystal grating couplers fabricated on an air clad 220 nm silicon on insulator single etch platform. This fab-in-the-loop algorithm improves the insertion loss from 8.8 to 3.24 dB. The widest bandwidth designs produced using our fab-in-the-loop algorithm can cover a 150 nm bandwidth with less than 10.2 dB of loss at their lowest point.
