POST: Photonic Swin Transformer for Automated and Efficient Prediction of PCSEL
Qi Xin, Hai Huang, Chenyu Li, Kewei Shi, Zhaoyu Zhang
TL;DR
This work introduces POST, a Photonic Swin Transformer that acts as a fast and accurate surrogate for Coupled-Wave Theory simulations of Photonic Crystal Surface Emitting Lasers. By processing 32×32 single-channel dielectric patterns through a four-stage Swin Transformer, POST achieves $R^2$ values of $0.909$ for $\log Q$ and $0.779$ for $SE$, while delivering up to about $5{,}000$ predictions per second, enabling rapid exploration of large design spaces including irregular hole geometries and multi-lattice configurations. The training dataset, comprising over 25{,}000 CWT-generated samples, is augmented with rotations, flips, and translations, and can be publicly released to support cross-disciplinary research. SHAP analyses in the Fourier domain reveal that POST leverages physics-prior Fourier components, validating alignment with CWT physics and offering guidance for inverse design and accelerated optimization.
Abstract
This work designs a model named POST based on the Vision Transformer (ViT) approach. Across single, double, and even triple lattices, as well as various non-circular complex hole structures, POST enables prediction of multiple optical properties of photonic crystal layers in Photonic Crystal Surface Emitting Lasers (PCSELs) with high speed and accuracy, without requiring manual intervention, which serves as a comprehensive surrogate for the optical field simulation. In the predictions of Quality Factor (Q) and Surface-emitting Efficiency (SE) for PCSEL, the R-squared values reach 0.909 and 0.779, respectively. Additionally, it achieves nearly 5,000 predictions per second, significantly lowering simulation costs. The precision and speed of POST predictions lay a solid foundation for future ultra-complex model parameter tuning involving dozens of parameters. It can also swiftly meets designers' ad-hoc requirements for evaluating photonic crystal properties. The database used for training the POST model is derived from predictions of different photonic crystal structures using the Coupled-Wave Theory (CWT) model. This dataset will be made publicly available to foster interdisciplinary research advancements in materials science and computer science.
