Tutorial: A practical guide to the alignment of defocused spatial light modulators for fast diffractive neural networks
Guillaume Noetinger, Tim Tuuva, Romain Fleury
TL;DR
This work demonstrates a scalable, defocused-conjugation approach to align multiple spatial light modulators (SLMs) for optical diffractive neural networks, enabling high-throughput, parallel processing of hundreds of inputs. The authors develop a semi-automatic alignment protocol based on edge-diffraction and ellipse-detection, achieving pixel-level conjugation across ROIs and providing a mapping suitable for batch training. They characterize alignment precision, model the PSF with Huygens-Fresnel theory, and examine the impact of a diaphragm on high-frequency noise and speckle grain, as well as transmission-matrix measurements for refocusing. The study shows substantial speedups in training time via spatial multiplexing and demonstrates noise reduction through averaging, while highlighting challenges such as parasitic reflections that hinder backpropagation and the need for further advances in robust optical training methods. The results offer a practical pathway to scalable optical DNNs with potential applications in wavefront control, imaging, and optical computing, while outlining concrete avenues for improving alignment, TM-based optimization, and training strategies.
Abstract
The conjugation of multiple spatial light modulators (SLMs) enables the construction of optical diffractive neural networks (DNNs). To accelerate training, limited by the low refresh rate of SLMs, spatial multiplexing of the input data across different spatial channels is possible maximizing the number of available spatial degrees of freedom (DoFs). Precise alignment is required in order to ensure that the same physical operation is performed across each channel. We present a semi-automatic procedure for this experimentally challenging alignment resulting in a pixel-level conjugation. It is scalable to any number of SLMs and may be useful in wavefront shaping setups where precise conjugation of SLMs is required, e.g. for the control of optical waves in phase and amplitude. The resulting setup functions as an optical DNN able to process hundreds of inputs simultaneously, thereby reducing training times and experimental noise through spatial averaging. We further present a characterization of the setup and an alignment method.
