Principles and Metrics of Extreme Learning Machines Using a Highly Nonlinear Fiber
Mathilde Hary, Daniel Brunner, Lev Leybov, Piotr Ryczkowski, John M. Dudley, Goëry Genty
TL;DR
The paper investigates an optical Extreme Learning Machine implemented in a highly nonlinear fiber to enable ultrafast computation via Kerr-induced nonlinearities. It defines task-independent metrics PC$^{95}$ and consistency to quantify dimensionality and reliability, and demonstrates how dispersion and fiber length shape the spectral feature space; MNIST classification serves as a task-dependent benchmark. Key findings show that longer fibers and anomalous dispersion increase the effective dimensionality to about PC$^{95}$ ≈ 70–100 under suitable power, with a notable spectral localization near the pump within ~40 nm. Importantly, the study reveals that optimal performance often occurs at moderate input power and dimensionality, achieving up to ~0.88 test accuracy for d = 40 (P_in ≈ 1.72 mW), surpassing the linear baseline, while excessive nonlinearity can degrade consistency and performance. Together, these results establish a framework linking NLSE-driven optical dynamics to computational capacity and point toward ultrafast, low-latency optical co-processing and metrology applications, with future prospects for in-situ learning and real-time operation.
Abstract
Optical computing offers potential for ultra high-speed and low latency computation by leveraging the intrinsic properties of light. Here, we explore the use of highly nonlinear optical fibers (HNLFs) as platforms for optical computing based on the concept of Extreme Learning Machines. Task-independent evaluations are introduced to the field for the first time and focus on the fundamental metrics of effective dimensionality and consistency, which we experimentally characterize for different nonlinear and dispersive conditions. We show that input power and fiber characteristics significantly influence the dimensionality of the computational system, with longer fibers and higher dispersion producing up to 100 principal components (PCs) at input power levels of 30 mW, where the PC correspond to the linearly independent dimensions of the system. The spectral distribution of the PC's eigenvectors reveals that the high-dimensional dynamics facilitating computing through dimensionality expansion are located within 40~nm of the pump wavelength at 1560~nm, providing general insight for computing with nonlinear Schrödinger equation systems. Task-dependent results demonstrate the effectiveness of HNLFs in classifying MNIST dataset images. Using input data compression through PC analysis, we inject MNIST images of various input dimensionality into the system and study the impact of input power upon classification accuracy. At optimized power levels we achieve a classification test accuracy of 88\%, significantly surpassing the baseline of 83.7\% from linear systems. Noteworthy, we find that best performance is not obtained at maximal input power, i.e. maximal system dimensionality, but at more than one order of magnitude lower. The same is confirmed regarding the MNIST image's compression, where accuracy is substantially improved when strongly compressing the image to less than 50 PCs.
