Optimizing Retinal Prosthetic Stimuli with Conditional Invertible Neural Networks
Yuli Wu, Julian Wittmann, Peter Walter, Johannes Stegmaier
TL;DR
This work addresses the challenge of limited information transfer in retinal prostheses by optimizing electrical stimuli to evoke percepts more faithfully. It introduces conditional invertible neural networks (cINN) and an INN-MMD variant to learn an inverse mapping from percepts to stimuli, guided by a physiologically validated Axon Map Model as the forward simulator, and leverages exact likelihoods through normalizing flows with p(x) = π(\mathbf{z}) |det J_h(\mathbf{x})|. The approach uses the NLL loss on invertible architectures and a maximum mean discrepancy (MMD) regularization to handle non-bijective mappings, with conditioning to steer stimulus generation toward target percepts. Empirical results on a 9×9 electrode array with MNIST-based targets show that cINN-based stimulation yields superior reconstruction quality across multiple metrics (MAE, MSE, SSIM, PSNR, ACC), especially at higher percept resolutions, indicating strong potential for improving patient-specific retinal prostheses.
Abstract
Implantable retinal prostheses offer a promising solution to restore partial vision by circumventing damaged photoreceptor cells in the retina and directly stimulating the remaining functional retinal cells. However, the information transmission between the camera and retinal cells is often limited by the low resolution of the electrode array and the lack of specificity for different ganglion cell types, resulting in suboptimal stimulations. In this work, we propose to utilize normalizing flow-based conditional invertible neural networks to optimize retinal implant stimulation in an unsupervised manner. The invertibility of these networks allows us to use them as a surrogate for the computational model of the visual system, while also encoding input camera signals into optimized electrical stimuli on the electrode array. Compared to other methods, such as trivial downsampling, linear models, and feed-forward convolutional neural networks, the flow-based invertible neural network and its conditional extension yield better visual reconstruction qualities w.r.t. various metrics using a physiologically validated simulation tool.
