Simulation of prosthetic vision with PRIMA system and enhancement of face representation
Anna Kochnev Goldstein, Jungyeon Park, Yueming Zhuo, Nathan Jensen, Daniel Palanker
TL;DR
The study addresses the gap between high letter acuity and poor face perception in PRIMA prosthetic vision by introducing ProViSim, a non-pixelated simulator that models resolution and contrast losses. It combines a resolution-reduction pipeline with contrast distortions and validates outputs against patient reports, while enabling ML-driven facial feature enhancement and inverse-contrast pre-processing. The authors demonstrate improvements in emotion recognition and response times in sighted participants and project potential benefits for future smaller-pixel implants when paired with targeted contrast optimization and landmark-based feature reinforcement. The work offers a software-focused pathway to upgrade current PRIMA users and informs hardware development for higher-resolution retinal prostheses, with plans for personalized clinical validation of contrast-loss functions.
Abstract
Objective. Patients implanted with the PRIMA photovoltaic subretinal prosthesis in geographic atrophy report form vision with the average acuity matching the 100um pixel size. Although this remarkable outcome enables them to read and write, they report difficulty with perceiving faces. Despite the pixelated stimulation, patients see smooth patterns rather than dots. We present a novel, non-pixelated algorithm for simulating prosthetic vision, compare its predictions to clinical outcomes, and describe computer vision and machine learning (ML) methods to improve face representation. Approach. Our simulation algorithm (ProViSim) integrates a spatial resolution filter based on sampling density limited by the pixel pitch and a contrast filter representing reduced contrast sensitivity of prosthetic vision. Patterns of Landolt C and human faces created using this simulator are compared to reports from actual PRIMA users. To recover the facial features lost in prosthetic vision due to limited resolution or contrast, we apply an ML facial landmarking model, as well as contrast-adjusting tone curves to the image prior to its projection onto the photovoltaic retinal implant. Main results. Prosthetic vision simulated using the above algorithm matches the letter acuity observed in clinical studies, as well as the patients' descriptions of perceived facial features. Applying the inversed contrast filter to images prior to projection onto the implant and accentuating the facial features using an ML facial landmarking model helps preserve the contrast in prosthetic vision, improves emotion recognition and reduces the response time. Significance. Spatial and contrast constraints of prosthetic vision limit the resolvable features and degrade natural images. ML based methods and contrast adjustments prior to image projection onto the implant mitigate some limitations and improve face representation.
