Generalist Segmentation Algorithm for Photoreceptors Analysis in Adaptive Optics Imaging
Mikhail Kulyabin, Aline Sindel, Hilde Pedersen, Stuart Gilson, Rigmor Baraas, Andreas Maier
TL;DR
This work tackles automated cone segmentation in confocal AOSLO retinal images, addressing the labor-intensive labeling bottleneck with a semi-automatic, human-in-the-loop DL framework. The approach combines Voronoi-based masking, an attention-augmented U-Net that predicts gradient fields and region membership, and center-of-mass extraction to locate cone centers, trained on 20 batches from 18 participants across $0^{\circ}$–$2^{\circ}$ eccentricities. It achieves high F1 scores of $0.968$, $0.958$, and $0.954$ for $0^{\circ}$, $1^{\circ}$, and $2^{\circ}$, respectively, outperforming StarDist and Cellpose while reducing labeling effort by requiring only a fraction of labeled cones and iteratively refining annotations. The method holds practical significance for ophthalmology by enabling scalable, accurate cone density analyses and potential rod detection, with code made available for reproducibility.
Abstract
Analyzing the cone photoreceptor pattern in images obtained from the living human retina using quantitative methods can be crucial for the early detection and management of various eye conditions. Confocal adaptive optics scanning light ophthalmoscope (AOSLO) imaging enables visualization of the cones from reflections of waveguiding cone photoreceptors. While there have been significant improvements in automated algorithms for segmenting cones in confocal AOSLO images, the process of labelling data remains labor-intensive and manual. This paper introduces a method based on deep learning (DL) for detecting and segmenting cones in AOSLO images. The models were trained on a semi-automatically labelled dataset of 20 AOSLO batches of images of 18 participants for 0$^{\circ}$, 1$^{\circ}$, and 2$^{\circ}$ from the foveal center. F1 scores were 0.968, 0.958, and 0.954 for 0$^{\circ}$, 1$^{\circ}$, and 2$^{\circ}$, respectively, which is better than previously reported DL approaches. Our method minimizes the need for labelled data by only necessitating a fraction of labelled cones, which is especially beneficial in the field of ophthalmology, where labelled data can often be limited.
