Distribution-informed and wavelength-flexible data-driven photoacoustic oximetry
Janek Gröhl, Kylie Yeung, Kevin Gu, Thomas R. Else, Monika Golinska, Ellie V. Bunce, Lina Hacker, Sarah E. Bohndiek
TL;DR
This work tackles the challenge of robust, quantitative sO$_2$ estimation from multispectral photoacoustic imaging by introducing a wavelength-flexible LSTM network for spectral unmixing. It demonstrates that training-data selection, guided by Jensen-Shannon divergence, markedly influences accuracy across simulations, phantoms, and in vivo data, and shows that D_JS can predict suitable training datasets for new applications. The approach outperforms linear unmixing and prior data-driven methods, achieving a broader dynamic range and improved alignment with ground-truth or literature references. Together, the flexible LSTM architecture and D_JS-based training-data optimisation offer a promising route toward clinically robust photoacoustic oximetry, while highlighting the need for realistic simulations, 3D modelling, and domain adaptation to bridge the simulation-to-clinical gap.
Abstract
Significance: Photoacoustic imaging (PAI) promises to measure spatially-resolved blood oxygen saturation, but suffers from a lack of accurate and robust spectral unmixing methods to deliver on this promise. Accurate blood oxygenation estimation could have important clinical applications, from cancer detection to quantifying inflammation. Aim: This study addresses the inflexibility of existing data-driven methods for estimating blood oxygenation in PAI by introducing a recurrent neural network architecture. Approach: We created 25 simulated training dataset variations to assess neural network performance. We used a long short-term memory network to implement a wavelength-flexible network architecture and proposed the Jensen-Shannon divergence to predict the most suitable training dataset. Results: The network architecture can handle arbitrary input wavelengths and outperforms linear unmixing and the previously proposed learned spectral decolouring method. Small changes in the training data significantly affect the accuracy of our method, but we find that the Jensen-Shannon divergence correlates with the estimation error and is thus suitable for predicting the most appropriate training datasets for any given application. Conclusions: A flexible data-driven network architecture combined with the Jensen-Shannon Divergence to predict the best training data set provides a promising direction that might enable robust data-driven photoacoustic oximetry for clinical use cases.
