A Modular Deep Learning-based Approach for Diffuse Optical Tomography Reconstruction
Alessandro Benfenati, Paola Causin, Martina Quinteri
TL;DR
This paper tackles the ill-posed inverse problem of Diffuse Optical Tomography (DOT) by introducing Mod-DOT, a modular deep learning framework that separates data and signal representations via autoencoders and links their latent spaces with a bridge network acting as a learned regularizer. The approach leverages decoupled pretraining, a denoising component, and convolutional architectures to reconstruct the absorption coefficient field from boundary measurements more robustly than traditional variational methods or end-to-end networks. Extensive synthetic experiments show Mod-DOT markedly improves recovery under realistic noise levels, with convolutional variants delivering better performance and pretraining enhancing stability. The results suggest substantial practical potential for fast, regularized DOT reconstructions in clinical-like settings, while situating Mod-DOT within the broader context of learned regularization and reduced-order modeling for inverse problems.
Abstract
Medical imaging is nowadays a pillar in diagnostics and therapeutic follow-up. Current research tries to integrate established - but ionizing - tomographic techniques with technologies offering reduced radiation exposure. Diffuse Optical Tomography (DOT) uses non-ionizing light in the Near-Infrared (NIR) window to reconstruct optical coefficients in living beings, providing functional indications about the composition of the investigated organ/tissue. Due to predominant light scattering at NIR wavelengths, DOT reconstruction is, however, a severely ill-conditioned inverse problem. Conventional reconstruction approaches show severe weaknesses when dealing also with mildly complex cases and/or are computationally very intensive. In this work we explore deep learning techniques for DOT inversion. Namely, we propose a fully data-driven approach based on a modularity concept: first data and originating signal are separately processed via autoencoders, then the corresponding low-dimensional latent spaces are connected via a bridging network which acts at the same time as a learned regularizer.
