Deep Feature-specific Imaging
Yizhou Lu, Andreas Velten
TL;DR
DeepFSI addresses the challenge of Poisson-dominated noise in photon-counting sensors by introducing an end-to-end optical-electronic framework that learns measurement masks $\boldsymbol{M}$ through backpropagation under realistic noise. By unfreezing traditional FSI and optimizing the sensing layer jointly with a task-specific classifier, DeepFSI delivers higher feature fidelity and task performance than predefined FSI masks, including under additive Gaussian noise, and demonstrates robust performance across mask counts and photon budgets. The approach is validated on MNIST with simulations and hardware SPC experiments, and extended to CIFAR10 with a Vision Transformer-based end-to-end variant (OViT), showing generalization to more complex tasks. These results suggest a practical pathway for noise-robust, photon-limited computational imaging by co-optimizing optics and inference, with strong potential for real-world imaging systems and downstream computer-vision applications.
Abstract
Modern photon-counting sensors are increasingly dominated by Poisson noise, yet conventional Feature-Specific Imaging (FSI) is optimized for additive Gaussian noise, leading to suboptimal performance and a loss of its advantages under Poisson noise. To address this, we introduce DeepFSI, a novel end-to-end optical-electronic framework. DeepFSI "unfreezes" traditional FSI masks, enabling a deep neural network to learn globally optimal measurement masks by computing gradients directly under realistic Poisson and additive noise conditions. Our simulations demonstrate DeepFSI's superior feature fidelity and task performance compared to conventional FSI with predefined masks, especially in Poisson-Noise-dominant environments. DeepFSI also exhibits enhanced robustness to design choices and performs well under additive Gaussian noise, representing a significant advance for noise-robust computational imaging in photon-limited applications.
