Reinforcement Learning of Adaptive Acquisition Policies for Inverse Problems
Gianluigi Silvestri, Fabio Valerio Massoli, Tribhuvanesh Orekondy, Afshin Abdi, Arash Behboodi
TL;DR
The paper tackles reducing measurement costs in high-dimensional inverse problems by learning adaptive acquisition policies via reinforcement learning. It introduces an end-to-end framework that jointly trains a reconstruction network and a measurement policy, applicable to continuous action spaces, and extends it with a probabilistic belief-state formulation using variational autoencoders. Through experiments on MNIST and MAYO with Gaussian and Radon sensing, the study shows that adaptive strategies improve reconstruction under low-acquisition budgets, with AE-E2E often outperforming baselines, though random measurements can be competitive or superior in long-horizon, high-dimensional settings. The work provides design insights, analyzes theoretical bounds on adaptive sensing, and highlights conditions under which probabilistic adaptive sensing yields the most gains, offering practical guidance for deploying adaptive acquisition in real-world inverse problems.
Abstract
A promising way to mitigate the expensive process of obtaining a high-dimensional signal is to acquire a limited number of low-dimensional measurements and solve an under-determined inverse problem by utilizing the structural prior about the signal. In this paper, we focus on adaptive acquisition schemes to save further the number of measurements. To this end, we propose a reinforcement learning-based approach that sequentially collects measurements to better recover the underlying signal by acquiring fewer measurements. Our approach applies to general inverse problems with continuous action spaces and jointly learns the recovery algorithm. Using insights obtained from theoretical analysis, we also provide a probabilistic design for our methods using variational formulation. We evaluate our approach on multiple datasets and with two measurement spaces (Gaussian, Radon). Our results confirm the benefits of adaptive strategies in low-acquisition horizon settings.
