Strong Formulations and Algorithms for Regularized A-optimal Design
Yongchun Li
TL;DR
The paper tackles Regularized A-optimal Design ($RAOD$), proving its NP-hardness and introducing a stronger convex integer programming formulation whose convex relaxation yields bounded optimality gaps for all $k$. It develops an exact cutting-plane algorithm based on a novel convex envelope (conv$\Gamma$) and a corresponding MILP reformulation, plus scalable forward and backward greedy algorithms with data-independent guarantees for different $k$ regimes. Extensive numerical experiments on synthetic and real data (including a user cold-start recommendation setting) demonstrate that the proposed exact method substantially outperforms existing relaxations and that the greedy strategies provide high-quality, scalable solutions. The work offers practical, non-sequential experimental design tools with strong theoretical guarantees, suitable for high-dimensional, small-$k$ scenarios and broader Bayesian-design-inspired applications.
Abstract
We study the Regularized A-optimal Design (RAOD) problem, which selects a subset of $k$ experiments to minimize the inverse of the Fisher information matrix, regularized with a scaled identity matrix. RAOD has broad applications in Bayesian experimental design, sensor placement, and cold-start recommendation. We prove its NP-hardness via a reduction from the independent set problem. By leveraging convex envelope techniques, we propose a new convex integer programming formulation for RAOD, whose continuous relaxation dominates those of existing formulations. More importantly, we demonstrate that our continuous relaxation achieves bounded optimality gaps for all $k$, whereas previous relaxations may suffer from unbounded gaps. This new formulation enables the development of an exact cutting-plane algorithm with superior efficiency, especially in high-dimensional and small-$k$ scenarios. We also investigate scalable forward and backward greedy algorithms for solving RAOD, each with provable performance guarantees for different $k$ ranges. Finally, our numerical results on synthetic and real data demonstrate the efficacy of the proposed exact and approximation algorithms. We further showcase the practical effectiveness of RAOD by applying it to a real-world user cold-start recommendation problem.
