Table of Contents
Fetching ...

Optimal design for linear models via gradient flow

Ruhui Jin, Martin Guerra, Qin Li, Stephen Wright

TL;DR

This work tackles optimal experimental design when the design space is continuous by reframing the problem as optimization over probability measures endowed with the Wasserstein metric. It introduces a Wasserstein gradient-flow framework, approximated via Monte Carlo particles, to minimize or maximize standard OED criteria (A-optimal and D-optimal) in the continuous setting. The paper establishes first-order criticality conditions, proves convexity in the L2 sense, and analyzes particle-flow simulation error, providing a groundwork for convergence and stability. The authors demonstrate the approach on linearized inverse problems, notably electrical impedance tomography and one-dimensional Darcy flow, revealing informative design structures and highlighting initialization-dependent nonconvexities in Wasserstein space. The methodology offers a scalable, measure-theoretic path to continuous OED with potential impact across PDE-constrained inference and sensor-placement problems.

Abstract

Optimal experimental design (OED) aims to choose the observations in an experiment to be as informative as possible, according to certain statistical criteria. In the linear case (when the observations depend linearly on the unknown parameters), it seeks the optimal weights over rows of the design matrix A under certain criteria. Classical OED assumes a discrete design space and thus a design matrix with finite dimensions. In many practical situations, however, the design space is continuous-valued, so that the OED problem is one of optimizing over a continuous-valued design space. The objective becomes a functional over the probability measure, instead of over a finite dimensional vector. This change of perspective requires a new set of techniques that can handle optimizing over probability measures, and Wasserstein gradient flow becomes a natural candidate. Both the first-order criticality and the convexity properties of the OED objective are presented. Computationally Monte Carlo particle simulation is deployed to formulate the main algorithm. This algorithm is applied to two elliptic inverse problems.

Optimal design for linear models via gradient flow

TL;DR

This work tackles optimal experimental design when the design space is continuous by reframing the problem as optimization over probability measures endowed with the Wasserstein metric. It introduces a Wasserstein gradient-flow framework, approximated via Monte Carlo particles, to minimize or maximize standard OED criteria (A-optimal and D-optimal) in the continuous setting. The paper establishes first-order criticality conditions, proves convexity in the L2 sense, and analyzes particle-flow simulation error, providing a groundwork for convergence and stability. The authors demonstrate the approach on linearized inverse problems, notably electrical impedance tomography and one-dimensional Darcy flow, revealing informative design structures and highlighting initialization-dependent nonconvexities in Wasserstein space. The methodology offers a scalable, measure-theoretic path to continuous OED with potential impact across PDE-constrained inference and sensor-placement problems.

Abstract

Optimal experimental design (OED) aims to choose the observations in an experiment to be as informative as possible, according to certain statistical criteria. In the linear case (when the observations depend linearly on the unknown parameters), it seeks the optimal weights over rows of the design matrix A under certain criteria. Classical OED assumes a discrete design space and thus a design matrix with finite dimensions. In many practical situations, however, the design space is continuous-valued, so that the OED problem is one of optimizing over a continuous-valued design space. The objective becomes a functional over the probability measure, instead of over a finite dimensional vector. This change of perspective requires a new set of techniques that can handle optimizing over probability measures, and Wasserstein gradient flow becomes a natural candidate. Both the first-order criticality and the convexity properties of the OED objective are presented. Computationally Monte Carlo particle simulation is deployed to formulate the main algorithm. This algorithm is applied to two elliptic inverse problems.
Paper Structure (21 sections, 5 theorems, 69 equations, 14 figures, 1 table, 1 algorithm)

This paper contains 21 sections, 5 theorems, 69 equations, 14 figures, 1 table, 1 algorithm.

Key Result

Proposition 1

Assume that the continuous experiment/data matrix $\mathbf{A}$ defined in this section is full rank. The Fréchet derivatives for eqn: cont A-opt and eqn: cont D-opt are

Figures (14)

  • Figure 1:
  • Figure 2: The first row shows media configurations \ref{['eqn: homo']}, \ref{['eqn: inhomo']} for homogeneous media \ref{['eqn: homo']} and inhomogeneous media \ref{['eqn: inhomo']}, respectively. The second row shows the landscape of the A-optimal objective \ref{['eqn: cont A-opt']} captured by $\rho_L$ defined in \ref{['eqn:rho_parameter_plot']}. For reference, the dashed lines are obtained from a uniform sampling distribution over the entire boundary $\partial \mathcal{D}^2$.
  • Figure 3: The landscape of the D-optimal objective \ref{['eqn: cont D-opt']} depicted by $\rho_L$ defined by \ref{['eqn:rho_parameter_plot']} for honogeneous media \ref{['eqn: homo']}. For reference, we plot the solution to both the optimal sampling strategy achieved by the classical Fedorov method F13JD75, and the uniform sampling.
  • Figure 4: Initialization strategies for particle sampling, illustrating regions Init.\ref{['init: entire space']}, \ref{['init: L-shape']}, \ref{['init: diagonal']}, respectively.
  • Figure 5: The gradients are computed via the results are computed using \ref{['eqn: A velocity']}. (a) and (b) respectively shows the gradient magnitude and field for the homogeneous case $\mathbf{A}_\mathrm{h}.$ The red arrows in (b) indicate the gradient vector \ref{['eqn: A velocity']} directions. (c) shows the gradient magnitude for the inhomogeneous case $\mathbf{A}_{\mathrm{ih}}$.
  • ...and 9 more figures

Theorems & Definitions (18)

  • Remark 1
  • Remark 2
  • Definition 1
  • Proposition 1
  • proof
  • Remark 3
  • Remark 4
  • Proposition 2
  • proof
  • Remark 5
  • ...and 8 more