Sound Field Estimation Using Optimal Transport Barycenters in the Presence of Phase Errors
Yuyang Liu, Johan Karlsson, Filip Elvander
TL;DR
This work addresses phase-perturbation errors in sound-field estimation by reframing plane-wave coefficients as lifted non-negative measures on the unit circle $\mathbb{T}$. An OT barycenter is computed to obtain a phase-consistent nominal estimate, with the ground cost $c(\psi_1,\psi_2) = |e^{i\psi_1}-e^{i\psi_2}|^2 + \gamma$, and sparsity induced by the $\gamma$ term. The estimator combines transport regularization with a data-fidelity penalty, yielding a convex optimization that recovers the phase-corrected coefficients via $\hat{\boldsymbol{\Phi}} = \int_{\mathbb{T}} e^{i\psi} d\boldsymbol{\mu}^{(0)}(\psi)$. Simulations in 2D show improved NMSE over baseline methods under additive noise and phase errors, with stronger gains as more microphones are used, indicating robust performance and practical potential for sound-field reconstruction under calibration uncertainties.
Abstract
This study introduces a novel approach for estimating plane-wave coefficients in sound field reconstruction, specifically addressing challenges posed by error-in-variable phase perturbations. Such systematic errors typically arise from sensor mis-calibration, including uncertainties in sensor positions and response characteristics, leading to measurement-induced phase shifts in plane wave coefficients. Traditional methods often result in biased estimates or non-convex solutions. To overcome these issues, we propose an optimal transport (OT) framework. This framework operates on a set of lifted non-negative measures that correspond to observation-dependent shifted coefficients relative to the unperturbed ones. By applying OT, the supports of the measures are transported toward an optimal average in the phase space, effectively morphing them into an indistinguishable state. This optimal average, known as barycenter, is linked to the estimated plane-wave coefficients using the same lifting rule. The framework addresses the ill-posed nature of the problem, due to the large number of plane waves, by adding a constant to the ground cost, ensuring the sparsity of the transport matrix. Convex consistency of the solution is maintained. Simulation results confirm that our proposed method provides more accurate coefficient estimations compared to baseline approaches in scenarios with both additive noise and phase perturbations.
