Table of Contents
Fetching ...

Sensor array and camera fusion via unbalanced optimal transport for 3D source localization

Ilyes Jaouedi, Gilles Chardon, José Picheral

Abstract

We address the problem of localizing multiple sources in 3D by combining sensor array measurements with camera observations. We propose a fusion framework extending the covariance matrix fitting method with an unbalanced optimal transport regularization term that softly aligns sensor array responses with visual priors while allowing flexibility in mass allocation. To solve the resulting largescale problem, we adopt a greedy coordinate descent algorithm that efficiently updates the transport plan. Its computational efficiency makes full 3D localization feasible in practice. The proposed framework is modular and does not rely on labeled data or training, in contrast with deep learning-based fusion approaches. Although validated here on acoustic arrays, the method is general to arbitrary sensor arrays. Experiments on real data show that the proposed approach improves localization accuracy compared to sensor-only baselines.

Sensor array and camera fusion via unbalanced optimal transport for 3D source localization

Abstract

We address the problem of localizing multiple sources in 3D by combining sensor array measurements with camera observations. We propose a fusion framework extending the covariance matrix fitting method with an unbalanced optimal transport regularization term that softly aligns sensor array responses with visual priors while allowing flexibility in mass allocation. To solve the resulting largescale problem, we adopt a greedy coordinate descent algorithm that efficiently updates the transport plan. Its computational efficiency makes full 3D localization feasible in practice. The proposed framework is modular and does not rely on labeled data or training, in contrast with deep learning-based fusion approaches. Although validated here on acoustic arrays, the method is general to arbitrary sensor arrays. Experiments on real data show that the proposed approach improves localization accuracy compared to sensor-only baselines.

Paper Structure

This paper contains 13 sections, 8 equations, 5 figures.

Figures (5)

  • Figure 1: Spatial Configuration
  • Figure 2: Experimental Setup
  • Figure 3: Estimated source distributions on the 3D grid.
  • Figure 4: Mean squared localization error versus SNR.
  • Figure 5: Mean squared localization error as a function of source distance from the array (fixed SNR = 10 dB).