Exploiting spatial diversity for increasing the robustness of sound source localization systems against reverberation
Guillermo Garcia-Barrios, Eduardo Latorre Iglesias, Juana M. Gutierrez-Arriola, Ruben Fraile, Nicolas Saenz-Lechon, Victor Jose Osma-Ruiz
TL;DR
The paper addresses the challenge of robust sound source localization in reverberant rooms and proposes exploiting spatial diversity by combining SRP-PHAT maps from multiple arrays. It defines and analyzes how SRP maps evolve when arrays are placed at different separations and shows that fusing maps via spatial diversity can mitigate reverberation-induced distortions in the likelihood landscape P(r). Through simulations in rooms with reverberation times up to 2 s and real-office measurements, the authors demonstrate that smaller, spatially separated arrays can outperform a single large aperture under high reverberation and that sum fusion of SRP maps yields more robust localization. The work provides practical design guidelines, indicating that maximizing inter-array separation while maintaining within-array correlation leads to improved SSL robustness with simple map fusion, offering a readily implementable alternative to more complex dereverberation or channel-estimation approaches.
Abstract
Acoustic reverberation is one of the most relevant factors that hampers the localization of a sound source inside a room. To date, several approaches have been proposed to deal with it, but have not always been evaluated under realistic conditions. This paper proposes exploiting spatial diversity as an alternative approach to achieve robustness against reverberation. The theoretical arguments supporting this approach are first presented and later confirmed by means of simulation results and real measurements. Simulations are run for reverberation times up to 2 s, thus providing results with a wider range of validity than in other previous research works. It is concluded that the use of systems consisting of several, sufficiently separated, small arrays leads to the best results in reverberant environments. Some recommendations are given regarding the choice of the array sizes, the separation among them, and the way to combine SRP-PHAT maps obtained from diverse arrays.
