Diffuse Sound Field Synthesis
Franz Zotter, Stefan Riedel, Lukas Gölles, Matthias Frank
TL;DR
This work develops a physics-grounded framework for synthesizing diffuse sound fields from uncorrelated surrounding sources by exploiting Green's functions, Gauss' law, and Newton's spherical shell theorem. It shows that ideal diffuse fields (constant energy density with vanishing intensity) are achievable in circular (2D) or spherical (3D) layouts when the radial decay matches the physical free-field law, $|G(r)|\propto 1/\sqrt{r}^{D-1}$, and, in some geometries, via modified decays $1/r^\beta$ analyzed with Gegenbauer polynomials. The authors identify optimal hulls (circles, spheres, opposing planes) and develop mode-matching strategies to extend diffuseness to non-circular/non-spherical shapes, including ellipsoids and cuboids, as well as discrete realizations through spherical $t$-designs. The work further assesses the practical implications for 2.5D WFS and perceptual relevance by connecting diffuseness to interaural cues (ILD, IC) using perceptual data and KU100 HRTFs. Overall, the paper provides theoretical and computational tools to guide loudspeaker-based diffusion synthesis, offering design principles and discretization guidelines with direct relevance to psychoacoustic outcomes and experimental setups.
Abstract
Can uncorrelated surrounding sound sources be used to generate extended diffuse sound fields? By definition, targets are a constant sound pressure level, a vanishing average sound intensity, uncorrelated sound waves arriving isotropically from all directions. Does this require specific sources and geometries for surrounding 2D and 3D source layouts? As methods, we employ numeric simulations and undertake a series of calculations with uncorrelated circular/spherical source layouts, or such with infinite excess dimensions, and we point out relations to potential theory. Using a radial decay 1/r^b modified by the exponent b, the representation of the resulting fields with hypergeometric functions, Gegenbauer polynomials, and circular as well as spherical harmonics yields fruitful insights. In circular layouts, waves decaying by the exponent b=1/2 synthesize ideally extended, diffuse sound fields; spherical layouts do so with b=1. None of the layouts synthesizes a perfectly constant expected sound pressure level but its flatness is acceptable. Spherical t-designs describe optimal source layouts with well-described area of high diffuseness, and non-spherical, convex layouts can be improved by restoring isotropy or by mode matching for a maximally diffuse synthesis. Theory and simulation offer a basis for loudspeaker-based synthesis of diffuse sound fields and contribute physical reasons to recent psychoacoustic findings in spatial audio.
