Exploring Definitions of Quality and Diversity in Sonic Measurement Spaces
Björn Þór Jónsson, Çağrı Erdem, Stefano Fasciani, Kyrre Glette
TL;DR
The paper tackles the challenge of exploring vast sonic parameter spaces by replacing manually defined behaviour descriptors with unsupervised dimensionality reduction (PCA and autoencoders) to define and dynamically reconfigure MAP-Elites behaviour spaces. It compares static and dynamic BD configurations and three quality evaluation strategies (single-reference, multiple-reference, and reference-free), showing unsupervised BD markedly increases sonic diversity and maintains quality, while dynamic reconfiguration sustains exploration at some cost to final coverage. PCA offers the best balance of diversity and efficiency, whereas autoencoders deliver perceptual coherence at the expense of lower diversity; multiple-reference evaluation further enhances exploration. Overall, the work advances automated sonic discovery by enabling unbiased, continual exploration of large timbral spaces without supervised training, with practical implications for creative AI and interactive music systems.
Abstract
Digital sound synthesis presents the opportunity to explore vast parameter spaces containing millions of configurations. Quality diversity (QD) evolutionary algorithms offer a promising approach to harness this potential, yet their success hinges on appropriate sonic feature representations. Existing QD methods predominantly employ handcrafted descriptors or supervised classifiers, potentially introducing unintended exploration biases and constraining discovery to familiar sonic regions. This work investigates unsupervised dimensionality reduction methods for automatically defining and dynamically reconfiguring sonic behaviour spaces during QD search. We apply Principal Component Analysis (PCA) and autoencoders to project high-dimensional audio features onto structured grids for MAP-Elites, implementing dynamic reconfiguration through model retraining at regular intervals. Comparison across two experimental scenarios shows that automatic approaches achieve significantly greater diversity than handcrafted behaviour spaces while avoiding expert-imposed biases. Dynamic behaviour-space reconfiguration maintains evolutionary pressure and prevents stagnation, with PCA proving most effective among the dimensionality reduction techniques. These results contribute to automated sonic discovery systems capable of exploring vast parameter spaces without manual intervention or supervised training constraints.
