Graph Neural Field with Spatial-Correlation Augmentation for HRTF Personalization

De Hu; Junsheng Hu; Cuicui Jiang

TL;DR

This work targets head-related transfer function (HRTF) personalization for unseen subjects by leveraging spatial correlations across directions. It introduces GraphNF-SCA, a three-part framework: a GNN-based HRTF-P module that predicts subject-specific HRTFs, a GNN-based HRTF-U module that models the spatial structure across directions, and a fine-tuning stage that uses those spatial relationships to reinforce the predictions. The method reports state-of-the-art log-spectral distortion (LSD) and interaural level difference (ILD) performance on the SONICOM, CIPIC, and HUTUBS datasets, especially in data-scarce scenarios. It does so by integrating retrieved subject information and spatial correlations through graph neural networks and LoRA-based decoding. The approach enables scalable, accurate HRTF personalization for unseen subjects, supporting high-fidelity immersive spatial audio in VR/AR applications.
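For readers unfamiliar with the LSD metric mentioned above, the following is a minimal sketch of one common formulation: the root-mean-square difference, in dB, between reference and predicted magnitude spectra, averaged over directions. The exact definition used in the paper is not given here, so the formula, the function names `log_spectral_distortion` and `mean_lsd`, and the `eps` guard are all assumptions made for illustration.

```python
import math


def log_spectral_distortion(h_true, h_pred, eps=1e-12):
    """RMS dB difference across frequency bins for a single direction.

    h_true, h_pred: sequences of (possibly complex) HRTF values, one per bin.
    eps is an illustrative guard against log(0), not taken from the paper.
    """
    if len(h_true) != len(h_pred) or len(h_true) == 0:
        raise ValueError("spectra must be non-empty and of equal length")
    squared_sum = 0.0
    for a, b in zip(h_true, h_pred):
        diff_db = 20.0 * math.log10((abs(a) + eps) / (abs(b) + eps))
        squared_sum += diff_db ** 2
    return math.sqrt(squared_sum / len(h_true))


def mean_lsd(hrtf_true, hrtf_pred):
    """Average the per-direction LSD over all measured directions."""
    values = [log_spectral_distortion(t, p) for t, p in zip(hrtf_true, hrtf_pred)]
    return sum(values) / len(values)


# Two directions, three frequency bins each (toy magnitudes).
reference = [[1.0, 1.0, 1.0], [2.0, 2.0, 2.0]]
prediction = [[1.0, 1.0, 1.0], [20.0, 20.0, 20.0]]
score = mean_lsd(reference, prediction)
```

In this toy case the first direction matches exactly and the second is off by a factor of ten in magnitude (20 dB), so the averaged score is close to 10 dB. Lower values indicate a closer spectral match.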

Abstract

Immersive spatial audio rendering on VR/AR devices requires high-quality Head-Related Transfer Functions (HRTFs). HRTFs are both subject-dependent and position-dependent, and measuring them is time-consuming and tedious. To address this challenge, we propose the Graph Neural Field with Spatial-Correlation Augmentation (GraphNF-SCA) for HRTF personalization, which generates individual HRTFs for unseen subjects. GraphNF-SCA consists of three key components: an HRTF personalization (HRTF-P) module, an HRTF upsampling (HRTF-U) module, and a fine-tuning stage. The HRTF-P module predicts the HRTFs of the target subject with a Graph Neural Network (GNN) in an encoder-decoder architecture, where the encoder extracts universal features and the decoder incorporates target-relevant features to produce individualized HRTFs. The HRTF-U module employs another GNN to model spatial correlations across HRTFs. This module is fine-tuned using the output of the HRTF-P module, thereby enhancing the spatial consistency of the predicted HRTFs. Unlike existing methods, which estimate individual HRTFs position by position without modeling spatial correlation, GraphNF-SCA leverages the inherent spatial correlations across HRTFs to improve personalization. Experimental results demonstrate that GraphNF-SCA achieves state-of-the-art performance.
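To make the idea of spatial correlation across HRTF positions concrete, the sketch below builds a k-nearest-neighbour graph over measurement directions and applies one round of neighbour averaging. It is not the authors' GNN: there are no learnable weights, and the graph construction rule, the function names (`to_unit_vector`, `knn_graph`, `aggregate`), and the `self_weight` blend are hypothetical choices made only to illustrate how features at nearby directions can inform one another.

```python
import math


def to_unit_vector(azimuth_deg, elevation_deg):
    """Convert a direction in degrees to a 3-D unit vector."""
    az, el = math.radians(azimuth_deg), math.radians(elevation_deg)
    return (math.cos(el) * math.cos(az), math.cos(el) * math.sin(az), math.sin(el))


def knn_graph(directions, k):
    """Return, for each direction, the indices of its k angularly closest directions."""
    vectors = [to_unit_vector(az, el) for az, el in directions]
    neighbors = []
    for i, vi in enumerate(vectors):
        angles = []
        for j, vj in enumerate(vectors):
            if j == i:
                continue
            dot = sum(a * b for a, b in zip(vi, vj))
            dot = max(-1.0, min(1.0, dot))  # clamp against rounding error
            angles.append((math.acos(dot), j))
        angles.sort()
        neighbors.append([j for _, j in angles[:k]])
    return neighbors


def aggregate(features, neighbors, self_weight=0.5):
    """One message-passing step: blend each node with the mean of its neighbours."""
    updated = []
    for i, nbrs in enumerate(neighbors):
        if not nbrs:  # isolated node keeps its own feature
            updated.append(list(features[i]))
            continue
        dim = len(features[i])
        mean = [sum(features[j][d] for j in nbrs) / len(nbrs) for d in range(dim)]
        updated.append(
            [self_weight * features[i][d] + (1.0 - self_weight) * mean[d] for d in range(dim)]
        )
    return updated


# Four directions on the horizontal plane: two near the front, two near the back.
directions = [(0, 0), (10, 0), (180, 0), (190, 0)]
graph = knn_graph(directions, k=1)
features = [[0.0], [2.0], [10.0], [12.0]]  # toy one-dimensional features per direction
smoothed = aggregate(features, graph)
```

Each front direction is linked only to the other front direction, and likewise at the back, so the averaging pulls each pair toward its shared mean without mixing the two groups. A learned GNN replaces this fixed average with trainable transformations, but the neighbourhood structure plays the same role.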
