Modelling the Effects of Hearing Loss on Neural Coding in the Auditory Midbrain with Variational Conditioning
Lloyd Pellatt, Fotios Drakopoulos, Shievanie Sabesan, Nicholas A. Lesica
TL;DR
The paper tackles the challenge of modeling central auditory midbrain neural coding under hearing loss by learning a low-dimensional conditioning vector $\psi$ that encodes HI-induced distortions directly from multi-animal neural data. It introduces a variational-conditional extension of ICNet, with a time-degradation module and a KL-based regularization, and compares conditioning on learned $\psi$ versus ABR thresholds. The approach achieves substantial explainable variance capture and enables rapid adaptation to unseen animals via Bayesian optimization over $\psi$, with performance improving as more animals are included in training. This work paves the way for parametric, data-driven hearing-loss compensation approaches that target neural codes in the midbrain and could be quickly tuned to new listeners through human-in-the-loop optimization.
Abstract
The mapping from sound to neural activity that underlies hearing is highly non-linear. The first few stages of this mapping in the cochlea have been modelled successfully, with biophysical models built by hand and, more recently, with DNN models trained on datasets simulated by biophysical models. Modelling the auditory brain has been a challenge because central auditory processing is too complex for models to be built by hand, and datasets for training DNN models directly have not been available. Recent work has taken advantage of large-scale high resolution neural recordings from the auditory midbrain to build a DNN model of normal hearing with great success. But this model assumes that auditory processing is the same in all brains, and therefore it cannot capture the widely varying effects of hearing loss. We propose a novel variational-conditional model to learn to encode the space of hearing loss directly from recordings of neural activity in the auditory midbrain of healthy and noise exposed animals. With hearing loss parametrised by only 6 free parameters per animal, our model accurately predicts 62% of the explainable variance in neural responses from normal hearing animals and 68% for hearing impaired animals, within a few percentage points of state of the art animal specific models. We demonstrate that the model can be used to simulate realistic activity from out of sample animals by fitting only the learned conditioning parameters with Bayesian optimisation, achieving crossentropy loss within 2% of the optimum in 15-30 iterations. Including more animals in the training data slightly improved the performance on unseen animals. This model will enable future development of parametrised hearing loss compensation models trained to directly restore normal neural coding in hearing impaired brains, which can be quickly fitted for a new user by human in the loop optimisation.
