Enhancing the analysis of murine neonatal ultrasonic vocalizations: Development, evaluation, and application of different mathematical models

Rudolf Herdt; Louisa Kinzel; Johann Georg Maaß; Marvin Walther; Henning Fröhlich; Tim Schubert; Peter Maass; Christian Patrick Schaaf

Enhancing the analysis of murine neonatal ultrasonic vocalizations: Development, evaluation, and application of different mathematical models

Rudolf Herdt, Louisa Kinzel, Johann Georg Maaß, Marvin Walther, Henning Fröhlich, Tim Schubert, Peter Maass, Christian Patrick Schaaf

TL;DR

The study tackles the challenge of reliably analyzing neonatal murine USVs by developing a two-stage pipeline that first detects calls via an entropy-based spectrogram method and then classifies them with a range of neural networks. Through 10-fold cross-validation on a sizable Nr2f1-derived dataset, EfficientNet-B5 and a compact custom CNN achieve top classification performance around $87\%$ accuracy, while a semi-automated mode leverages confidence thresholds to drastically reduce manual review with high recall. The authors provide interpretability analyses (channel visualizations and saliency maps) showing the models attend to core spectrotemporal features, and demonstrate practical utility by detecting quantitative and qualitative USV differences in autism-like mouse lines during development. This framework enables high-throughput, scalable phenotyping of neonatal USVs and offers avenues for future enhancements using attention mechanisms or contrastive learning, with potential broader applicability in rodent communication studies.

Abstract

Rodents employ a broad spectrum of ultrasonic vocalizations (USVs) for social communication. As these vocalizations offer valuable insights into affective states, social interactions, and developmental stages of animals, various deep learning approaches have aimed to automate both the quantitative (detection) and qualitative (classification) analysis of USVs. Here, we present the first systematic evaluation of different types of neural networks for USV classification. We assessed various feedforward networks, including a custom-built, fully-connected network and convolutional neural network, different residual neural networks (ResNets), an EfficientNet, and a Vision Transformer (ViT). Paired with a refined, entropy-based detection algorithm (achieving recall of 94.9% and precision of 99.3%), the best architecture (achieving 86.79% accuracy) was integrated into a fully automated pipeline capable of analyzing extensive USV datasets with high reliability. Additionally, users can specify an individual minimum accuracy threshold based on their research needs. In this semi-automated setup, the pipeline selectively classifies calls with high pseudo-probability, leaving the rest for manual inspection. Our study focuses exclusively on neonatal USVs. As part of an ongoing phenotyping study, our pipeline has proven to be a valuable tool for identifying key differences in USVs produced by mice with autism-like behaviors.

Enhancing the analysis of murine neonatal ultrasonic vocalizations: Development, evaluation, and application of different mathematical models

TL;DR

accuracy, while a semi-automated mode leverages confidence thresholds to drastically reduce manual review with high recall. The authors provide interpretability analyses (channel visualizations and saliency maps) showing the models attend to core spectrotemporal features, and demonstrate practical utility by detecting quantitative and qualitative USV differences in autism-like mouse lines during development. This framework enables high-throughput, scalable phenotyping of neonatal USVs and offers avenues for future enhancements using attention mechanisms or contrastive learning, with potential broader applicability in rodent communication studies.

Abstract

Paper Structure (34 sections, 6 equations, 19 figures, 6 tables, 1 algorithm)

This paper contains 34 sections, 6 equations, 19 figures, 6 tables, 1 algorithm.

Introduction
Material
Signal acquisition setup
Dataset
Syllable classes
Methods
Structure of the pipeline
Segmentation, detection
Neural networks for USV classification
Fully connected neural networks for USV classification (FNN)
Architecture
Data Preprocessing
Training and Regularization
CNN and ViT
Architectures
...and 19 more sections

Figures (19)

Figure 1: Overview of the 5 classes.
Figure 2: Overview of the final pipeline.
Figure 3: A spectrogram displaying three calls, annotated by both the automatic detection (green and red vertical lines) and the manual detection (orange horizontal line).(Color online)
Figure 4: Example of the detection algorithm with corresponding thresholds.(Color online)
Figure 5: Network architecture of the FNN used for USV classification.
...and 14 more figures

Enhancing the analysis of murine neonatal ultrasonic vocalizations: Development, evaluation, and application of different mathematical models

TL;DR

Abstract

Enhancing the analysis of murine neonatal ultrasonic vocalizations: Development, evaluation, and application of different mathematical models

Authors

TL;DR

Abstract

Table of Contents

Figures (19)