ConvNets for Counting: Object Detection of Transient Phenomena in Steelpan Drums

Scott H. Hawley; Andrew C. Morrison

ConvNets for Counting: Object Detection of Transient Phenomena in Steelpan Drums

Scott H. Hawley, Andrew C. Morrison

TL;DR

The paper presents SPNet, a CNN-based detector trained to count interference fringes within elliptical antinode regions in ESPI videos of transient steelpan vibrations. By combining crowdsourced SVP annotations with synthetic, style-transferred data, the authors demonstrate high accuracy on synthetic datasets and provide initial, physically meaningful insights from optical measurements, including octave-frequency alignment and notable delays relative to acoustic signals. The work highlights the challenges of real-world annotation variability and proposes path forward through improved labeling, transfer learning, and physics-informed data generation. This approach offers a scalable framework for extracting time-dependent vibrational dynamics from ESPI imagery, with potential applicability to other musical-instrument diagnostics and transient ESPI analyses.

Abstract

We train an object detector built from convolutional neural networks to count interference fringes in elliptical antinode regions in frames of high-speed video recordings of transient oscillations in Caribbean steelpan drums illuminated by electronic speckle pattern interferometry (ESPI). The annotations provided by our model aim to contribute to the understanding of time-dependent behavior in such drums by tracking the development of sympathetic vibration modes. The system is trained on a dataset of crowdsourced human-annotated images obtained from the Zooniverse Steelpan Vibrations Project. Due to the small number of human-annotated images and the ambiguity of the annotation task, we also evaluate the model on a large corpus of synthetic images whose properties have been matched to the real images by style transfer using a Generative Adversarial Network. Applying the model to thousands of unlabeled video frames, we measure oscillations consistent with audio recordings of these drum strikes. One unanticipated result is that sympathetic oscillations of higher-octave notes significantly precede the rise in sound intensity of the corresponding second harmonic tones; the mechanism responsible for this remains unidentified. This paper primarily concerns the development of the predictive model; further exploration of the steelpan images and deeper physical insights await its further application.

ConvNets for Counting: Object Detection of Transient Phenomena in Steelpan Drums

TL;DR

Abstract

ConvNets for Counting: Object Detection of Transient Phenomena in Steelpan Drums

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (9)