A Contrastive Self-Supervised Learning scheme for beat tracking amenable to few-shot learning

Antonin Gagnere; Geoffroy Peeters; Slim Essid

A Contrastive Self-Supervised Learning scheme for beat tracking amenable to few-shot learning

Antonin Gagnere, Geoffroy Peeters, Slim Essid

TL;DR

A novel Self-Supervised-Learning scheme to train rhythm analysis systems and instantiate it for few-shot beat tracking and shows that a model pre-trained using this approach on the unlabeled FMA, MTT and MTG-Jamendo datasets can successfully be fine-tuned in the few-shot regime.

Abstract

In this paper, we propose a novel Self-Supervised-Learning scheme to train rhythm analysis systems and instantiate it for few-shot beat tracking. Taking inspiration from the Contrastive Predictive Coding paradigm, we propose to train a Log-Mel-Spectrogram Transformer encoder to contrast observations at times separated by hypothesized beat intervals from those that are not. We do this without the knowledge of ground-truth tempo or beat positions, as we rely on the local maxima of a Predominant Local Pulse function, considered as a proxy for Tatum positions, to define candidate anchors, candidate positives (located at a distance of a power of two from the anchor) and negatives (remaining time positions). We show that a model pre-trained using this approach on the unlabeled FMA, MTT and MTG-Jamendo datasets can successfully be fine-tuned in the few-shot regime, i.e. with just a few annotated examples to get a competitive beat-tracking performance.

A Contrastive Self-Supervised Learning scheme for beat tracking amenable to few-shot learning

TL;DR

Abstract

A Contrastive Self-Supervised Learning scheme for beat tracking amenable to few-shot learning

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)