AMT-APC: Automatic Piano Cover by Fine-Tuning an Automatic Music Transcription Model

Kazuma Komiya; Yoshihisa Fukuhara

AMT-APC: Automatic Piano Cover by Fine-Tuning an Automatic Music Transcription Model

Kazuma Komiya, Yoshihisa Fukuhara

TL;DR

The paper addresses the challenge of expressive and faithful automatic piano covers by proposing AMT-APC, a two-stage framework that fine-tunes a pre-trained Automatic Music Transcription (AMT) model for APC. It uses a base hFT-Transformer AMT model, augmented with a continuous 24-dimensional style vector to control performance style, and trains with a masked cross-entropy loss across onsets, frames, and velocities. On a dataset of 332 songs with 1,267 piano covers, AMT-APC achieves a lower $Q_{\max}$ (0.035) than baselines, and ablation shows the benefits of both AMT pre-training and the style vector. The results illustrate a strong link between AMT and APC tasks and point to future improvements via AMT architectures optimized for APC.

Abstract

There have been several studies on automatically generating piano covers, and recent advancements in deep learning have enabled the creation of more sophisticated covers. However, existing automatic piano cover models still have room for improvement in terms of expressiveness and fidelity to the original. To address these issues, we propose a learning algorithm called AMT-APC, which leverages the capabilities of automatic music transcription models. By utilizing the strengths of well-established automatic music transcription models, we aim to improve the accuracy of piano cover generation. Our experiments demonstrate that the AMT-APC model reproduces original tracks more accurately than any existing models.

AMT-APC: Automatic Piano Cover by Fine-Tuning an Automatic Music Transcription Model

TL;DR

(0.035) than baselines, and ablation shows the benefits of both AMT pre-training and the style vector. The results illustrate a strong link between AMT and APC tasks and point to future improvements via AMT architectures optimized for APC.

Abstract

Paper Structure (16 sections, 4 equations, 3 figures, 2 tables)

This paper contains 16 sections, 4 equations, 3 figures, 2 tables.

Introduction
Background
Automatic Music Transcription
Automatic Piano Cover
Methodology
Base AMT Model
Style Vector
APC Fine-Tuning
Experiment
Dataset
Training
Reproducibility of Original Song
Influence of Style Vector
Ablation Study
Discussion
...and 1 more sections

Figures (3)

Figure 1: Overview of AMT-APC. It consists of AMT pre-training and APC fine-tuning. A pre-trained AMT model is used, and fine-tuning is performed in this study.
Figure 2: Method of extracting the style vector. Probability distributions related to onset rate, velocity, and pitch are extracted and combined to form a single vector.
Figure 3: Differences in piano covers generated by the style vector. Left: Calm style. Right: Intense style.

AMT-APC: Automatic Piano Cover by Fine-Tuning an Automatic Music Transcription Model

TL;DR

Abstract

AMT-APC: Automatic Piano Cover by Fine-Tuning an Automatic Music Transcription Model

Authors

TL;DR

Abstract

Table of Contents

Figures (3)