Deconstructing Jazz Piano Style Using Machine Learning
Huw Cheston, Reuben Bance, Peter M. C. Harrison
TL;DR
This work investigates how machine learning can illuminate jazz piano style by identifying performers and revealing the musical features that distinguish their playing. It compares handcrafted feature pipelines with representation-learning CNNs and introduces a novel four-domain, multi-input architecture that separately encodes melody, harmony, rhythm, and dynamics before integration. The study demonstrates near-state-of-the-art performer identification accuracy, strong interpretability for handcrafted and multi-input models, and informative visualization via LIME and concept analyses tied to jazz theory. Collectively, the methods offer scalable, explainable insights into individual artists’ stylistic signatures with practical implications for pedagogy, analysis, and cross-genre comparisons.
Abstract
Artistic style has been studied for centuries, and recent advances in machine learning create new possibilities for understanding it computationally. However, ensuring that machine-learning models produce insights aligned with the interests of practitioners and critics remains a significant challenge. Here, we focus on musical style, which benefits from a rich theoretical and mathematical analysis tradition. We train a variety of supervised-learning models to identify 20 iconic jazz musicians across a carefully curated dataset of 84 hours of recordings, and interpret their decision-making processes. Our models include a novel multi-input architecture that enables four musical domains (melody, harmony, rhythm, and dynamics) to be analysed separately. These models enable us to address fundamental questions in music theory and also advance the state-of-the-art in music performer identification (94% accuracy across 20 classes). We release open-source implementations of our models and an accompanying web application for exploring musical styles.
