AutoSchA: Automatic Hierarchical Music Representations via Multi-Relational Node Isolation

Stephen Ni-Hahn; Rico Zhu; Jerry Yin; Yue Jiang; Cynthia Rudin; Simon Mak

TL;DR

This work reframes Schenkerian hierarchical analysis as a graph pooling problem on symbolic music and introduces AutoSchA, a multi-relational graph neural network (GNN) with a novel node-isolation pooling mechanism. The approach combines directed multi-relational convolution, adaptive pooling losses, and two global feature strategies (sequential and subspace merging) to infer depth-wise hierarchical structures and voice assignments. Empirical results show AutoSchA performing comparably to human experts on Baroque fugue subjects, with ablations indicating that rhythmic features matter more than pitch features. The framework opens avenues for AI-assisted music theory, music generation, and broader hierarchical music analysis using graph-based representations.

Abstract

Hierarchical representations provide powerful and principled approaches for analyzing many musical genres. Such representations have been broadly studied in music theory, for instance via Schenkerian analysis (SchA). Hierarchical music analyses, however, are highly cost-intensive; the analysis of a single piece of music requires a great deal of time and effort from trained experts. The representation of hierarchical analyses in a computer-readable format is a further challenge. Given recent developments in hierarchical deep learning and increasing quantities of computer-readable data, there is great promise in extending such work for an automatic hierarchical representation framework. This paper thus introduces a novel approach, AutoSchA, which extends recent developments in graph neural networks (GNNs) for hierarchical music analysis. AutoSchA features three key contributions: 1) a new graph learning framework for hierarchical music representation, 2) a new graph pooling mechanism based on node isolation that directly optimizes learned pooling assignments, and 3) a state-of-the-art architecture that integrates such developments for automatic hierarchical music analysis. We show, in a suite of experiments, that AutoSchA performs comparably to human experts when analyzing Baroque fugue subjects.
