Musical Phrase Segmentation via Grammatical Induction
Reed Perkins, Dan Ventura
TL;DR
The paper investigates musical phrase segmentation via grammatical induction, evaluating five grammar-learning algorithms across three datasets using multiple viewpoint representations of music. It demonstrates that the offline LongestFirst method, when paired with duration-focused viewpoint encodings, delivers the strongest phrase-detection performance and reveals hierarchical structures in learned grammars. The work provides a principled, data-driven approach to segmenting symbolic musical sequences and offers insights into how rhythmic and transpositional invariances affect segmentation quality, with implications for data-driven procedural music generation. Overall, it advances understanding of how context-free grammars can model musical phrases and how viewpoint design shapes segmentation outcomes.
Abstract
We outline a solution to the challenge of musical phrase segmentation that uses grammatical induction algorithms, a class of algorithms which infer a context-free grammar from an input sequence. We analyze the performance of five grammatical induction algorithms on three datasets using various musical viewpoint combinations. Our experiments show that the LONGESTFIRST algorithm achieves the best F1 scores across all three datasets and that input encodings that include the duration viewpoint result in the best performance.
