In-depth analysis of music structure as a text network
Ping-Rui Tsai, Yen-Ting Chou, Nathan-Christopher Wang, Hui-Ling Chen, Hong-Yue Huang, Zih-Jia Luo, Tzay-Ming Hong
TL;DR
This work treats music as a natural-language-like system and builds an Essential Element Network (EEN) by decomposing musical content into Essential Elements (EE) and defining CC-based words, enforcing Zipf-like statistics to compare CPP periods. It introduces a linking rule with $I$, a threshold $I_m$, and a CC-based word construction, then maps audio to note-time-space and optimizes weights $(w_1,...,w_4)$ from 4032 configurations to fit Zipf's law and maximize word-type diversity. Through 2D EEN representations and CNN-based classification, the paper shows period-specific weight profiles (e.g., high $w_3$ for Baroque) and analyzes CC trends, histograms, and robustness to word removal, achieving music-vs-non-music discrimination. The framework links music structure to natural language processing and knowledge graphs, enabling quantitative analysis, potential music generation via GANs, and cross-disciplinary insights in anthropology and cognition.
Abstract
Music, enchanting and poetic, permeates every corner of human civilization. Although music is not unfamiliar to people, our understanding of its essence remains limited, and there is still no universally accepted scientific description. This is primarily due to music being regarded as a product of both reason and emotion, making it difficult to define. In this article, we focus on the fundamental elements of music and construct an evolutionary network from the perspective of music as a natural language, aligning with the statistical characteristics of texts. Through this approach, we aim to comprehend the structural differences in music across different periods, enabling a more scientific exploration of music. Relying on the advantages of structuralism, we can concentrate on the relationships and order between the physical elements of music, rather than getting entangled in the blurred boundaries of science and philosophy. The scientific framework we present not only conforms to past conclusions in music, but also serves as a bridge that connects music to natural language processing and knowledge graphs.
