A Dataset and Baselines for Measuring and Predicting the Music Piece Memorability
Li-Yang Tseng, Tzu-Ling Lin, Hong-Han Shuai, Jen-Wei Huang, Wen-Whei Chang
TL;DR
This work defines music memorability regression (MMR) and introduces the YouTube Music Memorability (YTMM) dataset collected via a novel memory-game procedure to obtain reliable memorability scores. It evaluates both handcrafted, interpretable features and end-to-end mel-spectrogram–based approaches, with a strong emphasis on interpretability using SHAP. The experiments show that explainable handcrafted features (EHC) paired with SVR/MLP provide the best correlations under limited data, while self-supervised transformers benefit from pitch-shift augmentation. The dataset and baselines enable systematic study of which musical attributes drive memorability, with potential applications in recommendation and style transfer, and the work highlights the importance of data efficiency and interpretability in MUSIC memorability research.
Abstract
Nowadays, humans are constantly exposed to music, whether through voluntary streaming services or incidental encounters during commercial breaks. Despite the abundance of music, certain pieces remain more memorable and often gain greater popularity. Inspired by this phenomenon, we focus on measuring and predicting music memorability. To achieve this, we collect a new music piece dataset with reliable memorability labels using a novel interactive experimental procedure. We then train baselines to predict and analyze music memorability, leveraging both interpretable features and audio mel-spectrograms as inputs. To the best of our knowledge, we are the first to explore music memorability using data-driven deep learning-based methods. Through a series of experiments and ablation studies, we demonstrate that while there is room for improvement, predicting music memorability with limited data is possible. Certain intrinsic elements, such as higher valence, arousal, and faster tempo, contribute to memorable music. As prediction techniques continue to evolve, real-life applications like music recommendation systems and music style transfer will undoubtedly benefit from this new area of research.
