Positive and Risky Message Assessment for Music Products
Yigeng Zhang, Mahsa Shafaei, Fabio A. González, Thamar Solorio
TL;DR
This work defines a novel task for assessing both positive and potentially risky messages in music lyrics and builds a multi-task benchmark across five aspects with ordinal severity. It introduces an emotion-guided twin encoder and an aspect-aware attention mechanism, augmented with three ordinality-enforcement strategies, to jointly predict all five aspects. Empirical results show state-of-the-art performance on the benchmark, with thorough analyses including ablation, saliency, error analysis, case studies, and a case study using an LLM as a surrogate evaluator. The work provides public code and data procedures, highlighting practical implications for music content advisories, and discusses ethical and societal considerations for automated content assessment.
Abstract
In this work, we introduce a pioneering research challenge: evaluating positive and potentially harmful messages within music products. We initiate by setting a multi-faceted, multi-task benchmark for music content assessment. Subsequently, we introduce an efficient multi-task predictive model fortified with ordinality-enforcement to address this challenge. Our findings reveal that the proposed method not only significantly outperforms robust task-specific alternatives but also possesses the capability to assess multiple aspects simultaneously. Furthermore, through detailed case studies, where we employed Large Language Models (LLMs) as surrogates for content assessment, we provide valuable insights to inform and guide future research on this topic. The code for dataset creation and model implementation is publicly available at https://github.com/RiTUAL-UH/music-message-assessment.
