The Sound Demixing Challenge 2023 $\unicode{x2013}$ Cinematic Demixing Track
Stefan Uhlich, Giorgio Fabbro, Masato Hirano, Shusuke Takahashi, Gordon Wichern, Jonathan Le Roux, Dipam Chakraborty, Sharada Mohanty, Kai Li, Yi Luo, Jianwei Yu, Rongzhi Gu, Roman Solovyev, Alexander Stempkovskiy, Tatiana Habruseva, Mikhail Sukhovei, Yuki Mitsufuji
TL;DR
The paper introduces the Cinematic Demixing Track of SDX'23 and its hidden test set CDXDB23 to benchmark dialogue, sound effects, and music separation in real movie audio. It shows that data realism and targeted preprocessing (e.g., vocal-removal, dialogue-first cascades) significantly boost SDR, with up to 5.7 dB gains when extra data are allowed. Key contributions include the CDXDB23 dataset, the dual-leaderboard framework separating data- and algorithm-driven gains, and an analysis of distribution mismatches between synthetic DnR and real cinematic audio, along with strategies to mitigate them. The findings highlight practical pathways to robust cinematic separation, such as data augmentation, model ensembles, and normalization that align synthetic training data with real-world film soundtracks.
Abstract
This paper summarizes the cinematic demixing (CDX) track of the Sound Demixing Challenge 2023 (SDX'23). We provide a comprehensive summary of the challenge setup, detailing the structure of the competition and the datasets used. Especially, we detail CDXDB23, a new hidden dataset constructed from real movies that was used to rank the submissions. The paper also offers insights into the most successful approaches employed by participants. Compared to the cocktail-fork baseline, the best-performing system trained exclusively on the simulated Divide and Remaster (DnR) dataset achieved an improvement of 1.8 dB in SDR, whereas the top-performing system on the open leaderboard, where any data could be used for training, saw a significant improvement of 5.7 dB. A significant source of this improvement was making the simulated data better match real cinematic audio, which we further investigate in detail.
