Table of Contents
Fetching ...

Diff-MSTC: A Mixing Style Transfer Prototype for Cubase

Soumya Sai Vanka, Lennart Hannink, Jean-Baptiste Rolland, George Fazekas

TL;DR

The Diff-MSTC prototype, which integrates the Diff-MST model into Steinberg's digital audio workstation (DAW), Cubase, is a first-of-its-kind prototype integrated into a DAW and lets users input context through a reference song, followed by fine-tuning of audio effects in a traditional manner.

Abstract

In our demo, participants are invited to explore the Diff-MSTC prototype, which integrates the Diff-MST model into Steinberg's digital audio workstation (DAW), Cubase. Diff-MST, a deep learning model for mixing style transfer, forecasts mixing console parameters for tracks using a reference song. The system processes up to 20 raw tracks along with a reference song to predict mixing console parameters that can be used to create an initial mix. Users have the option to manually adjust these parameters further for greater control. In contrast to earlier deep learning systems that are limited to research ideas, Diff-MSTC is a first-of-its-kind prototype integrated into a DAW. This integration facilitates mixing decisions on multitracks and lets users input context through a reference song, followed by fine-tuning of audio effects in a traditional manner.

Diff-MSTC: A Mixing Style Transfer Prototype for Cubase

TL;DR

The Diff-MSTC prototype, which integrates the Diff-MST model into Steinberg's digital audio workstation (DAW), Cubase, is a first-of-its-kind prototype integrated into a DAW and lets users input context through a reference song, followed by fine-tuning of audio effects in a traditional manner.

Abstract

In our demo, participants are invited to explore the Diff-MSTC prototype, which integrates the Diff-MST model into Steinberg's digital audio workstation (DAW), Cubase. Diff-MST, a deep learning model for mixing style transfer, forecasts mixing console parameters for tracks using a reference song. The system processes up to 20 raw tracks along with a reference song to predict mixing console parameters that can be used to create an initial mix. Users have the option to manually adjust these parameters further for greater control. In contrast to earlier deep learning systems that are limited to research ideas, Diff-MSTC is a first-of-its-kind prototype integrated into a DAW. This integration facilitates mixing decisions on multitracks and lets users input context through a reference song, followed by fine-tuning of audio effects in a traditional manner.

Paper Structure

This paper contains 6 sections, 3 figures.

Figures (3)

  • Figure 1: A High-level Overview of Diff-MST vanka2024diff
  • Figure 2: Diff-MSTC in Cubase
  • Figure :