Single-Channel Distance-Based Source Separation for Mobile GPU in Outdoor and Indoor Environments

Hanbin Bae; Byungjun Kang; Jiwon Kim; Jaeyong Hwang; Hosang Sung; Hoon-Young Cho

Single-Channel Distance-Based Source Separation for Mobile GPU in Outdoor and Indoor Environments

Hanbin Bae, Byungjun Kang, Jiwon Kim, Jaeyong Hwang, Hosang Sung, Hoon-Young Cho

TL;DR

The paper tackles single-channel distance-based source separation (DSS) in outdoor and indoor environments and proposes a mobile-friendly architecture that leverages TS-Conformer blocks, linear relation-aware self-attention (RSA), and the TensorFlow Lite GPU delegate to achieve energy-efficient, real-time inference. The signal model partitions the mixture into near and far sources using impulse responses simulated by Pyroomacoustics, and the method is trained on mixed outdoor–indoor data, including challenging outdoor noise. A Baseline CMGAN is extended with a linear RSA to reduce quadratic complexity from $O(N^2 d)$ to $O(N d^2)$ while maintaining separation quality, and mobile-GPU optimizations enable practical on-device deployment. Experiments on simulated and real outdoor data demonstrate substantial energy and speed gains on mobile hardware, with outdoor training yielding improved performance over indoor-only training.

Abstract

This study emphasizes the significance of exploring distance-based source separation (DSS) in outdoor environments. Unlike existing studies that primarily focus on indoor settings, the proposed model is designed to capture the unique characteristics of outdoor audio sources. It incorporates advanced techniques, including a two-stage conformer block, a linear relation-aware self-attention (RSA), and a TensorFlow Lite GPU delegate. While the linear RSA may not capture physical cues as explicitly as the quadratic RSA, the linear RSA enhances the model's context awareness, leading to improved performance on the DSS that requires an understanding of physical cues in outdoor and indoor environments. The experimental results demonstrated that the proposed model overcomes the limitations of existing approaches and considerably enhances energy efficiency and real-time inference speed on mobile devices.

Single-Channel Distance-Based Source Separation for Mobile GPU in Outdoor and Indoor Environments

TL;DR

while maintaining separation quality, and mobile-GPU optimizations enable practical on-device deployment. Experiments on simulated and real outdoor data demonstrate substantial energy and speed gains on mobile hardware, with outdoor training yielding improved performance over indoor-only training.

Abstract

Paper Structure (15 sections, 3 equations, 3 figures, 3 tables)

This paper contains 15 sections, 3 equations, 3 figures, 3 tables.

Introduction
Problem Formulation
Proposed Architecture
Baseline Architecture
Linearization of relation-aware self-attention
Mobile GPU Utilization
Benchmark Test of Architectures on Mobile Device
Experiments
Simulation outdoor and indoor environments
Datasets
Training setup
Evaluation of the basic performances of DSS models in indoor environments
Evaluation in outdoor environments
Limitation and future work
Conclusion

Figures (3)

Figure 1: Visual representation of DSS with background noise.
Figure 2: Schematic diagrams of $\textrm{M}_{\textrm{Baseline}}$ architecture.
Figure 3: Results for a real outdoor sample.

Single-Channel Distance-Based Source Separation for Mobile GPU in Outdoor and Indoor Environments

TL;DR

Abstract

Single-Channel Distance-Based Source Separation for Mobile GPU in Outdoor and Indoor Environments

Authors

TL;DR

Abstract

Table of Contents

Figures (3)