Subband Splitting: Simple, Efficient and Effective Technique for Solving Block Permutation Problem in Determined Blind Source Separation
Kazuki Matsumoto, Kohei Yatabe
TL;DR
This paper tackles the block permutation problem in determined blind source separation by introducing subband splitting (SS), which partitions the frequency axis into overlapping subbands and applies existing BSS methods sequentially. The approach preserves permutation alignment across subbands by propagating initialization through overlaps and employs shift-based subband generation controlled by parameters $(\theta_W, \theta_\Delta)$, without modifying the underlying BSS algorithms. When combined with IVA and ILRMA (as SS-IVA and SS-ILRMA), the method significantly improves separation performance and convergence speed, achieving near-oracle results for SS-ILRMA on speech and strong gains on music signals. The findings suggest substantial practical impact for robust, efficient BSS in reverberant environments, with potential for real-time deployment and integration with advanced BSS models.
Abstract
Solving the permutation problem is essential for determined blind source separation (BSS). Existing methods, such as independent vector analysis (IVA) and independent low-rank matrix analysis (ILRMA), tackle the permutation problem by modeling the co-occurrence of the frequency components of source signals. One of the remaining challenges in these methods is the block permutation problem, which may cause severe performance degradation. In this paper, we propose a simple and effective technique for solving the block permutation problem. The proposed technique splits the entire frequency bands into several overlapping subbands and sequentially applies BSS methods (e.g., IVA, ILRMA, or any other method) to each subband. Since the splitting reduces the size of the problem, the BSS methods can effectively work in each subband. Then, the permutations among the subbands are aligned by using the separation result in one subband as the initial values for the other subbands. Additionally, we propose SS-IVA and SS-ILRMA by combining subband splitting (SS) with IVA and ILRMA. Experimental results demonstrated that our technique remarkably improves the separation performance without increasing computational cost. In particular, our SS-ILRMA achieved the separation performance comparable to the oracle method (frequency-domain independent component analysis with the ideal permutation solver). Moreover, SS-ILRMA converged faster than conventional IVA and ILRMA.
