Table of Contents
Fetching ...

Time-Frequency Filtering Meets Graph Clustering

Marcelo A. Colominas, Stefan Steinerberger, Hau-Tieng Wu

TL;DR

The paper reframes the problem of identifying signal components in a time-frequency representation as a graph clustering task by mapping TF-plane pixels to graph vertices and connecting nearby, energy-rich pairs. It introduces a flexible, nonparametric TF filtering pipeline where components are recovered from masks derived either by connected components or graph clustering, with noise-informed thresholds $\gamma$ and $\tau$ guiding edge formation via $|A_{ij}|\cdot|A_{k\ell}| \ge \tau$. The approach is demonstrated on simulated signals with varying SNR and on real data (bat calls and fetal ECG), showing robustness and the ability to extract multiple components simultaneously without the error accumulation of peeling. The framework generalizes beyond STFT to other TF representations and offers a pathway to improved, one-shot TF-domain extraction across diverse signal modalities.

Abstract

We show that the problem of identifying different signal components from a time-frequency representation can be equivalently phrased as a graph clustering problem: given a graph $G=(V,E)$ one aims to identify `clusters', subgraphs that are strongly connected and have relatively few connections between them. The graph clustering problem is well studied, we show how these ideas can suggest (many) new ways to identify signal components. Numerical experiments illustrate the ideas.

Time-Frequency Filtering Meets Graph Clustering

TL;DR

The paper reframes the problem of identifying signal components in a time-frequency representation as a graph clustering task by mapping TF-plane pixels to graph vertices and connecting nearby, energy-rich pairs. It introduces a flexible, nonparametric TF filtering pipeline where components are recovered from masks derived either by connected components or graph clustering, with noise-informed thresholds and guiding edge formation via . The approach is demonstrated on simulated signals with varying SNR and on real data (bat calls and fetal ECG), showing robustness and the ability to extract multiple components simultaneously without the error accumulation of peeling. The framework generalizes beyond STFT to other TF representations and offers a pathway to improved, one-shot TF-domain extraction across diverse signal modalities.

Abstract

We show that the problem of identifying different signal components from a time-frequency representation can be equivalently phrased as a graph clustering problem: given a graph one aims to identify `clusters', subgraphs that are strongly connected and have relatively few connections between them. The graph clustering problem is well studied, we show how these ideas can suggest (many) new ways to identify signal components. Numerical experiments illustrate the ideas.

Paper Structure

This paper contains 16 sections, 7 equations, 6 figures.

Figures (6)

  • Figure 1: Left: a (very simple) discrete time-frequency representation. Right: pixels are turned into vertices of a graph. We connect two vertices with an edge if the corresponding two pixels are 'close' in the time-frequency plane and have large TF energy.
  • Figure 2: Impulse and chirps. Left: modulus of the STFT with an SNR of 20 dB. Right: selected masks.
  • Figure 3: Linear chirp and Hermite function. Here we show the median over 30 realizations. Top left: modulus of the STFT for an SNR of 40 dB. Top middle: median errors for the linear chirp estimation. Top right: median errors for the Hermite function estimation. Bottom left: modulus of SST2 for an SNR of 40 dB. Bottom middle: boxplots for the errors when estimating the linear chirp. Bottom right: boxplots for the errors when estimating the Hermite function.
  • Figure 4: Sinusoidal and linear chirps. Here we show the median over 30 realizations. Top left: modulus of the STFT for an SNR of 40 dB. Top middle: median errors for the sinusoidal chirp estimation. Top right: median errors for the linear chirp estimation. Bottom left: modulus of SST2 for an SNR of 40 dB. Bottom middle: boxplots for the errors when estimating the sinusoidal chirp. Bottom right: boxplots for the errors when estimating the linear chirp.
  • Figure 5: Bat signal. Left: modulus of the STFT. Middle: selected masks. Right: extracted modes (shifted vertically for visibility purposes).
  • ...and 1 more figures