Janssen 2.0: Audio Inpainting in the Time-frequency Domain
Ondřej Mokrý, Peter Balušík, Pavel Rajmic
TL;DR
The paper tackles inpainting in the time-frequency domain by adapting the Janssen autoregressive method to spectrograms (Janssen-TF) and benchmarking it against a deep-prior TF inpainting approach (DPAI). Janssen-TF casts inpainting as an AR-constrained TF problem solved via ADMM under STFT constraints, achieving superior objective and subjective performance across gap lengths. Experiments on small and larger datasets show Janssen-TF generally outperforms DPAI, with context-aware variants providing additional gains for shorter gaps. The work highlights the value of TF-domain autoregressive priors for audio inpainting and provides open-source MATLAB implementations.
Abstract
The paper focuses on inpainting missing parts of an audio signal spectrogram, i.e., estimating the lacking time-frequency coefficients. The autoregression-based Janssen algorithm, a state-of-the-art for the time-domain audio inpainting, is adapted for the time-frequency setting. This novel method, termed Janssen-TF, is compared with the deep-prior neural network approach using both objective metrics and a subjective listening test, proving Janssen-TF to be superior in all the considered measures.
