Fourier minimization and imputation of time series
Will Burstein, Alex Iosevich, Azita Mayeli, Hari Sarang Nathan
TL;DR
This work studies imputing missing values in time series by formulating imputation as a convex L1-minimization problem in the Fourier domain: given f: $Z_N\to C$ with missing indices M, find g that matches f on M^c while minimizing $||\widehat{g}||_{L^1}$. The approach extends Logan's phenomenon from exact recovery to robust imputation under concentration and noise, providing quantitative bounds (Theorems 'quantitativeLogan', 'looseLogan', 'quantitativerandomLogan', 'quantitativerandomnoisyLogan') that hold under both deterministic and random missingness, including random perturbations and noisy observations. The analysis leverages Bourgain-type results and concentration inequalities (e.g., Talagrand) to establish high-probability recovery for random missingness, complemented by an appendix revisiting the classic DS Logan89 proof. Numerical experiments on a real-world dataset (Australian beer production) show that L1-minimization-imputed values yield lower MAE compared to linear interpolation and ANN baselines, demonstrating practical robustness. Overall, the paper provides a convex, Fourier-domain imputation framework with provable guarantees and demonstrates its effectiveness for time-series data with incomplete observations.
Abstract
One of the most common procedures in modern data analytics is filling in missing values in times series. For a variety of reasons, the data provided by clients to obtain a forecast, or other forms of data analysis, may have missing values, and those values need to be filled in before the data set can be properly analyzed. Many freely available forecasting software packages, such as the sktime library, have built-in mechanisms for filling in missing values. The purpose of this paper is to adapt the classical $L^1$ minimization method for signal recovery to the filling of missing values in times. The theoretical justifications of these methods leverage results by Bourgain (\cite{Bourgain89}), Talagrand (\cite{Talagrand98}), the second and the third listed authors (\cite{IM24}), and the result by the second listed author, Kashin, Limonova and the third listed author (\cite{IKLM24}). Brief numerical tests for these algorithms are given but more extensive will be discussed in a companion paper.
