Discrete Chi-Square Method beats Discrete Fourier Transform and other similar parametric time series analysis methods
Lauri Jetsu
TL;DR
The paper presents Discrete Chi-square Method (DCM) as a well-posed computational approach to nonlinear time-series analysis with unknown trends and multiple signals, contrasting it with the traditional DFT. By recasting g(t) as a sum of a harmonic component and an unknown trend, and by fixing frequencies to render the rest linear, DCM performs a massive yet structured set of linear LS fits and uses Fisher-test and Predictivity-test to select the best model. Across seven simulated data sets that challenge DFT (short windows, trends, near-equal or non-sinusoidal signals, uneven spacing), DCM reliably detects the true signals and trends while DFT often fails or produces biased periods and unstable residuals. The work demonstrates that, with sufficient data size and accuracy, DCM can forecast and postdict phenomena, providing a practical framework for complex time-series analysis beyond traditional spectral methods. It highlights the method’s potential for broad application in forecasting, non-stationary contexts, and unevenly sampled data, while acknowledging computational demands and the need for careful model selection and validation.
Abstract
We compare two time series analysis methods, the Discrete Fourier Transform (DFT) and our Discrete Chi-square Method (DCM). DCM is designed for detecting signal(-s) superimposed on an unknown trend. The analytical solution for the non-linear DCM model is an ill-posed problem. We present a computational statistical well-posed solution for this problem. The backbone of DCM is the Gauss-Markov theorem that the least squares fit is the best unbiased estimator for linear regression models. DCM can not fail because this simple time series analysis method computes a massive number of linear least squares fits. Hence, the data spacing, even or uneven, is irrelevant. We use the Fisher-test to identify the best DCM model from all alternative tested DCM models. This best model must also pass our Predictivity-test. Our analyses of simulated complex data sets expose the weaknesses of DFT and the efficiency of DCM. The DFT and DCM are frequency-domain parametric methods. There are many other similar parametric methods. Just like DFT, those other methods suffer from their own particular application limitations. The list of those limitations is long. DCM suffers from none of those limitations. The performance of DCM depends only on the data set size and accuracy. DCM is an ideal forecasting method because the data set time span $(ΔT)$ is irrelevant. It does not matter how long $(ΔT)$ and/or complex the data set is because DCM will inevitably detect the signal(-s) and the trend when the data set size $(n)$ and/or accuracy $(σ)$ become adequate.
