Adaptive monotonicity testing in sublinear time
Housen Li, Zhi Liu, Axel Munk
TL;DR
This work tackles monotonicity testing for nonparametric regression under Gaussian noise by introducing FOMT, a sparse, randomly sampled collection of local tests based on local polynomial estimation that achieves minimax separation rates over Hölder classes $\Sigma(\beta,L)$ for $\beta\in(0,2]$ while delivering sublinear runtime in many scenarios. The method leverages refined variance bounds that exploit correlation between nearby local estimators and employs a spot-check style sampling to maintain computational efficiency. It further develops an adaptive variant, CALM, and an adaptive testing procedure A-FOMT that retain optimal statistical performance with essentially the same computational cost as if the smoothness were known. Extensive simulations show FOMT and A-FOMT outperform or compete with existing minimax procedures in speed, while maintaining strong detection power, validating their practical viability for large-scale data analysis.
Abstract
Modern large-scale data analysis increasingly faces the challenge of achieving computational efficiency as well as statistical accuracy, as classical statistically efficient methods often fall short in the first regard. In the context of testing monotonicity of a regression function, we propose FOMT (Fast and Optimal Monotonicity Test), a novel methodology tailored to meet these dual demands. FOMT employs a sparse collection of local tests, strategically generated at random, to detect violations of monotonicity scattered throughout the domain of the regression function. This sparsity enables significant computational efficiency, achieving sublinear runtime in most cases, and quasilinear runtime (i.e., linear up to a log factor) in the worst case. In contrast, existing statistically optimal tests typically require at least quadratic runtime. FOMT's statistical accuracy is achieved through the precise calibration of these local tests and their effective combination, ensuring both sensitivity to violations and control over false positives. More precisely, we show that FOMT separates the null and alternative hypotheses at minimax optimal rates over Hölder function classes of smoothness order in $(0,2]$. Further, when the smoothness is unknown, we introduce an adaptive version of FOMT, based on a modified Lepskii principle, which attains statistical optimality and meanwhile maintains the same computational complexity as if the intrinsic smoothness were known. Extensive simulations confirm the competitiveness and effectiveness of both FOMT and its adaptive variant.
