Directional testing for one-way MANOVA in divergent dimensions
Caizhu Huang, Claudia Di Caterina, Nicola Sartori
TL;DR
This work introduces an exact directional test for the equality of g normal mean vectors under a common covariance in high-dimensional settings, requiring $\sum_{i=1}^g n_i \ge p+g+1$. The method delivers an exact Uniform$(0,1)$ directional $p$-value when covariances are identical, and reduces to Hotelling's $T^2$ for $g=2$; in the more general Behrens-Fisher-like scenario with unequal covariances, the directional test remains competitive and often superior to standard likelihood-based approaches in moderate dimensions. Across extensive simulations, the directional test exhibits robust size control, outperforming LRT, Bartlett-corrected tests, and Skovgaard adjustments in many high-dimensional settings, and showing reasonable resilience to misspecification. Real-data applications in sports physiology and genomics illustrate the method's practical impact for high-dimensional MANOVA problems, with implications for better calibrated inference in biomedical research.
Abstract
Testing the equality of mean vectors across $g$ different groups plays an important role in many scientific fields. In regular frameworks, likelihood-based statistics under the normality assumption offer a general solution to this task. However, the accuracy of standard asymptotic results is not reliable when the dimension $p$ of the data is large relative to the sample size $n_i$ of each group. We propose here an exact directional test for the equality of $g$ normal mean vectors with identical unknown covariance matrix in a high dimensional setting, provided that $\sum_{i=1}^g n_i \ge p+g+1$. In the case of two groups ($g=2$), the directional test coincides with the Hotelling's $T^2$ test. In the more general situation where the $g$ independent groups may have different unknown covariance matrices, although exactness does not hold, simulation studies show that the directional test is more accurate than most commonly used likelihood{-}based solutions, at least in a moderate dimensional setting in which $p=O(n_i^τ)$, $τ\in (0,1)$. Robustness of the directional approach and its competitors under deviation from the assumption of multivariate normality is also numerically investigated. Our proposal is here applied to data on blood characteristics of male athletes and to microarray data storing gene expressions in patients with breast tumors.
