Data-driven Smooth Tests for Normality in ANOVA When the Number of Groups is Large

Peiwen Jia; Xiaojun Song; Haoyu Wei

Data-driven Smooth Tests for Normality in ANOVA When the Number of Groups is Large

Peiwen Jia, Xiaojun Song, Haoyu Wei

Abstract

The normality assumption for random errors is fundamental in the analysis of variance (ANOVA) models. However, it is rarely subjected to formal testing in practice, and theoretically justified procedures are largely unavailable, especially when the number of groups diverges. In this paper, we develop Neyman's smooth tests for assessing normality in a broad class of ANOVA models, allowing the number of groups to diverge. The proposed test statistics are constructed via the Gaussian probability integral transformation of ANOVA residuals. We show that using residuals induces non-negligible parameter estimation effects, whose structure depends on the underlying ANOVA model and plays a crucial role in shaping the form of the test statistics and their asymptotic behavior. Under the null hypothesis of normality, the resulting statistics follow an asymptotic Chi-square distribution, with degrees of freedom determined by the order of the smooth test (i.e., the number of components included in the smooth test). We further propose a modified Schwarz's selection rule to automatically determine the order, thereby yielding fully data-driven smooth tests that require no additional tuning parameters. Simulation studies and a real-data example indicate that the proposed tests perform well in practice and are readily applicable.

Data-driven Smooth Tests for Normality in ANOVA When the Number of Groups is Large

Abstract

Data-driven Smooth Tests for Normality in ANOVA When the Number of Groups is Large

Abstract

Paper Structure

Table of Contents

Key Result

Figures (5)

Theorems & Definitions (14)