Table of Contents
Fetching ...

A projector-rank partition theorem for exact degrees of freedom in experimental design

Nagananda K G

Abstract

In many experimental designs -- split-plots, blocked or nested layouts, fractional factorials, and studies with missing or unequal replication -- standard ANOVA procedures no longer tell us exactly how many independent pieces of information each effect truly contributes. We provide a general degrees of freedom $(\mathrm{df})$ partition theorem that resolves this ambiguity. For $N$ observations, we show that the total information in the data (i.e., $N-1$ $\mathrm{df}$) can be split exactly across experimental effects and randomization strata by projecting the data onto each stratum and counting the $\mathrm{df}$ each effect contributes there. This yields integer $\mathrm{df}$ -- not approximations -- for any mix of fixed and random effects, blocking structures, fractionation, or imbalance. This result yields closed-form $\mathrm{df}$ tables for unbalanced split-plot, row-column, lattice, and crossed-nested designs. We introduce practical diagnostics -- the $\mathrm{df}$-retention ratio $ρ$, df deficiency $δ$, and variance-inflation index $α$ -- that measure exactly how many $\mathrm{df}$ an effect retains under blocking or fractionation and the resulting loss of precision, thereby extending Box-Hunter's resolution idea to multi-stratum and incomplete designs. Classical results emerge as corollaries: Cochran's one-stratum identity; Yates's split-plot $\mathrm{df}$; resolution-$R$ identified when an effect retains no $\mathrm{df}$. Empirical studies on split-plot and nested designs, a blocked fractional-factorial design-selection experiment, and timing benchmarks show that our approach delivers calibrated error rates, recovers information to raise power by up to 60% without additional runs, and is orders of magnitude faster than bootstrap-based $\mathrm{df}$ approximations.

A projector-rank partition theorem for exact degrees of freedom in experimental design

Abstract

In many experimental designs -- split-plots, blocked or nested layouts, fractional factorials, and studies with missing or unequal replication -- standard ANOVA procedures no longer tell us exactly how many independent pieces of information each effect truly contributes. We provide a general degrees of freedom partition theorem that resolves this ambiguity. For observations, we show that the total information in the data (i.e., ) can be split exactly across experimental effects and randomization strata by projecting the data onto each stratum and counting the each effect contributes there. This yields integer -- not approximations -- for any mix of fixed and random effects, blocking structures, fractionation, or imbalance. This result yields closed-form tables for unbalanced split-plot, row-column, lattice, and crossed-nested designs. We introduce practical diagnostics -- the -retention ratio , df deficiency , and variance-inflation index -- that measure exactly how many an effect retains under blocking or fractionation and the resulting loss of precision, thereby extending Box-Hunter's resolution idea to multi-stratum and incomplete designs. Classical results emerge as corollaries: Cochran's one-stratum identity; Yates's split-plot ; resolution- identified when an effect retains no . Empirical studies on split-plot and nested designs, a blocked fractional-factorial design-selection experiment, and timing benchmarks show that our approach delivers calibrated error rates, recovers information to raise power by up to 60% without additional runs, and is orders of magnitude faster than bootstrap-based approximations.

Paper Structure

This paper contains 22 sections, 5 theorems, 8 equations, 10 tables.

Key Result

Theorem 2.1

Let $\mathbf{X}_{\mathrm{fix}}=\bigoplus\limits_{\tau(\overline E)=\text{fixed}}\! \mathbf{X}_{\overline E}$ and $\mathbf{Z}=\bigoplus\limits_{s\in\mathcal{S}}\mathbf{Z}_s$ be the fixed- and random-effects design matrices in a linear-mixed model whose treatment structure may be aliased and unbalance both of which are orthogonal direct sums. Therefore, Moreover, $\dim\left(\mathcal{C}_{\overline E

Theorems & Definitions (10)

  • Theorem 2.1
  • proof
  • Proposition 1
  • proof
  • Corollary 1
  • proof
  • Corollary 2
  • proof
  • Corollary 3
  • proof