The Canonical Decomposition of Factor Models: Weak Factors are Everywhere

Philipp Gersing; Matteo Barigozzi; Christoph Rust; Manfred Deistler

The Canonical Decomposition of Factor Models: Weak Factors are Everywhere

Philipp Gersing, Matteo Barigozzi, Christoph Rust, Manfred Deistler

TL;DR

The paper addresses how static and dynamic factor models relate in high-dimensional time series, introducing a canonical threefold decomposition of $y_{it}$ into a static common component $C_{it}$, a weak common component $e_{it}^hi$, and a dynamic idiosyncratic term $_{it}$, with $e_{it}^hi=hi_{it}-C_{it}$. It reframes the problem in terms of dynamic and static aggregation spaces and proves that the static aggregation space is contained in the dynamic one, enabling a unified representation where weak factors live in the dynamically common space. The authors develop a practical estimation procedure combining dynamic PCA (or spectral-density-based dynamic PCA) to estimate $chi$, followed by static PCA on $chi$ to obtain $C$, with the residual forming the weak component; they establish consistency with explicit rates and illustrate the approach via simulations and a US macroeconomic data application (FRED-MD). Empirically, the weak component explains a meaningful share of variation across many series and improves forecasting relative to the conventional static diffusion-index approach, highlighting the importance of accounting for weak, non-pervasive factors in high-dimensional panels. Overall, the work extends the traditional factor-model toolkit by clarifying the complex link between static and dynamic representations and providing actionable estimation tools for the canonical decomposition and its components.

Abstract

There are two approaches to time series approximate factor models: the static factor model, where the factors are loaded contemporaneously by the common component, and the Generalised Dynamic Factor Model, where the factors are loaded with lags. In this paper we derive a canonical decomposition which nests both models by introducing the weak common component which is the difference between the dynamic- and the static common component. Such component is driven by potentially infinitely many non-pervasive weak factors which live in the dynamically common space (not to be confused with rate-weak factors, being pervasive but associated with a slower rate). Our result shows that the relation between the two approaches is far more rich and complex than what usually assumed. We exemplify why the weak common component shall not be neglected by means of theoretical and empirical examples. Furthermore, we propose a simple estimation procedure for the canonical decomposition. Our empirical estimates on US macroeconomic data reveal that the weak common component can account for a large part of the variation of individual variables. Furthermore in a pseudo real-time forecasting evaluation for industrial production and inflation, we show that gains can be obtained from considering the dynamic approach over the static approach.

The Canonical Decomposition of Factor Models: Weak Factors are Everywhere

TL;DR

The paper addresses how static and dynamic factor models relate in high-dimensional time series, introducing a canonical threefold decomposition of

into a static common component

, a weak common component

, and a dynamic idiosyncratic term

, with

. It reframes the problem in terms of dynamic and static aggregation spaces and proves that the static aggregation space is contained in the dynamic one, enabling a unified representation where weak factors live in the dynamically common space. The authors develop a practical estimation procedure combining dynamic PCA (or spectral-density-based dynamic PCA) to estimate

, followed by static PCA on

to obtain

, with the residual forming the weak component; they establish consistency with explicit rates and illustrate the approach via simulations and a US macroeconomic data application (FRED-MD). Empirically, the weak component explains a meaningful share of variation across many series and improves forecasting relative to the conventional static diffusion-index approach, highlighting the importance of accounting for weak, non-pervasive factors in high-dimensional panels. Overall, the work extends the traditional factor-model toolkit by clarifying the complex link between static and dynamic representations and providing actionable estimation tools for the canonical decomposition and its components.

Abstract

Paper Structure (32 sections, 34 theorems, 63 equations, 5 figures, 10 tables)

This paper contains 32 sections, 34 theorems, 63 equations, 5 figures, 10 tables.

Introduction
Overview of main results
The stacking approach
A Motivating Example
Representation Theory
The Dynamic and the Static Approach in terms of Aggregation
The Canonical Decomposition of Approximate Factor Models
Weak, Non-Pervasive Factors in the Dynamically Common Space
Estimation
Estimation in practice
Part I.a - Estimation of the dynamic common component as in forni2000generalized
Part I.b - Estimation of the dynamic common component as in forni2017dynamic
Part II - Estimation of the static common component
Consistency
Simulation Experiments
...and 17 more sections

Key Result

Theorem 1

Let AA: stat hold for $(y_{it})$. Then, the following holds.

Figures (5)

Figure 1: DGP1. Mean Squared Error of $\widehat{\chi}_{1t}$ over 500 replications. spcar: estimation with static principal components with r$=1,2,3,5,9$, dpca: estimation by dynamic principal components with $q = 1$, dlreg: distributed lag regression computed by regression on the first principal component and its first lag.
Figure 4: Share of variance explained by each component per variable with $q = 4$ and $r=8$. Estimates are obtained by using part II.b when estimating $C_{it}$. Here var_statCCb$= EV_i^C$ (given in \ref{['eq:EVCa']}), var_weakCCb$=EV_i^{e^\chi}$ (given in \ref{['eq:EVWC']}). Last, var_ICb is the variance explained by the dynamic idiosyncratic component, which is given by $EV_i^\xi=1-EV_i^C-EV_i^{e^\chi}$.
Figure 5: Share of variance explained by each component per variable with $q = 4$ and $r=6$. Estimates are obtained by using part II.b when estimating $C_{it}$. Here var_statCCb$= EV_i^C$ (given in \ref{['eq:EVCa']}), var_weakCCb$=EV_i^{e^\chi}$ (given in \ref{['eq:EVWC']}). Last, var_ICb is the variance explained by the dynamic idiosyncratic component, which is given by $EV_i^\xi=1-EV_i^C-EV_i^{e^\chi}$.
Figure 6: Share of variance explained by each component per variable with $q = 4$ and $r=8$. Estimates are obtained by using part II.a when estimating $C_{it}$. Here var_statCCa$= EV_i^C$ (given in \ref{['eq:EVCa']}), var_weakCCa$=EV_i^{e^\chi}$ (given in \ref{['eq:EVWC']}). Last, var_ICa is the variance explained by the dynamic idiosyncratic component, which is given by $EV_i^\xi=1-EV_i^C-EV_i^{e^\chi}$.
Figure 7: Share of variance explained by each component per variable with $q = 4$ and $r=12$. Estimates are obtained by using part II.b when estimating $C_{it}$. Here var_statCCb$= EV_i^C$ (given in \ref{['eq:EVCa']}), var_weakCCb$=EV_i^{e^\chi}$ (given in \ref{['eq:EVWC']}). Last, var_ICb is the variance explained by the dynamic idiosyncratic component, which is given by $EV_i^\xi=1-EV_i^C-EV_i^{e^\chi}$.

Theorems & Definitions (50)

Theorem 1
Remark 1
Remark 2
Remark 3
Remark 4
Example 1
Example 2
Theorem 2
Example 3
Proposition 1: Consistency of part I.a - $\widehat{\chi}_t^n$ as in forni2000generalized
...and 40 more

The Canonical Decomposition of Factor Models: Weak Factors are Everywhere

TL;DR

Abstract

The Canonical Decomposition of Factor Models: Weak Factors are Everywhere

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (5)

Theorems & Definitions (50)