Generalized Factor Neural Network Model for High-dimensional Regression
Zichuan Guo, Mihai Cucuringu, Alexander Y. Shestopaloff
TL;DR
The paper tackles high-dimensional regression with latent low-dimensional structure by integrating nonparametric regression, factor models, and neural networks. It introduces differentiable PCA and Soft PCA layers, along with Additive Layers, to build Generalized Factor Neural Networks (GFANN and GFADNN) capable of learning nonlinear factor effects in $p$-dimensional data with small latent dimension $k$. Through simulations and empirical studies on SPY ETF forecasting and FRED-MD macro data, the proposed architectures consistently outperform FAR-NN, FAST-NN, and standard neural networks, especially under nonlinear factor-observation relationships. The work demonstrates that a factor-then-additive architecture efficiently captures hierarchical compositional structure, offering scalable, interpretable, and robust performance for high-dimensional regression tasks with practical impact in finance and macroeconomics.
Abstract
We tackle the challenges of modeling high-dimensional data sets, particularly those with latent low-dimensional structures hidden within complex, non-linear, and noisy relationships. Our approach enables a seamless integration of concepts from non-parametric regression, factor models, and neural networks for high-dimensional regression. Our approach introduces PCA and Soft PCA layers, which can be embedded at any stage of a neural network architecture, allowing the model to alternate between factor modeling and non-linear transformations. This flexibility makes our method especially effective for processing hierarchical compositional data. We explore ours and other techniques for imposing low-rank structures on neural networks and examine how architectural design impacts model performance. The effectiveness of our method is demonstrated through simulation studies, as well as applications to forecasting future price movements of equity ETF indices and nowcasting with macroeconomic data.
