Deep Variable-Block Chain with Adaptive Variable Selection
Lixiang Zhang, Lin Lin, Jia Li
TL;DR
This work introduces Deep Variable-Block Chain (DVC), a chain-structured, LSTM-inspired neural network that operates on blocks of features $X^{(v)}$ partitioned from a high-dimensional vector $X \in \mathbb{R}^p$, enabling effective modeling of non-grid variables. A forward greedy search constructs the block chain, with a global selection length $S$ determined by cross-validated error. To capture heterogeneity across data regions, the authors add Adaptive Variable Selection (AVS) via a decision-tree $\mathcal{T}_{VS}$ that assigns a region-specific number of blocks $\tilde{\nu}$ to use, forming DVC-AVS when combined with DVC predictions. Experiments on simulated and real biomedical datasets show that DVC (and especially DVC-AVS) achieves higher accuracy at reduced dimensionality and reveals region-specific sets of important variables, including robustness to highly correlated features. The approach offers interpretable, block-wise variable interactions and a principled framework for adaptive feature selection in high-dimensional non-grid data, with potential impact in biomarker discovery and precision medicine.
Abstract
The architectures of deep neural networks (DNN) rely heavily on the underlying grid structure of variables, for instance, the lattice of pixels in an image. For general high dimensional data with variables not associated with a grid, the multi-layer perceptron and deep belief network are often used. However, it is frequently observed that those networks do not perform competitively and they are not helpful for identifying important variables. In this paper, we propose a framework that imposes on blocks of variables a chain structure obtained by step-wise greedy search so that the DNN architecture can leverage the constructed grid. We call this new neural network Deep Variable-Block Chain (DVC). Because the variable blocks are used for classification in a sequential manner, we further develop the capacity of selecting variables adaptively according to a number of regions trained by a decision tree. Our experiments show that DVC outperforms other generic DNNs and other strong classifiers. Moreover, DVC can achieve high accuracy at much reduced dimensionality and sometimes reveals drastically different sets of relevant variables for different regions.
