Neural Random Forest Imitation
Christoph Reinders, Bodo Rosenhahn
TL;DR
Neural Random Forest Imitation (NRFI) tackles data scarcity by implicitly transforming random forests into neural networks through imitation learning. It generates labeled data by analyzing RF decision boundaries and guided routing, then trains a small, differentiable network to mimic the forest's predictions. NRFI achieves RF-level accuracy with orders of magnitude fewer parameters and supports end-to-end optimization and warm-starting, outperforming direct RF-to-NN mappings in efficiency and scalability. The approach scales to complex classifiers and integrates into trainable pipelines, enabling rapid deployment in low-data regimes.
Abstract
We present Neural Random Forest Imitation - a novel approach for transforming random forests into neural networks. Existing methods propose a direct mapping and produce very inefficient architectures. In this work, we introduce an imitation learning approach by generating training data from a random forest and learning a neural network that imitates its behavior. This implicit transformation creates very efficient neural networks that learn the decision boundaries of a random forest. The generated model is differentiable, can be used as a warm start for fine-tuning, and enables end-to-end optimization. Experiments on several real-world benchmark datasets demonstrate superior performance, especially when training with very few training examples. Compared to state-of-the-art methods, we significantly reduce the number of network parameters while achieving the same or even improved accuracy due to better generalization.
