Alpha-Trimming: Locally Adaptive Tree Pruning for Random Forests
Nikola Surjanovic, Andrew Henrey, Thomas M. Loughin
TL;DR
The paper tackles the bias-variance tension in random forests by proposing alpha-trimmed RFs that prune regression trees adaptively according to local signal-to-noise ratio. It develops Accumulated Information Pruning (AIP) to perform a bottom-up, information-criterion–driven pruning of each tree, extended to ensembles through a tunable parameter α that governs pruning strength without re-fitting. The information criteria for pruning are formalized via a modified Bayesian information criterion with penalties $P_{0,n}$ and $P_{1,n}$, and are shown to be statistically consistent with respect to model selection between tree-root and tree-stump configurations; pruning amount is related to the data’s SNR and remains computationally efficient with complexity $O(Bn)$ for fixed α. Empirically, the alpha-trimmed RFs frequently reduce mean squared prediction error on 46 data sets compared with standard RFs and perform competitively with, or better than, RFs tuned by global node-size adjustments, while avoiding cross-validation. The method offers a practical, parallelizable, and refitting-free approach to locally adaptive tree sizing in RFs, improving predictive performance in regions of varying SNR.
Abstract
We demonstrate that adaptively controlling the size of individual regression trees in a random forest can improve predictive performance, contrary to the conventional wisdom that trees should be fully grown. A fast pruning algorithm, alpha-trimming, is proposed as an effective approach to pruning trees within a random forest, where more aggressive pruning is performed in regions with a low signal-to-noise ratio. The amount of overall pruning is controlled by adjusting the weight on an information criterion penalty as a tuning parameter, with the standard random forest being a special case of our alpha-trimmed random forest. A remarkable feature of alpha-trimming is that its tuning parameter can be adjusted without refitting the trees in the random forest once the trees have been fully grown once. In a benchmark suite of 46 example data sets, mean squared prediction error is often substantially lowered by using our pruning algorithm and is never substantially increased compared to a random forest with fully-grown trees at default parameter settings.
