A simplified directional KeRF algorithm

Iakovidis Isidoros; Nicola Arcozzi

A simplified directional KeRF algorithm

Iakovidis Isidoros, Nicola Arcozzi

TL;DR

The paper addresses kernel-based random forests (KeRF) and their interpolation properties, introducing a simplified directional KeRF to reduce data-dependent splitting. It proves that the simplified directional KeRF is asymptotically equivalent to the centered KeRF as the number of trees $M$ tends to infinity, and it derives a kernel representation for the estimator. It provides improved convergence rates in the mean interpolation regime for the centered KeRF and demonstrates, via numerical experiments, that the two approaches behave identically for moderate to large $M$. The results offer a simpler, kernel-based alternative with theoretical guarantees and practical viability in low-dimensional settings.

Abstract

Random forest methods belong to the class of non-parametric machine learning algorithms. They were first introduced in 2001 by Breiman and they perform with accuracy in high dimensional settings. In this article, we consider, a simplified kernel-based random forest algorithm called simplified directional KeRF (Kernel Random Forest). We establish the asymptotic equivalence between simplified directional KeRF and centered KeRF, with additional numerical experiments supporting our theoretical results.

A simplified directional KeRF algorithm

TL;DR

tends to infinity, and it derives a kernel representation for the estimator. It provides improved convergence rates in the mean interpolation regime for the centered KeRF and demonstrates, via numerical experiments, that the two approaches behave identically for moderate to large

. The results offer a simpler, kernel-based alternative with theoretical guarantees and practical viability in low-dimensional settings.

Abstract

Paper Structure (10 sections, 14 theorems, 60 equations, 7 figures)

This paper contains 10 sections, 14 theorems, 60 equations, 7 figures.

Introduction
Contributions of the article
Notation and preliminaries
Kernel Random Forest algorithm
The Centered KeRF algorithm
Simplified directional KeRF
Plots and experiments.
Interpolating random trees
Conclusions
Proofs of theorems

Key Result

Theorem 1

$\textbf{Y}=m(\textbf{X}) +\epsilon$ where $\epsilon$ is a zero mean Gaussian noise with finite variance $\sigma$ independent of $\textbf{X}$. Assuming also that $\textbf{X}$ is uniformly distributed in $[0,1]^d$ and $m$ is a Lipschitz function. Then there exists constants $c_1, c_2$ depending on $d

Figures (7)

Figure 1: Centered algorithm with tree level $k=1$ with the convention that $1$ corresponds to $x_1$ axis and $2$ to the $x_2$ axis.
Figure 2: Centered algorithm with tree level $k=2$ with the convention that $1$ corresponds to $x_1$ axis and $2$ to the $x_2$ axis.
Figure 3: Centered algorithm with tree level $k=1$ with the convention that $1$ corresponds to $x_1$ axis and $2$ to the $x_2$ axis.
Figure 4: Centered algorithm with tree level $k=2$ with the convention that $1$ corresponds to $x_1$ axis and $2$ to the $x_2$ axis.
Figure 5: Comparison of $L-$error and standard deviation.
...and 2 more figures

Theorems & Definitions (24)

Theorem
Definition 1
Definition 2
Proposition 1: Scornet S, Proposition 1
Definition 3
Proposition 2: Scornet S, Proposition 2
Proposition 3
Theorem 1
Corollary 1
Definition 4
...and 14 more

A simplified directional KeRF algorithm

TL;DR

Abstract

A simplified directional KeRF algorithm

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (7)

Theorems & Definitions (24)