Table of Contents
Fetching ...

ODTE -- An ensemble of multi-class SVM-based oblique decision trees

Ricardo Montañana, José A. Gámez, José M. Puerta

TL;DR

ODTE addresses multiclass classification by embedding multiclass strategies at every decision node using oblique decision trees. The STree base learner trains several SVM-based binary tests per node and retains the split that minimizes impurity, enabling direct multiclass handling within the tree; ODTE builds bootstrap-driven STrees and aggregates predictions via majority voting. Across 49 datasets, ODTE (and its tuned variant ODTET) outperforms competing oblique-tree ensembles and strong baselines, often producing more compact trees. The work provides open-source implementations to support reproducibility and future exploration in structured-data classification with oblique trees.

Abstract

We propose ODTE, a new ensemble that uses oblique decision trees as base classifiers. Additionally, we introduce STree, the base algorithm for growing oblique decision trees, which leverages support vector machines to define hyperplanes within the decision nodes. We embed a multiclass strategy -- one-vs-one or one-vs-rest -- at the decision nodes, allowing the model to directly handle non-binary classification tasks without the need to cluster instances into two groups, as is common in other approaches from the literature. In each decision node, only the best-performing model SVM -- the one that minimizes an impurity measure for the n-ary classification -- is retained, even if the learned SVM addresses a binary classification subtask. An extensive experimental study involving 49 datasets and various state-of-the-art algorithms for oblique decision tree ensembles has been conducted. Our results show that ODTE ranks consistently above its competitors, achieving significant performance gains when hyperparameters are carefully tuned. Moreover, the oblique decision trees learned through STree are more compact than those produced by other algorithms evaluated in our experiments.

ODTE -- An ensemble of multi-class SVM-based oblique decision trees

TL;DR

ODTE addresses multiclass classification by embedding multiclass strategies at every decision node using oblique decision trees. The STree base learner trains several SVM-based binary tests per node and retains the split that minimizes impurity, enabling direct multiclass handling within the tree; ODTE builds bootstrap-driven STrees and aggregates predictions via majority voting. Across 49 datasets, ODTE (and its tuned variant ODTET) outperforms competing oblique-tree ensembles and strong baselines, often producing more compact trees. The work provides open-source implementations to support reproducibility and future exploration in structured-data classification with oblique trees.

Abstract

We propose ODTE, a new ensemble that uses oblique decision trees as base classifiers. Additionally, we introduce STree, the base algorithm for growing oblique decision trees, which leverages support vector machines to define hyperplanes within the decision nodes. We embed a multiclass strategy -- one-vs-one or one-vs-rest -- at the decision nodes, allowing the model to directly handle non-binary classification tasks without the need to cluster instances into two groups, as is common in other approaches from the literature. In each decision node, only the best-performing model SVM -- the one that minimizes an impurity measure for the n-ary classification -- is retained, even if the learned SVM addresses a binary classification subtask. An extensive experimental study involving 49 datasets and various state-of-the-art algorithms for oblique decision tree ensembles has been conducted. Our results show that ODTE ranks consistently above its competitors, achieving significant performance gains when hyperparameters are carefully tuned. Moreover, the oblique decision trees learned through STree are more compact than those produced by other algorithms evaluated in our experiments.

Paper Structure

This paper contains 12 sections, 1 figure, 7 tables, 2 algorithms.

Figures (1)

  • Figure 1: Different decision boundaries and their corresponding decision trees: (left) axis-parallel decision boundary and simple DT; (middle) non axis-parallel decision boundary and complex DT because of the ladder effect; (right) non axis-parallel decision boundary and simple oblique DT.