Branches: Efficiently Seeking Optimal Sparse Decision Trees with AO*
Ayman Chaouki, Jesse Read, Albert Bifet
TL;DR
Branches introduces an AO*-type algorithm operating on an AND/OR graph to efficiently seek optimal sparse decision trees with a joint accuracy–sparsity objective ${\mathcal{H}}_{\lambda}(T) = {\mathcal{H}}(T) - \lambda {\mathcal{S}}(T)$. It defines a Purification Bound-based heuristic and proves optimality with quantified complexity, while supporting non-binary, multi-way splits via ordinal encoding. Empirical results show Branches often outperforms state-of-the-art DFS/BFS methods in runtime and iterations, and it provides anytime solutions even under time limits. The work highlights practical gains in interpretability and scalability for large, real-world datasets, and suggests avenues for faster implementations and hybrid search strategies in future work.
Abstract
Decision Tree (DT) Learning is a fundamental problem in Interpretable Machine Learning, yet it poses a formidable optimisation challenge. Practical algorithms have recently emerged, primarily leveraging Dynamic Programming and Branch & Bound. However, most of these approaches rely on a Depth-First-Search strategy, which is inefficient when searching for DTs at high depths and requires the definition of a maximum depth hyperparameter. Best-First-Search was also employed by other methods to circumvent these issues. The downside of this strategy is its higher memory consumption, as such, it has to be designed in a fully efficient manner that takes full advantage of the problem's structure. We formulate the problem within an AND/OR graph search framework and we solve it with a novel AO*-type algorithm called Branches. We prove both optimality and complexity guarantees for Branches and we show that it is more efficient than the state of the art theoretically and on a variety of experiments. Furthermore, Branches supports non-binary features unlike the other methods, we show that this property can further induce larger gains in computational efficiency.
