Zero-Shot Decision Tree Construction via Large Language Models
Lucas Carrasco, Felipe Urrutia, Andrés Abeliuk
TL;DR
The paper tackles learning from tabular data under data scarcity by constructing decision trees in a zero-shot fashion using large language models (LLMs). It frames tree construction as a sequence of LLM-driven steps: attribute splitting, probability estimation, and Gini impurity minimization, with harmonic-mean aggregation across branches and explicit prompts to avoid overconfidence. Experimental results show that zero-shot DTs built via LLMs can outperform a zero-shot TabLLM baseline and rival traditional trees in low-data settings, while maintaining interpretability. The work introduces a principled knowledge-driven baseline for zero-data tree construction, discusses biases and dependencies on pre-trained knowledge, and outlines practical avenues for refinement and fair, domain-specific deployment.
Abstract
This paper introduces a novel algorithm for constructing decision trees using large language models (LLMs) in a zero-shot manner based on Classification and Regression Trees (CART) principles. Traditional decision tree induction methods rely heavily on labeled data to recursively partition data using criteria such as information gain or the Gini index. In contrast, we propose a method that uses the pre-trained knowledge embedded in LLMs to build decision trees without requiring training data. Our approach leverages LLMs to perform operations essential for decision tree construction, including attribute discretization, probability calculation, and Gini index computation based on the probabilities. We show that these zero-shot decision trees can outperform baseline zero-shot methods and achieve competitive performance compared to supervised data-driven decision trees on tabular datasets. The decision trees constructed via this method provide transparent and interpretable models, addressing data scarcity while preserving interpretability. This work establishes a new baseline in low-data machine learning, offering a principled, knowledge-driven alternative to data-driven tree construction.
