Output-Constrained Decision Trees
Hüseyin Tunç, Doğanay Özese, Ş. İlker Birbil, Donato Maragno, Marco Caserta, Mustafa Baydoğan
TL;DR
This work tackles the challenge of producing feasible multi-target predictions by embedding output constraints directly into decision-tree training. It introduces three OCRT approaches—M-OCRT (MIP-based), E-OCRT (enumerative split search with constrained prediction), and EP-OCRT (post-hoc constrained correction)—and extends them to ensemble methods under convex feasible sets. Through synthetic and hierarchical time series experiments, the authors show that enforcing feasibility improves decision-making in downstream optimization tasks, with E-OCRT generally offering the best accuracy and M-OCRT providing flexibility at higher computational cost. The study also discusses generalized losses and optimization-with-constraint-learning contexts, highlighting the practical impact for HTS forecasting, inventory management, and resource allocation, while noting scalability and convexity assumptions as key considerations.
Abstract
Incorporating domain-specific constraints into machine learning models is essential for generating predictions that are both accurate and feasible in real-world applications. This paper introduces new methods for training Output-Constrained Regression Trees (OCRT), addressing the limitations of traditional decision trees in constrained multi-target regression tasks. We propose three approaches: M-OCRT, which uses split-based mixed integer programming to enforce constraints; E-OCRT, which employs an exhaustive search for optimal splits and solves constrained prediction problems at each decision node; and EP-OCRT, which applies post-hoc constrained optimization to tree predictions. To illustrate their potential uses in ensemble learning, we also introduce a random forest framework working under convex feasible sets. We validate the proposed methods through a computational study both on synthetic and industry-driven hierarchical time series datasets. Our results demonstrate that imposing constraints on decision tree training results in accurate and feasible predictions.
