Interval Regression: A Comparative Study with Proposed Models
Tung L Nguyen, Toby Dylan Hocking
TL;DR
Interval-valued targets complicate regression tasks; this paper provides a comprehensive review of existing interval regression models and introduces three proposed approaches (KNN, MLP, MMIF). It systematically evaluates seven models across real-world and synthetic datasets using hinge-based loss, highlighting that MMIF generally achieves the best performance and consistency, with MMIT as a strong, lighter alternative. The AFT model in XGBoost exhibits limitations, particularly with left-censoring, and may require careful preprocessing. The study offers practical guidance for model selection in interval regression and contributes open-source code to support reproducible research.
Abstract
Regression models are essential for a wide range of real-world applications. However, in practice, target values are not always precisely known; instead, they may be represented as intervals of acceptable values. This challenge has led to the development of Interval Regression models. In this study, we provide a comprehensive review of existing Interval Regression models and introduce alternative models for comparative analysis. Experiments are conducted on both real-world and synthetic datasets to offer a broad perspective on model performance. The results demonstrate that no single model is universally optimal, highlighting the importance of selecting the most suitable model for each specific scenario.
