On the Correctness of the Generalized Isotonic Recursive Partitioning Algorithm
Joong-Ho Won, Jihan Jung
TL;DR
The paper addresses isotonic regression under separable convex losses and shows that, although nonuniqueness can arise and naive GIRP may fail to preserve isotonicity, there exists a correct solution reachable by a minimal modification of GIRP that uses binary partitions and carefully chosen intermediate fits. It introduces Recursive Partitioning (RP) that fixes a reference $b$ and, at each node, selects $b_L$ and $b_U$ closest to $b$ for the lower and upper parts, ensuring the piecewise fits minimize the loss on each subdomain and yield a global isotonic minimizer. A key theoretical contribution is extending the projection-pair framework via maximal upper (or minimal lower) sets and the set-valued derivative $\Sigma_b$ to both differentiable and nondifferentiable losses, together with practical LP/ max-flow formulations and a terminating criterion. The results demonstrate that (i) a globally optimal isotonic solution exists and (ii) RP finds one without backtracking, with the original GIRP recovered when the constant-fit sets are singletons. This yields a robust, general approach for isotonic regression applicable to huber, hinge, and logistic-like losses with guaranteed isotonic intermediate results and correctness.
Abstract
This paper presents an in-depth analysis of the generalized isotonic recursive partitioning (GIRP) algorithm for fitting isotonic models under separable convex losses, proposed by Luss and Rosset [J. Comput. Graph. Statist., 23 (2014), pp. 192--201] for differentiable losses and extended by Painsky and Rosset [IEEE Trans. Pattern Anal. Mach. Intell., 38 (2016), pp. 308-321] for nondifferentiable losses. The GIRP algorithm poseses an attractive feature that in each step of the algorithm, the intermediate solution satisfies the isotonicity constraint. The paper begins with an example showing that the GIRP algorithm as described in the literature may fail to produce an isotonic model, suggesting that the existence and uniqueness of the solution to the isotonic regression problem must be carefully addressed. It proceeds with showing that, among possibly many solutions, there indeed exists a solution that can be found by recursive binary partitioning of the set of observed data. A small modification of the GIRP algorithm suffices to obtain a correct solution and preserve the desired property that all the intermediate solutions are isotonic. This proposed modification includes a proper choice of intermediate solutions and a simplification of the partitioning step from ternary to binary.
