Tempered Calculus for ML: Application to Hyperbolic Model Embedding
Richard Nock, Ehsan Amid, Frank Nielsen, Alexander Soen, Manfred K. Warmuth
TL;DR
Tempered Calculus for ML introduces a generalization of integration through $t$-additivity and its calculus, unifying classical and nonextensive perspectives to shape ML distortion measures. It defines a $t$-Riemann integral and a generalized derivative, linked by $ extfrak{g}_t(z)= ext{log}_t( ext{exp}(z))$, enabling tunable distortion properties such as hyperbolicity and metricity. The framework is applied to hyperbolic embeddings, notably for embedding boosted decision trees in the Poincaré disk, supported by Monotonic Decision Trees (MDT) to maintain interpretable, monotone confidence paths and a boosting scheme (logisticBoost) that preserves hyperbolic structure. The work provides both theoretical tools and practical embedding methods, including Lorentz and Poincaré models under tempering, with empirical demonstrations on DT/MDT embeddings and their interpretability in hyperbolic space. Together, these contributions offer a principled way to design and analyze ML distortions with controlled geometric properties for improved encoding, hierarchy capture, and visualization in hyperbolic representations.
Abstract
Most mathematical distortions used in ML are fundamentally integral in nature: $f$-divergences, Bregman divergences, (regularized) optimal transport distances, integral probability metrics, geodesic distances, etc. In this paper, we unveil a grounded theory and tools which can help improve these distortions to better cope with ML requirements. We start with a generalization of Riemann integration that also encapsulates functions that are not strictly additive but are, more generally, $t$-additive, as in nonextensive statistical mechanics. Notably, this recovers Volterra's product integral as a special case. We then generalize the Fundamental Theorem of calculus using an extension of the (Euclidean) derivative. This, along with a series of more specific Theorems, serves as a basis for results showing how one can specifically design, alter, or change fundamental properties of distortion measures in a simple way, with a special emphasis on geometric- and ML-related properties that are the metricity, hyperbolicity, and encoding. We show how to apply it to a problem that has recently gained traction in ML: hyperbolic embeddings with a "cheap" and accurate encoding along the hyperbolic vs Euclidean scale. We unveil a new application for which the Poincaré disk model has very appealing features, and our theory comes in handy: \textit{model} embeddings for boosted combinations of decision trees, trained using the log-loss (trees) and logistic loss (combinations).
