On the Stability of a non-hyperbolic nonlinear map with non-bounded set of non-isolated fixed points with applications to Machine Learning
Roberta Hansen, Matias Vera, Lautaro Estienne, Luciana Ferrer, Pablo Piantanida
TL;DR
The paper addresses the convergence of the SUCPA calibration map, a non-hyperbolic nonlinear map with a non-bounded set of non-isolated fixed points lying on a straight line ${\mathcal{S}}_b$ and slope 1. It develops a non-standard, geometry-based analysis for a two-class case, proving that every initial condition converges to a fixed point on ${\mathcal{S}}_b$ and characterizing the Jacobian with a single center mode alongside a stable direction. Central to the approach are auxiliary functions $\alpha_1(x)$ and $\alpha_2(x)$ and an intercept-update function $\phi(x)$, which together establish existence, uniqueness, and global attractivity of ${\mathcal{S}}_b$; numerical experiments on sentiment polarity, MNLI entailment, and dog-vs-cat image classification corroborate the theory. The results provide convergence guarantees for SUCPA under prior mismatch in the two-class setting and offer insight into the algorithm’s behavior in higher-class scenarios, with practical implications for calibration in NLP and vision tasks.
Abstract
This paper deals with the convergence analysis of the SUCPA (Semi Unsupervised Calibration through Prior Adaptation) algorithm, defined from a first-order non-linear difference equations, first developed to correct the scores output by a supervised machine learning classifier. The convergence analysis is addressed as a dynamical system problem, by studying the local and global stability of the nonlinear map derived from the algorithm. This map, which is defined by a composition of exponential and rational functions, turns out to be non-hyperbolic with a non-bounded set of non-isolated fixed points. Hence, a non-standard method for solving the convergence analysis is used consisting of an ad-hoc geometrical approach. For a binary classification problem (two-dimensional map), we rigorously prove that the map is globally asymptotically stable. Numerical experiments on real-world application are performed to support the theoretical results by means of two different classification problems: Sentiment Polarity performed with a Large Language Model and Cat-Dog Image classification. For a greater number of classes, the numerical evidence shows the same behavior of the algorithm, and this is illustrated with a Natural Language Inference example. The experiment codes are publicly accessible online at the following repository: https://github.com/LautaroEst/sucpa-convergence
