Quantifying Local Model Validity using Active Learning
Sven Lämmle, Can Bogoclu, Robert Voßhall, Anselm Haselhoff, Dirk Roos
TL;DR
This paper tackles the challenge of validating ML predictions in a local, input-specific manner rather than relying on global accuracy. It formulates model validity as a two-boundary limit-state problem and uses a transformed Gaussian process to learn the boundary near the tolerance $\xi$, guided by a novel acquisition function synthetic to misclassification probability (MC-Prob). The authors provide frequentist error bounds, demonstrate data-efficient learning on analytical and real-model benchmarks, and compare against conformal prediction methods, showing substantial reductions in validation data needs while preserving safety-critical guarantees. The work offers practical impact for deploying reliable ML in regulated or safety-critical domains by enabling targeted, efficient validation of local performance.
Abstract
Real-world applications of machine learning models are often subject to legal or policy-based regulations. Some of these regulations require ensuring the validity of the model, i.e., the approximation error being smaller than a threshold. A global metric is generally too insensitive to determine the validity of a specific prediction, whereas evaluating local validity is costly since it requires gathering additional data.We propose learning the model error to acquire a local validity estimate while reducing the amount of required data through active learning. Using model validation benchmarks, we provide empirical evidence that the proposed method can lead to an error model with sufficient discriminative properties using a relatively small amount of data. Furthermore, an increased sensitivity to local changes of the validity bounds compared to alternative approaches is demonstrated.
