Methods to integrate multinormals and compute classification measures
Abhranil Das, Wilson S Geisler
TL;DR
This work tackles the challenge of computing probabilities and classification performance for normal distributions in arbitrary dimensions and domains, where general analytical expressions are unavailable. It introduces two complementary approaches: a generalized chi-square method for quadratic domains and a ray-trace method for general, non-quadratic domains, together with open-source Matlab tools for integrating normals, obtaining distributions of functions of normals, and evaluating Bayes- and custom-classifier performance. The authors provide rigorous formulations for Bayes decision boundaries, the discriminability index $d'_b$, and dimension-wise contributions, and demonstrate the methods on vision-science problems such as occluding-target detection and camouflage, with performance benchmarks showing high accuracy and speed, including parallelizable ray-trace computations. The practical impact is a versatile, machine-precision toolbox for validating probabilistic decision models in uncertainty, enabling precise model comparison, dimension reduction, and diagnostic tests across complex, multi-cue visual tasks.
Abstract
Univariate and multivariate normal probability distributions are widely used when modeling decisions under uncertainty. Computing the performance of such models requires integrating these distributions over specific domains, which can vary widely across models. Besides some special cases, there exist no general analytical expressions, standard numerical methods or software for these integrals. Here we present mathematical results and open-source software that provide (i) the probability in any domain of a normal in any dimensions with any parameters, (ii) the probability density, cumulative distribution, and inverse cumulative distribution of any function of a normal vector, (iii) the classification errors among any number of normal distributions, the Bayes-optimal discriminability index and relation to the operating characteristic, (iv) ways to scale the discriminability of two distributions, (v) dimension reduction and visualizations for such problems, and (vi) tests for how reliably these methods may be used on given data. We demonstrate these tools with vision research applications of detecting occluding objects in natural scenes, and detecting camouflage.
