TIC-TAC: A Framework for Improved Covariance Estimation in Deep Heteroscedastic Regression
Megh Shukla, Mathieu Salzmann, Alexandre Alahi
TL;DR
This paper tackles sub-optimal covariance estimation in deep heteroscedastic regression by introducing Taylor Induced Covariance (TIC), a closed-form covariance approximation that ties the randomness of predictions to the local gradient and curvature via the Jacobian and Hessian of the mean predictor, plus an independent noise term. It also introduces Task Agnostic Correlations (TAC), a conditioning-based metric that directly assesses the quality of learned correlations in the covariance without ground-truth covariances. The authors show that TIC–learned covariances improve convergence of the negative log-likelihood and provide more faithful uncertainty representations across synthetic data, UCI regression, and 2D human pose estimation, with TAC offering a robust evaluation of covariance quality. The work includes practical guidance on limitations (e.g., Hessian computation) and demonstrates reproducibility via public code.
Abstract
Deep heteroscedastic regression involves jointly optimizing the mean and covariance of the predicted distribution using the negative log-likelihood. However, recent works show that this may result in sub-optimal convergence due to the challenges associated with covariance estimation. While the literature addresses this by proposing alternate formulations to mitigate the impact of the predicted covariance, we focus on improving the predicted covariance itself. We study two questions: (1) Does the predicted covariance truly capture the randomness of the predicted mean? (2) In the absence of supervision, how can we quantify the accuracy of covariance estimation? We address (1) with a Taylor Induced Covariance (TIC), which captures the randomness of the predicted mean by incorporating its gradient and curvature through the second order Taylor polynomial. Furthermore, we tackle (2) by introducing a Task Agnostic Correlations (TAC) metric, which combines the notion of correlations and absolute error to evaluate the covariance. We evaluate TIC-TAC across multiple experiments spanning synthetic and real-world datasets. Our results show that not only does TIC accurately learn the covariance, it additionally facilitates an improved convergence of the negative log-likelihood. Our code is available at https://github.com/vita-epfl/TIC-TAC
