Uncertainty-Aware Remaining Lifespan Prediction from Images
Tristan Kenneweg, Philip Kenneweg, Barbara Hammer
TL;DR
Addresses remaining lifespan prediction from facial and full-body images and assesses the information content for person-centric health screening. Proposes an uncertainty-aware regression framework using pretrained vision transformers with a Gaussian mean–variance head trained by Gaussian negative log-likelihood loss $\mathcal{L} = \frac{1}{2N} \sum_i (\log(\sigma_i^2) + \frac{(y_i - \mu_i)^2}{\sigma_i^2})$, estimating both the mean remaining lifespan $\mu$ and its uncertainty $\sigma$. Demonstrates state-of-the-art MAEs of $4.91$ years on Faces, $4.99$ years on Whole Images, and $7.41$ years on the Legacy dataset, with a bucketed calibration error of $0.82$ years on the Faces data, and releases code and curated datasets to enable replication and further research. The work highlights the potential to extract medically relevant signals from images and emphasizes uncertainty calibration as a key component for responsible, scalable health screening research.
Abstract
Predicting mortality-related outcomes from images offers the prospect of accessible, noninvasive, and scalable health screening. We present a method that leverages pretrained vision transformer foundation models to estimate remaining lifespan from facial and whole-body images, alongside robust uncertainty quantification. We show that predictive uncertainty varies systematically with the true remaining lifespan, and that this uncertainty can be effectively modeled by learning a Gaussian distribution for each sample. Our approach achieves state-of-the-art mean absolute error (MAE) of 7.41 years on an established dataset, and further achieves 4.91 and 4.99 years MAE on two new, higher-quality datasets curated and published in this work. Importantly, our models provide calibrated uncertainty estimates, as demonstrated by a bucketed expected calibration error of 0.82 years on the Faces Dataset. While not intended for clinical deployment, these results highlight the potential of extracting medically relevant signals from images. We make all code and datasets available to facilitate further research.
