Recovering Pulse Waves from Video Using Deep Unrolling and Deep Equilibrium Models

Vineet R Shenoy; Suhas Lohit; Hassan Mansour; Rama Chellappa; Tim K. Marks

Recovering Pulse Waves from Video Using Deep Unrolling and Deep Equilibrium Models

Vineet R Shenoy, Suhas Lohit, Hassan Mansour, Rama Chellappa, Tim K. Marks

TL;DR

The paper tackles non-contact heart-rate estimation from facial video (iPPG) by formulating pulse waveform recovery as an inverse problem with learned priors. It introduces three approaches—Unrolled iPPG, DE-Prox-iPPG, and UDEQ-iPPG—that couple gradient-descent data fidelity with neural denoisers, including fixed-point DEQ components, to recover the pulsatile signal. Across MMSE-HR, PURE, and UBFC-rPPG, the methods achieve state-of-the-art HR estimates while using a fraction of the parameters of competing models, with UDEQ-iPPG delivering the best overall performance and generalization. This framework provides a principled, interpretable path to robust pulse waveform recovery from video, enabling accurate HR monitoring in challenging real-world scenarios with lower model complexity.

Abstract

Camera-based monitoring of vital signs, also known as imaging photoplethysmography (iPPG), has seen applications in driver-monitoring, perfusion assessment in surgical settings, affective computing, and more. iPPG involves sensing the underlying cardiac pulse from video of the skin and estimating vital signs such as the heart rate or a full pulse waveform. Some previous iPPG methods impose model-based sparse priors on the pulse signals and use iterative optimization for pulse wave recovery, while others use end-to-end black-box deep learning methods. In contrast, we introduce methods that combine signal processing and deep learning methods in an inverse problem framework. Our methods estimate the underlying pulse signal and heart rate from facial video by learning deep-network-based denoising operators that leverage deep algorithm unfolding and deep equilibrium models. Experiments show that our methods can denoise an acquired signal from the face and infer the correct underlying pulse rate, achieving state-of-the-art heart rate estimation performance on well-known benchmarks, all with less than one-fifth the number of learnable parameters as the closest competing method.

Recovering Pulse Waves from Video Using Deep Unrolling and Deep Equilibrium Models

TL;DR

Abstract

Recovering Pulse Waves from Video Using Deep Unrolling and Deep Equilibrium Models

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (9)

Theorems & Definitions (1)