Wearable Accelerometer Foundation Models for Health via Knowledge Distillation

Salar Abbaspourazad; Anshuman Mishra; Joseph Futoma; Andrew C. Miller; Ian Shapiro

Wearable Accelerometer Foundation Models for Health via Knowledge Distillation

Salar Abbaspourazad, Anshuman Mishra, Joseph Futoma, Andrew C. Miller, Ian Shapiro

TL;DR

This work introduces accelerometry foundation models for health by distilling representations from a high-fidelity PPG teacher to a low-fidelity accelerometer encoder using a fully unsupervised, two-stage framework trained on the Apple Heart and Movement Study data. The PPG teacher is pre-trained with masked autoencoding (and occasionally contrastive learning), and its embeddings are transferred to accelerometry via cross-modal contrastive learning on paired signals, with augmentations proving crucial. The distilled accelerometry encoders exhibit strong cross-modal alignment (approximately $99.2\%$ top-1 retrieval) and deliver superior performance across heart rate, heart rate variability, demographics, and 46 health targets, while also enabling model compression to smaller architectures. This generalist foundation-model behavior suggests accelerometry-based digital biomarkers can be broadly deployed across wearables, expanding accessible health monitoring while highlighting considerations for privacy, equity, and interpretability.

Abstract

Modern wearable devices can conveniently record various biosignals in the many different environments of daily living, enabling a rich view of individual health. However, not all biosignals are the same: high-fidelity biosignals, such as photoplethysmogram (PPG), contain more physiological information, but require optical sensors with a high power footprint. Alternatively, a lower-fidelity biosignal such as accelerometry has a significantly smaller power footprint and is available in almost any wearable device. While accelerometry is widely used for activity recognition and fitness, it is less explored for health biomarkers and diagnosis. Here, we show that an accelerometry foundation model can predict a wide variety of health targets. To achieve improved performance, we distill representational knowledge from PPG encoders to accelerometery encoders using 20 million minutes of unlabeled data, collected from ~172K participants in the Apple Heart and Movement Study under informed consent. We observe strong cross-modal alignment on unseen data, e.g., 99.2% top-1 accuracy for retrieving PPG embeddings from accelerometry embeddings. We show that distilled accelerometry encoders have significantly more informative representations compared to self-supervised or supervised encoders trained directly on accelerometry data, observed by at least 23%-49% improved performance for predicting heart rate and heart rate variability. We also show that distilled accelerometry encoders are readily predictive of a wide array of downstream health targets, i.e., they are generalist foundation models. We believe accelerometry foundation models for health may unlock new opportunities for developing digital biomarkers from any wearable device.

Wearable Accelerometer Foundation Models for Health via Knowledge Distillation

TL;DR

Abstract

Wearable Accelerometer Foundation Models for Health via Knowledge Distillation

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)