Apple Wearable Accelerometer Foundation Model

Wearable accelerometry foundation model distilled from a PPG encoder, predicting cardiovascular and health biomarkers from motion signals alone.

Released: December 2024

Wrist-worn accelerometers are ubiquitous in consumer wearables, yet they are typically used for coarse tasks like step counting and activity recognition. The Apple Wearable Accelerometer Foundation Model, introduced by Abbaspourazad and colleagues at Apple in December 2024, asks a more ambitious question: can a single generalist model learn rich representations from raw acceleration signals that transfer to a wide range of downstream health prediction tasks? The paper answers yes and, crucially, shows that these representations can be imbued with cardiovascular information that is not directly observable in motion data alone.

The central innovation is cross-modal knowledge distillation. Photoplethysmography (PPG), the optical blood-volume sensor found in many smartwatches, carries direct signal about heart rate and heart rate variability, but it is power-hungry and not present on every device. Accelerometry, by contrast, is cheap, low-power, and nearly universal. By training an accelerometry encoder to match the embeddings of a pretrained PPG encoder, the model transfers physiological knowledge from the richer modality into the more accessible one, so that downstream biomarkers can be inferred from accelerometry alone.
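To make the idea of "matching the teacher's embeddings" concrete, here is a minimal sketch of one way such an objective could look. The text above does not specify the exact loss, so the InfoNCE-style contrastive form, the temperature value, and all function and variable names below are assumptions for illustration, not the authors' implementation.

```python
import math

def l2_normalize(v):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]

def distillation_loss(accel_emb, ppg_emb, temperature=0.1):
    """InfoNCE-style loss (assumed objective, not confirmed by the source).

    accel_emb: student (accelerometry) embeddings, one list of floats per segment
    ppg_emb:   frozen teacher (PPG) embeddings for the same segments, same order
    The loss is low when each accelerometry embedding is most similar to the
    PPG embedding of its own segment and dissimilar to the others in the batch.
    """
    a = [l2_normalize(v) for v in accel_emb]
    p = [l2_normalize(v) for v in ppg_emb]
    total = 0.0
    for i, a_i in enumerate(a):
        logits = [sum(x * y for x, y in zip(a_i, p_j)) / temperature for p_j in p]
        # Numerically stable log-sum-exp over all candidate PPG embeddings.
        m = max(logits)
        log_denom = m + math.log(sum(math.exp(z - m) for z in logits))
        # Cross-entropy with the matching pair (index i) as the correct class.
        total += log_denom - logits[i]
    return total / len(a)

# Toy data: three segments with 2-D embeddings (hypothetical values).
teacher = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
aligned = [[0.9, 0.1], [0.1, 0.9], [-0.9, 0.1]]
shuffled = [aligned[1], aligned[2], aligned[0]]

loss_aligned = distillation_loss(aligned, teacher)
loss_shuffled = distillation_loss(shuffled, teacher)
print(loss_aligned < loss_shuffled)
```

In this toy setup the student embeddings that line up with their teacher counterparts get a lower loss than the same embeddings paired with the wrong segments, which is the pressure that pulls the accelerometry encoder toward the PPG embedding space.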

The work fits into the broader movement toward foundation models for wearable and biosignal data, alongside Apple's earlier PPG and ECG foundation models. It is pretrained on data from the Apple Heart and Movement Study, one of the largest longitudinal wearable cohorts assembled, positioning it as a generalist backbone for population-scale digital health.

Key Features

Cross-modal distillation from PPG: A pretrained PPG encoder serves as the teacher, transferring cardiovascular structure into an accelerometry student encoder so motion data can predict signals it does not directly measure.
Generalist health representations: A single frozen backbone supports diverse downstream biomarker prediction tasks, including heart rate and heart rate variability, without per-task retraining.
Device-agnostic inference: Because predictions are driven by accelerometry, the model can in principle run on any motion-sensing wearable, not only devices equipped with optical heart-rate sensors.
Strong zero-shot retrieval: The aligned embedding space enables retrieval of a participant's PPG embedding from their accelerometry embedding with 99.2% top-1 accuracy, evidencing tight cross-modal alignment.
Population-scale pretraining: Learned from 20 million minutes of unlabeled data spanning roughly 172,000 participants in the Apple Heart and Movement Study.
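The retrieval figure quoted in the list above is a top-1 accuracy. As a rough sketch of how such a metric is typically computed, the snippet below ranks candidate PPG embeddings by cosine similarity to each accelerometry embedding. The choice of cosine similarity, the function names, and the toy vectors are assumptions; the source does not describe the evaluation code.

```python
import math

def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(y * y for y in v))
    return dot / (norm_u * norm_v)

def top1_retrieval_accuracy(queries, candidates):
    """Fraction of queries whose most similar candidate sits at the same index.

    queries[i] and candidates[i] are assumed to come from the same segment.
    """
    hits = 0
    for i, q in enumerate(queries):
        sims = [cosine(q, c) for c in candidates]
        best = max(range(len(sims)), key=sims.__getitem__)
        if best == i:
            hits += 1
    return hits / len(queries)

# Toy data (hypothetical): the third query is closest to the wrong candidate.
ppg_embeddings = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
accel_embeddings = [[0.9, 0.1], [0.2, 0.8], [0.3, 0.7]]

accuracy = top1_retrieval_accuracy(accel_embeddings, ppg_embeddings)
print(accuracy)
```

Here two of the three queries retrieve their own PPG embedding, so the toy accuracy is two thirds; the reported 99.2% would correspond to nearly every held-out query ranking its true match first.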

Technical Details

The model is an accelerometry encoder trained via knowledge distillation against a PPG teacher encoder, using approximately 20 million minutes of unlabeled wearable recordings collected from about 172,000 participants. Pretraining is self-supervised in the sense that no human health labels are required; the supervisory signal comes entirely from the teacher's embeddings. After pretraining, the frozen accelerometry representations are evaluated on downstream tasks via lightweight probes. Relative to self-supervised and supervised accelerometry baselines, the distilled model improves heart rate and heart rate variability prediction by at least 23% to 49%, and it achieves 99.2% top-1 accuracy on the cross-modal retrieval task of matching accelerometry embeddings to their corresponding PPG embeddings. The paper reports these results across multiple downstream health biomarkers, characterizing the embeddings as a generalist substrate rather than a single-task predictor.
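The "lightweight probes" step can be pictured as fitting a small model on top of fixed embeddings while the encoder itself stays untouched. The sketch below assumes a linear probe trained with plain batch gradient descent; the probe type, the optimizer, the hyperparameters, and the synthetic embeddings and heart-rate targets are all illustrative assumptions rather than details from the paper.

```python
def fit_linear_probe(embeddings, targets, lr=0.1, steps=2000):
    """Fit targets ~ w . embedding + b by batch gradient descent on mean squared error.

    Only w and b are learned; the encoder that produced the embeddings is frozen.
    """
    n = len(embeddings)
    dim = len(embeddings[0])
    w, b = [0.0] * dim, 0.0
    for _ in range(steps):
        grad_w, grad_b = [0.0] * dim, 0.0
        for x, y in zip(embeddings, targets):
            err = sum(wi * xi for wi, xi in zip(w, x)) + b - y
            for k in range(dim):
                grad_w[k] += 2 * err * x[k] / n
            grad_b += 2 * err / n
        w = [wi - lr * g for wi, g in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b

def predict(w, b, x):
    """Apply the fitted probe to one embedding."""
    return sum(wi * xi for wi, xi in zip(w, x)) + b

# Synthetic frozen embeddings and targets generated as 60 + 2*x0 - 1*x1
# (made-up numbers standing in for a heart-rate-like quantity).
frozen_embeddings = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, 0.2]]
heart_rate = [60.0, 62.0, 59.0, 61.0, 60.8]

w, b = fit_linear_probe(frozen_embeddings, heart_rate)
print(round(predict(w, b, [0.5, 0.5]), 3))
```

Because the probe has only a handful of parameters, how well it predicts a target mostly reflects how much relevant information the frozen embeddings already contain, which is why this style of evaluation is used to compare encoders.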

Applications

The model targets continuous, passive health monitoring on consumer wearables. Because it extracts cardiovascular-informative features from accelerometry alone, it can extend heart-rate and HRV-style estimates to contexts where optical sensing is unavailable, unreliable, or too power-intensive — for example on simpler fitness trackers or during periods when the PPG sensor is gated to save battery. Researchers running large digital-health studies benefit from a reusable backbone that produces informative embeddings for many endpoints, reducing the labeled data needed for each new biomarker. More broadly, it illustrates a template for transferring knowledge from expensive, information-rich sensors to cheap, ubiquitous ones.

Impact

The work advances the case that wearable accelerometry, often treated as a low-value modality, can carry far more physiological information than its direct measurements suggest when paired with cross-modal distillation. It contributes to a growing family of Apple biosignal foundation models and to the wider effort to build generalist encoders for digital health. A key limitation for the research community is openness: as of publication, neither the model weights nor the training code had been released, and the underlying Apple Heart and Movement Study data is not publicly available, so the results cannot be independently reproduced or directly built upon outside Apple. The contribution is therefore best read as a methodological demonstration of cross-modal distillation at population scale rather than a deployable open artifact.

Citation

Wearable Accelerometer Foundation Models for Health via Knowledge Distillation

Preprint

Abbaspourazad, S., et al. (2024) Wearable Accelerometer Foundation Models for Health via Knowledge Distillation. arXiv.org.

DOI: 10.48550/arXiv.2412.11276

Recent citations

Papers that recently cited this model.

A robust PPG foundation model using multimodal physiological supervision
E. Geenjaar, V. Calhoun, Scott Daly, et al.
Jun 2026 · Citations: 0

BCG-FM: A Foundation Model for Ambient Cardiac Health Sensing
M. R. Kjaer, Haejun Han, Ashish Neupane, et al.
Jun 2026 · Citations: 0

WavesFM: Hierarchical Representation Learning for Longitudinal Wearable Sensor Waveforms
Peng Cao, Zhijian Yang, Tennison Liu, et al.
May 2026 · Citations: 0

Top citations

The most-cited papers that cite this model.

Beyond Sensor Data: Foundation Models of Behavioral Data from Wearables Improve Health Predictions
Eray Erturk, Fahad Kamran, Salar Abbaspourazad, et al.
International Conference on Machine Learning · Jun 2025 · Citations: 25

Wavelet-Driven Masked Multiscale Reconstruction for PPG Foundation Models
Megha Thukral, Cyrus Tanade, Simon A. Lee, et al.
arXiv.org · Jan 2026 · Citations: 5

Discriminative Representation Learning for Clinical Prediction
Yang Zhang, Lianyi Fan, Sam Lawrence, et al.
Mar 2026 · Citations: 2

Learning Clinical Representations Under Systematic Distribution Shift
Yuanyu Zhang, Shi Li
Mar 2026 · Citations: 2

Continuous Well-Being assessment and actionable feedback using explainable regression for Edge-Enabled wearable devices
Sheikh Muhammad Saqib, Tehseen Mazhar, Amal Al-Rasheed, et al.
Egyptian Informatics Journal · Dec 2025 · Citations: 2

Citations

Total Citations: 16

Influential: 0

References: 65

Fields of citing research

Computer Science: 100%
Medicine: 81%
Engineering: 56%

Share of papers citing this model.

Openness

bio.rodeo openness: Closed · low usability and reproducibility

5 · Closed

Usability (can I run it?): 7

Reproducibility (can I retrain it?): 0 · not reproducible

Model Openness Framework

Unclassified

Restrictive license on core components

Resources

Research Paper

