NormWear

Multimodal foundation model for wearable physiological sensing across PPG, ECG, EEG, GSR, and IMU signals, using channel-aware attention.

Released: December 2024

Parameters: 200 Million

NormWear is a multimodal foundation model for wearable physiological sensing, designed to extract generalizable representations from the heterogeneous time-series signals produced by consumer and clinical wearables. Wearable devices capture diverse modalities — photoplethysmography (PPG), electrocardiography (ECG), electroencephalography (EEG), galvanic skin response (GSR), and inertial measurement (IMU) — but real-world data is plagued by variability in sensor placement, sampling rate, channel availability, and device configuration. Most prior models were trained narrowly for a single signal type or task, limiting transfer. NormWear targets this fragmentation by learning a shared representation space that is compatible with arbitrary combinations of sensors and channels.

Developed by Yunfei Luo, Yuliang Chen, Asif Salekin, and Tauhidur Rahman at the MOSAIC mobile-sensing lab at the University of California, San Diego, NormWear was first released as a preprint in December 2024 and has since been accepted (in press, 2026) at ACM Transactions on Computing for Healthcare. It is presented as the first general-purpose foundation model spanning this breadth of wearable modalities, positioning it alongside emerging biosignal foundation models while emphasizing cross-sensor generality rather than single-modality specialization.

The central technical contribution is a channel-aware attention mechanism paired with a shared liaison [CLS] token, which lets the model reason both within individual sensor channels and across multiple sensors simultaneously, regardless of how many channels a given downstream dataset provides.
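The idea of a shared liaison token can be illustrated with a small NumPy sketch. This is not the authors' implementation: the single-head attention with identity projections, the mean used to merge the per-channel [CLS] outputs, and the names `self_attention` and `channel_aware_layer` are all assumptions chosen to keep the example short. It only shows how one shared token can be attended to inside every channel and then pooled across channels, so that the same layer runs unchanged on 1, 3, or 6 channels.

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)  # subtract max for numerical stability
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def self_attention(tokens):
    """Single-head scaled dot-product self-attention.

    Identity query/key/value projections are an illustrative simplification.
    tokens: (sequence_length, dim)
    """
    dim = tokens.shape[-1]
    scores = tokens @ tokens.T / np.sqrt(dim)
    return softmax(scores, axis=-1) @ tokens

def channel_aware_layer(patches, cls):
    """Hypothetical channel-aware layer built around one shared [CLS] token.

    patches: (n_channels, n_patches, dim) patch tokens per sensor channel
    cls:     (dim,) shared liaison token
    """
    new_patches, cls_per_channel = [], []
    for channel_tokens in patches:
        # Intra-channel step: the shared token joins this channel's sequence.
        sequence = np.vstack([cls[None, :], channel_tokens])
        attended = self_attention(sequence)
        cls_per_channel.append(attended[0])
        new_patches.append(attended[1:])
    # Inter-channel step (assumed): pool the per-channel [CLS] outputs.
    new_cls = np.mean(cls_per_channel, axis=0)
    return np.stack(new_patches), new_cls

rng = np.random.default_rng(0)
dim = 16
cls = rng.normal(size=dim)
for n_channels in (1, 3, 6):
    patches = rng.normal(size=(n_channels, 10, dim))
    out_patches, out_cls = channel_aware_layer(patches, cls)
    print(n_channels, out_patches.shape, out_cls.shape)
```

Because the shared token is the only path between channels in this sketch, the parameter shapes never depend on the number of channels, which is the property the paragraph above describes.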

Key Features

Cross-modal generality: A single pretrained encoder handles PPG, ECG, EEG, GSR, and IMU signals, accommodating variable channel counts and sensor configurations without per-modality retraining.
Channel-aware attention: A shared liaison [CLS] token mediates attention within each sensor channel and across channels, capturing both intra-sensor temporal structure and inter-sensor relationships.
Self-supervised pretraining: The model is pretrained on large-scale public physiological datasets using a masked-reconstruction objective with configurable time/frequency masking schemes, requiring no task labels.
Time-frequency representation: Continuous Wavelet Transform (CWT) scalograms feed the encoder, exposing multi-scale frequency structure that is informative for periodic biosignals like PPG and ECG.
Flexible evaluation regimes: NormWear supports zero-shot inference via signal-text alignment, plus partial-shot and full-shot fine-tuning for both classification and regression tasks.
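The masked-reconstruction objective with time or frequency masking can be sketched as follows. The scheme names (`random`, `time`, `frequency`), the 0.5 mask ratio, and the helper names `make_mask` and `masked_mse` are assumptions for illustration; the source only states that the masking schemes are configurable. The sketch builds a boolean mask over a grid of scalogram patches and scores a reconstruction only on the hidden patches.

```python
import numpy as np

def make_mask(n_freq, n_time, scheme, ratio, rng):
    """Boolean mask over a (frequency, time) patch grid; True means hidden."""
    mask = np.zeros((n_freq, n_time), dtype=bool)
    if scheme == "random":
        # Hide individual patches anywhere on the grid.
        n_hide = int(round(ratio * n_freq * n_time))
        idx = rng.choice(n_freq * n_time, size=n_hide, replace=False)
        mask.flat[idx] = True
    elif scheme == "time":
        # Hide whole time columns (all frequencies at the chosen time steps).
        cols = rng.choice(n_time, size=int(round(ratio * n_time)), replace=False)
        mask[:, cols] = True
    elif scheme == "frequency":
        # Hide whole frequency rows (all time steps in the chosen bands).
        rows = rng.choice(n_freq, size=int(round(ratio * n_freq)), replace=False)
        mask[rows, :] = True
    else:
        raise ValueError(f"unknown masking scheme: {scheme}")
    return mask

def masked_mse(target, prediction, mask):
    """Mean squared error computed over hidden patches only."""
    squared_error = (prediction - target) ** 2  # (n_freq, n_time, patch_dim)
    return squared_error[mask].mean()

rng = np.random.default_rng(0)
n_freq, n_time, patch_dim = 4, 10, 8
target = rng.normal(size=(n_freq, n_time, patch_dim))
prediction = target + 1.0  # stand-in for a decoder output
for scheme in ("random", "time", "frequency"):
    mask = make_mask(n_freq, n_time, scheme, ratio=0.5, rng=rng)
    print(scheme, int(mask.sum()), masked_mse(target, prediction, mask))
```

No labels appear anywhere in the loss, which is why this kind of objective can be trained on unlabeled recordings.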

Technical Details

NormWear is a transformer-based encoder (with a decoder used during masked pretraining) of roughly 0.2B parameters. Inputs are converted to CWT scalograms and tokenized into patches, producing 768-dimensional patch embeddings with on the order of 365 patches per channel. Pretraining uses masked signal reconstruction as the self-supervised objective, with random or structured masking applied across the time and frequency dimensions. The channel-aware attention layers operate over the per-channel patch sequences while the shared [CLS] token aggregates information across channels, so the architecture scales to whatever sensor set a downstream dataset exposes.

The model was benchmarked on 11 public wearable sensing datasets spanning 18 downstream applications in four domains (mental health, body-state inference, vital-sign estimation, and disease-risk evaluation) under zero-shot, partial-shot, and full-shot protocols. Across these settings the authors report consistent improvements over state-of-the-art baselines.
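The scalogram-and-patch input pipeline can be sketched in NumPy as below. Only the CWT input and the 768-dimensional embedding width come from the text; the complex Morlet wavelet, the 32 scales, the 8x16 patch size, the random projection matrix, and the names `morlet_cwt` and `patchify` are assumptions for illustration, and the sketch does not attempt to reproduce the roughly 365 patches per channel mentioned above.

```python
import numpy as np

def morlet_cwt(signal, scales, w0=6.0):
    """Magnitude scalogram of shape (len(scales), len(signal)).

    Uses a complex Morlet wavelet (an assumed choice). The signal must be
    longer than the widest wavelet so that mode="same" keeps its length.
    """
    rows = []
    for s in scales:
        half_width = int(np.ceil(4 * s))
        t = np.arange(-half_width, half_width + 1) / s
        wavelet = np.exp(1j * w0 * t) * np.exp(-0.5 * t**2) / np.sqrt(s)
        rows.append(np.abs(np.convolve(signal, wavelet, mode="same")))
    return np.stack(rows)

def patchify(scalogram, patch_f, patch_t):
    """Cut a (freq, time) scalogram into flattened non-overlapping patches."""
    n_f, n_t = scalogram.shape
    n_f, n_t = n_f - n_f % patch_f, n_t - n_t % patch_t  # drop ragged edges
    grid = scalogram[:n_f, :n_t].reshape(
        n_f // patch_f, patch_f, n_t // patch_t, patch_t
    )
    return grid.transpose(0, 2, 1, 3).reshape(-1, patch_f * patch_t)

# Toy single-channel signal: a sinusoid at 0.05 cycles per sample.
samples = np.arange(512)
signal = np.sin(2 * np.pi * 0.05 * samples)

scales = np.geomspace(2, 32, num=32)            # assumed scale range
scalogram = morlet_cwt(signal, scales)          # (32, 512)
tokens = patchify(scalogram, patch_f=8, patch_t=16)  # (128, 128)

# Placeholder linear projection to the 768-dimensional embedding width.
rng = np.random.default_rng(0)
projection = rng.normal(scale=0.02, size=(tokens.shape[1], 768))
embeddings = tokens @ projection
print(scalogram.shape, tokens.shape, embeddings.shape)
```

For a multi-channel recording the same steps would run once per channel, giving one patch sequence per channel for the attention layers to consume.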

Applications

NormWear is intended for researchers and developers building health-inference pipelines on wearable data. Its representations support mental-health assessment, body-state inference (such as activity or stress states), vital-sign estimation (for example heart-rate or respiration-related targets), and disease-risk evaluation. Because the model accepts heterogeneous sensor inputs and supports zero-shot text-aligned classification, it lowers the barrier for teams with small or label-scarce datasets to bootstrap a task without training from scratch, and it provides a common backbone for benchmarking new wearable health applications.
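Zero-shot text-aligned classification is commonly done by comparing a signal embedding with the embeddings of candidate label descriptions and picking the closest one. The sketch below assumes that pattern with cosine similarity; the source does not specify the scoring rule, and the toy three-dimensional vectors, the label names, and the function `zero_shot_classify` are hypothetical stand-ins for real encoder outputs.

```python
import numpy as np

def zero_shot_classify(signal_embedding, label_embeddings, labels):
    """Return the label whose text embedding is most similar to the signal.

    signal_embedding: (dim,) vector from a signal encoder (placeholder here)
    label_embeddings: (n_labels, dim) vectors from a text encoder (placeholder)
    """
    signal_unit = signal_embedding / np.linalg.norm(signal_embedding)
    label_units = label_embeddings / np.linalg.norm(
        label_embeddings, axis=1, keepdims=True
    )
    similarities = label_units @ signal_unit  # cosine similarity per label
    return labels[int(np.argmax(similarities))], similarities

# Toy embeddings in a shared 3-dimensional space (illustrative values only).
labels = ["rest", "stress", "exercise"]
label_embeddings = np.array([
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
    [0.0, 0.0, 1.0],
])
signal_embedding = np.array([0.2, 0.9, 0.1])

predicted, similarities = zero_shot_classify(
    signal_embedding, label_embeddings, labels
)
print(predicted, similarities)
```

No task-specific training happens in this flow: changing the task only means changing the list of label descriptions, which is what makes the approach attractive for label-scarce datasets.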

Impact

NormWear advances the case for general-purpose foundation models in the biosignal domain, where most prior work was siloed by signal type. By demonstrating that one channel-aware encoder can transfer across five modalities and 18 tasks, it offers a reusable starting point for the mobile-health community. The code is released under Apache-2.0 with pretrained weights distributed via Hugging Face and a GitHub release, supporting reproduction and extension. As a recent release its long-term adoption is still emerging, and reported gains depend on the specific dataset and evaluation regime; broader independent validation across devices and populations will determine how well its generality holds in deployment.

Citations

Toward Foundation Model for Multivariate Wearable Sensing of Physiological Signals

Preprint

Luo, Y., et al. (2024) Toward Foundation Model for Multivariate Wearable Sensing of Physiological Signals. arXiv preprint arXiv:2412.09758.

DOI: 10.48550/arXiv.2412.09758

Toward Foundation Model for Multivariate Wearable Sensing of Physiological Signals

Luo, Y., et al. (2024) Toward Foundation Model for Multivariate Wearable Sensing of Physiological Signals. ACM Transactions on Computing for Healthcare.

DOI: 10.1145/3803808

Recent citations

Papers that recently cited this model.

SpO$_2$ Predictor-Guided Stage-Wise Time-Frequency Reconstruction of Low-Quality Dual-Wavelength PPG for Oxygen Saturation Estimation
Zequan Liang, Elahe Hosseini, Ning Miao, et al.
Jul 2026 · Citations: 0

Physical activities enable scalable foundation modelling for broad-spectrum health prediction
Zhenghuang Wu, Yuyao Zhu, Songli Xu
Jul 2026 · Citations: 0

Toward the Open World: Closed-Loop Psychophysiological Intervention Systems Driven by Biosignal Foundation Models
Jingyu Liu, Xuan Zhang, Yang Li, et al.
IEEE Transactions on Computational Social Systems · Jun 2026 · Citations: 0

Top citations

The most-cited papers that cite this model.

ZARA: Training-Free Motion Time-Series Reasoning via Evidence-Grounded LLM Agents
Zechen Li, Baiyu Chen, Hao Xue, et al.
Aug 2025 · Citations: 13

Cardiac health assessment across scenarios and devices using a multimodal foundation model pretrained on data from 1.7 million individuals
Xiao Gu, Wei Tang, Jinpei Han, et al.
Nature Machine Intelligence · Feb 2026 · Citations: 8

Position: Beyond Prediction: Toward Verifiable Physiological Waveform Reasoning with Foundation Models and Agentic LLMs
Xiaoda Wang, Ching Chang, Defu Cao, et al.
Citations: 5

Foundation Models Defining A New Era In Sensor-based Human Activity Recognition: A Survey And Outlook
Sizhen Bian, Mengxi Liu, L. Ray, et al.
Apr 2026 · Citations: 2 · Influential

Wearable Foundation Models Should Go Beyond Static Encoders
Y. Wu, Yuwei Zhang, Hyungjun Yoon, et al.
Mar 2026 · Citations: 2 · Influential

Citations

Total Citations: 31

Influential: 4

References: 88

GitHub

Stars: 59

Forks: 14

Open Issues: 9

Contributors: 2

Last Push: 2mo ago

Language: Python

License: Apache-2.0

HuggingFace

Downloads: 116

Likes: 2

Last Modified: 2mo ago

Fields of citing research

Computer Science: 90%
Engineering: 63%
Medicine: 63%
Environmental Science: 7%
Biology: 7%
Psychology: 3%
Education: 3%

Share of papers citing this model.

Openness

bio.rodeo openness: Fully open · usable and reproducible

Overall: 77 · Open

Usability (can I run it?): 100

Reproducibility (can I retrain it?): 63

Model Openness Framework

Class II

Open Tooling

Resources

GitHub Repository · Research Paper · HuggingFace Model
