PhysioWave

ETH Zurich / University of Bologna / University of Modena and Reggio Emilia

Physiological signal foundation model for ECG, EMG, and EEG pairing learnable multi-scale wavelet decomposition with masked transformer pretraining.

Released: June 2025

Parameters: 37 Million

PhysioWave is a foundation model family for physiological time-series signals that replaces the fixed patch-embedding front end of standard signal transformers with a learnable, multi-scale wavelet decomposition. Physiological recordings—electrocardiography (ECG), electromyography (EMG), and electroencephalography (EEG)—are frequently corrupted by motion artifacts, baseline drift, and other low signal-to-noise disturbances, and they carry diagnostically important structure at very different time scales. Wavelet analysis is a classical tool for this kind of non-stationary, multi-resolution signal, and PhysioWave makes that decomposition trainable and couples it to a transformer encoder pretrained with masked reconstruction.

Developed by researchers at ETH Zurich, the University of Bologna, and the University of Modena and Reggio Emilia, the work was released as a preprint in June 2025 and accepted at NeurIPS 2025. The authors pretrain separate modality-specific backbones for ECG and EMG on large unlabeled corpora, then show that a single architecture transfers across a wide range of downstream biosignal tasks. They additionally demonstrate a multi-modal extension that fuses ECG, EMG, and EEG branches through a learnable weighted combination.

PhysioWave sits within the emerging class of biosignal foundation models, applying the self-supervised pretrain-then-adapt recipe—well established for language and protein models—to wearable and clinical physiological sensing, where raw recordings are abundant but task labels are scarce.

Key Features

Learnable wavelet decomposition: An adaptive wavelet selector chooses among candidate basis functions and performs multi-resolution analysis across several decomposition levels, so the time-frequency front end is optimized jointly with the network rather than fixed in advance.
Frequency-guided masked pretraining: Masking is steered by FFT-based spectral energy to prioritize high-energy components, and patches are reconstructed with a Smooth-L1 loss, focusing self-supervision on the most informative parts of the signal.
Multi-scale transformer encoder: A Cross-Scale Channel-Aggregation Feed-Forward Network (CAFFN) and rotary positional embeddings (RoPE) let the attention layers integrate features across resolutions and channels.
Multi-modal fusion: Modality-specific ECG, EMG, and EEG branches are combined through a learnable weighted fusion, enabling joint inference when several signal types are recorded together.
Open code and three model sizes: The implementation is released under the MIT license, with Small (5M), Base (15M), and Large (37M) parameter configurations per modality.

Technical Details

The ECG backbone is pretrained on roughly 182 GB of data drawn from MIMIC-IV-ECG, MedalCare-XL, CODE-15%, the Norwegian Athlete ECG Database, and the Georgia cohort; the EMG backbone is pretrained on roughly 823 GB spanning NinaPro DB6/DB7/DB8, EMG2Pose, and EMG2Qwerty. Each modality is offered in Small (5M), Base (15M), and Large (37M) parameter variants. On downstream evaluation, the Large ECG model reaches 66.7% F1 and 94.6% AUROC on PTB-XL; EMG models reach 94.50% accuracy on EPN-612, 87.53% on NinaPro DB5, and 93.19% on UCI EMG. The multi-modal variant improves emotion recognition on DEAP to 85.2% valence (+6.1%) and 88.6% arousal (+7.3%) accuracy, and reaches 74.9% (+4.5%) on MPDB, indicating that the wavelet front end and fusion add value over prior baselines.

Applications

PhysioWave targets researchers and engineers building models for wearable and clinical biosignals. The pretrained ECG backbone supports cardiac tasks such as arrhythmia detection and multi-label diagnostic classification; the EMG backbone supports gesture and movement decoding for prosthetics, human-computer interaction, and rehabilitation; and the multi-modal configuration supports affective-computing tasks such as emotion recognition from combined physiological streams. Because the backbones are pretrained on large unlabeled corpora, teams with limited labeled data can fine-tune for a specific downstream task instead of training a bespoke model from scratch.

Impact

PhysioWave contributes a reusable, openly licensed architecture that brings learnable time-frequency analysis into the biosignal foundation-model toolkit, establishing new baselines across ECG, EMG, and multi-modal benchmarks and demonstrating that wavelet-based front ends can outperform fixed patch embeddings on non-stationary physiological data. Its acceptance at NeurIPS 2025 and release of training code with multiple model sizes lower the barrier for downstream adoption. Notable limitations include the absence of a published EEG-only pretrained backbone alongside the released ECG and EMG weights, distribution of weights via Google Drive rather than a versioned model hub, and the lack of formal model and data cards.

Citation

PhysioWave: A Multi-Scale Wavelet-Transformer for Physiological Signal Representation

Preprint

Chen, Y., et al. (2025) PhysioWave: A Multi-Scale Wavelet-Transformer for Physiological Signal Representation. arXiv.org.

DOI: 10.48550/arXiv.2506.10351

Recent citations

Papers that recently cited this model.

AI-enabled wireless wearable breathing sensor for breathing pattern recognition
Carter Comeau, Bhawya, P. Das, et al.
Scientific Reports · May 2026
0
Towards Real-Time ECG and EMG Modeling on $\mu$NPUs
Josh Millar, A. Thangarajan, Soumyajit Chatterjee, et al.
Apr 2026
0Influential
PanLUNA: An Efficient and Robust Query-Unified Multimodal Model for Edge Biosignal Intelligence
M. Zelic, Anna Tegon, Yawei Li, et al.
Apr 2026
0

Top citations

The most-cited papers that cite this model.

NeuroRVQ: Multi-Scale Biosignal Tokenization for Generative Foundation Models
Konstantinos Barmpas, Na Lee, Dimitrios Chalatsis, et al.
Oct 2025
6
Wavelet-Driven Masked Multiscale Reconstruction for PPG Foundation Models
Megha Thukral, Cyrus Tanade, Simon A. Lee, et al.
arXiv.org · Jan 2026
5Influential
TinyMyo: a Tiny Foundation Model for Flexible EMG Signal Processing at the Edge
Matteo Fasulo, Giusy Spacone, T. Ingolfsson, et al.
arXiv.org · Dec 2025
5Influential
Discriminative Representation Learning for Clinical Prediction
Yang Zhang, Lianyi Fan, Sam Lawrence, et al.
Mar 2026
2
Wearable Foundation Models Should Go Beyond Static Encoders
Y. Wu, Yuwei Zhang, Hyungjun Yoon, et al.
Mar 2026
2

Citations

Total Citations14

Influential2

References68

GitHub

Stars189

Forks25

Open Issues1

Contributors1

Last Push9mo ago

LanguagePython

LicenseMIT

Fields of citing research

Computer Science100%
Medicine67%
Engineering50%

Share of papers citing this model.

Openness

bio.rodeo opennessFully open · usable and reproducible

80Open

Usability — can I run it?95

Reproducibility — can I retrain it?66

Model Openness Framework

Unclassified

Missing required components

Resources

GitHub Repository Research Paper

Key Features

Learnable wavelet decomposition: An adaptive wavelet selector chooses among candidate basis functions and performs multi-resolution analysis across several decomposition levels, so the time-frequency front end is optimized jointly with the network rather than fixed in advance.

Frequency-guided masked pretraining: Masking is steered by FFT-based spectral energy to prioritize high-energy components, and patches are reconstructed with a Smooth-L1 loss, focusing self-supervision on the most informative parts of the signal.

Multi-scale transformer encoder: A Cross-Scale Channel-Aggregation Feed-Forward Network (CAFFN) and rotary positional embeddings (RoPE) let the attention layers integrate features across resolutions and channels.

Multi-modal fusion: Modality-specific ECG, EMG, and EEG branches are combined through a learnable weighted fusion, enabling joint inference when several signal types are recorded together.

Open code and three model sizes: The implementation is released under the MIT license, with Small (5M), Base (15M), and Large (37M) parameter configurations per modality.

Technical Details

Applications

Impact

Recent citations

Papers that recently cited this model.

AI-enabled wireless wearable breathing sensor for breathing pattern recognition

Carter Comeau, Bhawya, P. Das, et al.

Scientific Reports · May 2026

Towards Real-Time ECG and EMG Modeling on $\mu$NPUs

Josh Millar, A. Thangarajan, Soumyajit Chatterjee, et al.

Apr 2026

0Influential

PanLUNA: An Efficient and Robust Query-Unified Multimodal Model for Edge Biosignal Intelligence

M. Zelic, Anna Tegon, Yawei Li, et al.

Apr 2026

Top citations

The most-cited papers that cite this model.

NeuroRVQ: Multi-Scale Biosignal Tokenization for Generative Foundation Models

Konstantinos Barmpas, Na Lee, Dimitrios Chalatsis, et al.

Oct 2025

Wavelet-Driven Masked Multiscale Reconstruction for PPG Foundation Models

Megha Thukral, Cyrus Tanade, Simon A. Lee, et al.

arXiv.org · Jan 2026

5Influential

TinyMyo: a Tiny Foundation Model for Flexible EMG Signal Processing at the Edge

Matteo Fasulo, Giusy Spacone, T. Ingolfsson, et al.

arXiv.org · Dec 2025

5Influential

Discriminative Representation Learning for Clinical Prediction

Yang Zhang, Lianyi Fan, Sam Lawrence, et al.

Mar 2026

Wearable Foundation Models Should Go Beyond Static Encoders

Y. Wu, Yuwei Zhang, Hyungjun Yoon, et al.

Mar 2026

PhysioWave

#Key Features

#Technical Details

#Applications

#Impact

Citation

PhysioWave: A Multi-Scale Wavelet-Transformer for Physiological Signal Representation

Recent citations

AI-enabled wireless wearable breathing sensor for breathing pattern recognition

Towards Real-Time ECG and EMG Modeling on $\mu$NPUs

PanLUNA: An Efficient and Robust Query-Unified Multimodal Model for Edge Biosignal Intelligence

Top citations

NeuroRVQ: Multi-Scale Biosignal Tokenization for Generative Foundation Models

Wavelet-Driven Masked Multiscale Reconstruction for PPG Foundation Models

TinyMyo: a Tiny Foundation Model for Flexible EMG Signal Processing at the Edge

Discriminative Representation Learning for Clinical Prediction

Wearable Foundation Models Should Go Beyond Static Encoders

Related models

Citations

GitHub

Fields of citing research

Openness

Tags

Resources

PhysioWave

#Key Features

#Technical Details

#Applications

#Impact

Citation

PhysioWave: A Multi-Scale Wavelet-Transformer for Physiological Signal Representation

Recent citations

AI-enabled wireless wearable breathing sensor for breathing pattern recognition

Towards Real-Time ECG and EMG Modeling on $\mu$NPUs

PanLUNA: An Efficient and Robust Query-Unified Multimodal Model for Edge Biosignal Intelligence

Top citations

NeuroRVQ: Multi-Scale Biosignal Tokenization for Generative Foundation Models

Wavelet-Driven Masked Multiscale Reconstruction for PPG Foundation Models

TinyMyo: a Tiny Foundation Model for Flexible EMG Signal Processing at the Edge

Discriminative Representation Learning for Clinical Prediction

Wearable Foundation Models Should Go Beyond Static Encoders

Related models

Citations

GitHub

Fields of citing research

Openness

Tags

Resources

Key Features

Technical Details

Applications

Impact

Key Features

Technical Details

Applications

Impact