Eko Digital Stethoscope CVD Foundation Model

Masked-autoencoder foundation model pretrained on digital-stethoscope heart sounds and single-lead ECG for cardiovascular disease detection.

Released: October 2024

Parameters: 85.3 Million

Digital stethoscopes capture two synchronized biosignals at the point of care: the phonocardiogram (PCG), an acoustic recording of heart sounds, and a single-lead electrocardiogram (ECG). While these signals carry rich information about cardiovascular disease, building accurate detection algorithms has historically been limited by the scarcity of expertly annotated recordings. This work, published by Eko Health in npj Cardiovascular Health in October 2024, addresses that bottleneck by pretraining transformer-based foundation models on large volumes of unlabeled stethoscope data and then fine-tuning them for specific clinical tasks.

The authors adapt the masked autoencoder (MAE) self-supervised framework, originally developed for images, to single- and multi-signal stethoscope data. PCG recordings are converted to mel-spectrograms and split into patches, ECG signals are split into temporal segments, and the model learns to reconstruct masked portions of each. By learning general-purpose representations from recordings collected during routine clinical practice, the resulting encoders transfer effectively to downstream detection problems where labeled data is limited.

This is, to the authors' knowledge, the first foundation-model approach built specifically for synchronously captured PCG and ECG from digital stethoscopes, and it demonstrates strong performance across structural murmur, atrial fibrillation, and reduced ejection fraction detection.

Key Features

Self-supervised pretraining on routine recordings: Models are pretrained with a masked autoencoder objective on up to 1,890,304 unlabeled PCG recordings collected from Eko devices in everyday clinical use, removing the need for labels during the representation-learning phase.
Multi-modal stethoscope signals: Separate PCG and ECG encoders, plus a combined PCG-ECG variant, exploit the two synchronized signals a digital stethoscope captures at a single auscultation site.
Signal-specific masking: PCG mel-spectrograms use 16×16 patches with 70% masking, while ECG uses 1×625 temporal segments with 30% masking, tailoring the reconstruction task to each modality.
Transferable to multiple cardiac tasks: The pretrained encoder is fine-tuned for structural-heart-disease murmur detection, atrial fibrillation, and low ejection fraction, three clinically distinct screening problems.

Technical Details

The architecture is a "base"-scale vision-transformer-style MAE. The encoder comprises 12 transformer layers with a 768-dimensional embedding, 3072-dimensional feed-forward blocks, and 12 attention heads, totaling 85,254,144 trainable parameters; a lighter decoder (4 layers, 384-dimensional embedding, 6 heads, 7,492,864 parameters) is used only during pretraining, for a combined 92.7M parameters. Pretraining corpora span 1,890,304 PCG recordings, 241,664 ECG recordings, and 221,184 paired PCG-ECG recordings. After fine-tuning, the models reach an AUROC of 98.3% (99.0% on real-world evidence) for structural-heart-disease murmur detection, 98.0% (97.9% on a held-out test set) for atrial fibrillation, and 84.5% for low ejection fraction detection, with self-supervised pretraining consistently improving over training from scratch on the same labeled data.

Applications

The models target point-of-care cardiovascular screening using hardware already deployed in clinics. Fine-tuned detectors for structural murmurs, atrial fibrillation, and reduced ejection fraction could help primary-care clinicians flag patients for echocardiography or specialist referral during routine exams, including in resource-limited settings where access to cardiology is scarce. More broadly, the pretrained encoders provide a reusable starting point for developing additional stethoscope-based biosignal classifiers without assembling large labeled datasets for each new condition.

Impact

The work demonstrates that self-supervised foundation models, well established in imaging and language, transfer effectively to synchronized cardiac biosignals, with pretraining on unlabeled clinical recordings delivering measurable gains on label-scarce detection tasks. It is an industry-led example of leveraging proprietary device data at scale for medical AI. A notable limitation for the open-research community is reproducibility: neither the code nor the model weights are publicly released, and the authors state the implementation code "will be made available upon request," so independent verification and reuse are constrained.

Citations

Foundation models for cardiovascular disease detection via biosignals from digital stethoscopes

Mathew, G., et al. (2024) Foundation models for cardiovascular disease detection via biosignals from digital stethoscopes. npj Cardiovascular Health.

DOI: 10.1038/s44325-024-00027-5

Foundation Models for Cardiovascular Disease Detection via BioSignals from Digital Stethoscopes

Mathew, G., et al. (2024) Foundation Models for Cardiovascular Disease Detection via BioSignals from Digital Stethoscopes. Springer Science and Business Media LLC.

DOI: 10.21203/rs.3.rs-4732737/v1

Recent citations

Papers that recently cited this model.

ECGMind: A Foundation Model for ECG Classification via Dynamic Energy Guidance
Yuqi She, Junhao Huang, Erke Wang, et al.
2026
0
Translating AI Into the Eye Clinic-From Models to Clinical Workflow.
Di Zhang, Ya Xing Wang, Y. Tham, et al.
JAMA ophthalmology · Jul 2026
0
Comprehensive Dataset and Signal Processing Framework for Phonocardiogram-Based Heart Rate and Blood Pressure Estimation
Abdullah Mamun, Utsab Saha, Mahmudul Hasan, et al.
May 2026
0

Top citations

The most-cited papers that cite this model.

Large Models for Time Series and Spatio-Temporal Data: A Survey and Outlook
Ming Jin, Qingsong Wen, Yuxuan Liang, et al.
arXiv.org · Oct 2023
184
Foundation Model
Jingping Nie, D. Tran, Karan Thakkar, et al.
40
Toward Foundation Model for Multivariate Wearable Sensing of Physiological Signals
Yunfei Luo, Yuliang Chen, Asif Salekin, et al.
ACM Transactions on Computing for Healthcare · Dec 2024
27
Advances in cardiovascular signal analysis with future directions: a review of machine learning and deep learning models for cardiovascular disease classification based on ECG, PCG, and PPG signals
Y. Fuadah, Ki Moo Lim
Biomedical Engineering Letters · Apr 2025
19
Promoting cross-modal representations to improve multimodal foundation models for physiological signals
Ching Fang, Chris Sandino, Behrooz Mahasseni, et al.
arXiv.org · Oct 2024
17

Citations

Total Citations31

Influential2

References69

Fields of citing research

Medicine90%
Computer Science87%
Engineering58%

Share of papers citing this model.

Openness

bio.rodeo opennessClosed · low usability and reproducibility

22Closed

Usability — can I run it?15

Reproducibility — can I retrain it?14

Model Openness Framework

Unclassified

Missing required components

Resources

Research Paper Official Website

Key Features

Self-supervised pretraining on routine recordings: Models are pretrained with a masked autoencoder objective on up to 1,890,304 unlabeled PCG recordings collected from Eko devices in everyday clinical use, removing the need for labels during the representation-learning phase.

Multi-modal stethoscope signals: Separate PCG and ECG encoders, plus a combined PCG-ECG variant, exploit the two synchronized signals a digital stethoscope captures at a single auscultation site.

Signal-specific masking: PCG mel-spectrograms use 16×16 patches with 70% masking, while ECG uses 1×625 temporal segments with 30% masking, tailoring the reconstruction task to each modality.

Transferable to multiple cardiac tasks: The pretrained encoder is fine-tuned for structural-heart-disease murmur detection, atrial fibrillation, and low ejection fraction, three clinically distinct screening problems.

Technical Details

Applications

Impact

Citations

Foundation models for cardiovascular disease detection via biosignals from digital stethoscopes

Mathew, G., et al. (2024) Foundation models for cardiovascular disease detection via biosignals from digital stethoscopes. npj Cardiovascular Health.

DOI: 10.1038/s44325-024-00027-5

Foundation Models for Cardiovascular Disease Detection via BioSignals from Digital Stethoscopes

Mathew, G., et al. (2024) Foundation Models for Cardiovascular Disease Detection via BioSignals from Digital Stethoscopes. Springer Science and Business Media LLC.

DOI: 10.21203/rs.3.rs-4732737/v1

Recent citations

Papers that recently cited this model.

ECGMind: A Foundation Model for ECG Classification via Dynamic Energy Guidance

Yuqi She, Junhao Huang, Erke Wang, et al.

2026

Translating AI Into the Eye Clinic-From Models to Clinical Workflow.

Di Zhang, Ya Xing Wang, Y. Tham, et al.

JAMA ophthalmology · Jul 2026

Comprehensive Dataset and Signal Processing Framework for Phonocardiogram-Based Heart Rate and Blood Pressure Estimation

Abdullah Mamun, Utsab Saha, Mahmudul Hasan, et al.

May 2026

Eko Digital Stethoscope CVD Foundation Model

#Key Features

#Technical Details

#Applications

#Impact

Citations

Foundation models for cardiovascular disease detection via biosignals from digital stethoscopes

Foundation Models for Cardiovascular Disease Detection via BioSignals from Digital Stethoscopes

Recent citations

Comprehensive Dataset and Signal Processing Framework for Phonocardiogram-Based Heart Rate and Blood Pressure Estimation

Top citations

Foundation Model

Related models

Citations

Fields of citing research

Openness

Tags

Resources

Eko Digital Stethoscope CVD Foundation Model

#Key Features

#Technical Details

#Applications

#Impact

Citations

Foundation models for cardiovascular disease detection via biosignals from digital stethoscopes

Foundation Models for Cardiovascular Disease Detection via BioSignals from Digital Stethoscopes

Recent citations

Comprehensive Dataset and Signal Processing Framework for Phonocardiogram-Based Heart Rate and Blood Pressure Estimation

Top citations

Foundation Model

Related models

Citations

Fields of citing research

Openness

Tags

Resources

Key Features

Technical Details

Applications

Impact

Key Features

Technical Details

Applications

Impact