CREMA

Self-supervised foundation model for 12-lead ECG, pairing masked autoencoder pretraining with contrastive regularization for robust diagnostics.

Released: June 2024

CREMA (Contrastive Regularized Masked Autoencoder) is a self-supervised foundation model for 12-lead electrocardiograms (ECGs), developed by Medical AI Co., Ltd. in Seoul, South Korea, and introduced in a preprint first posted in June 2024. It targets a persistent obstacle in clinical ECG analysis: models trained on one population or care setting often degrade sharply when applied to data from a different hospital, patient mix, or acquisition device. CREMA aims to learn ECG representations that transfer robustly across these distribution shifts, so that a single pretrained backbone can support many downstream diagnostic tasks without bespoke retraining for each site.

The core idea is to merge two complementary self-supervised signals that are usually used in isolation. A masked autoencoder (MAE) objective forces the model to reconstruct deliberately hidden segments of the ECG waveform, encouraging it to internalize the morphology of beats and rhythms. A contrastive regularization term, applied on top of the MAE, shapes the latent space so that semantically similar recordings stay close together while dissimilar ones are pushed apart. The authors frame this hybrid objective as a Contrastive Regularized MAE loss.

CREMA sits within the recent wave of ECG foundation models that adapt the self-supervised pretraining recipes of vision and language to physiological time series. Its distinguishing emphasis is robustness under clinical distribution shift rather than performance on a single curated benchmark.

Key Features

Hybrid self-supervised objective: Combines masked-signal reconstruction with a contrastive regularization term, blending generative and contrastive learning so representations capture both fine waveform detail and discriminative structure.
Signal Transformer backbone: Uses a transformer architecture adapted for physiological signals to model both local waveform morphology and long-range temporal dependencies across the cardiac cycle.
12-lead coverage: Operates on full 12-lead ECGs, the standard diagnostic acquisition used across emergency, inpatient, and outpatient cardiology.
Robustness to distribution shift: Explicitly evaluated on distribution-shifted clinical data to test whether learned representations hold up across heterogeneous care settings rather than only on in-distribution test splits.
Flexible downstream use: Supports both linear probing of frozen features and full fine-tuning, letting users trade off compute and labeled-data requirements against task accuracy.

Technical Details

CREMA pretrains a Signal Transformer encoder on 12-lead ECG recordings using the Contrastive Regularized MAE objective: portions of the input signal are masked and reconstructed, while a contrastive term regularizes the embedding space. After pretraining, the encoder is evaluated under two standard protocols. In linear probing, the backbone is frozen and only a lightweight classifier head is trained, isolating the quality of the learned representations. In fine-tuning, the full network is adapted end-to-end to each task. Across these protocols the authors report that CREMA outperforms supervised baselines and prior self-supervised ECG models, with the advantage most pronounced on distribution-shifted clinical data where conventional models tend to lose accuracy. The preprint emphasizes the qualitative robustness gains across clinical domains; readers should consult the paper for the exact datasets, cohort sizes, and per-task metrics.

Applications

CREMA is intended as a reusable backbone for ECG-based diagnostic tasks such as detecting arrhythmias, conduction abnormalities, and other cardiac conditions from standard 12-lead recordings. Because it is pretrained once and adapted via probing or fine-tuning, it is well suited to clinical settings where labeled data are scarce or where a model must generalize across hospitals and devices. The underlying technology is deployed in production within Medical AI Co.'s AiTiA clinical decision-support platform, which the company reports is used across 50-plus hospitals, illustrating the model family's path from research into operational cardiology workflows.

Impact

CREMA contributes to the growing effort to build transferable foundation models for physiological signals, and its specific focus on robustness under clinical distribution shift addresses a failure mode that limits the real-world deployment of many ECG classifiers. Its integration into a fielded clinical platform serving dozens of hospitals signals practical relevance beyond benchmark leaderboards. A key limitation for the research community is openness: the model weights and training code are not publicly released, as the work is commercial, so independent reproduction and external benchmarking are constrained and the published preprint is the primary source for its claims.

Citation

CREMA: A Contrastive Regularized Masked Autoencoder for Robust ECG Diagnostics across Clinical Domains

Preprint

Song, J., et al. (2024) CREMA: A Contrastive Regularized Masked Autoencoder for Robust ECG Diagnostics across Clinical Domains.

DOI: 10.48550/arXiv.2407.07110

Recent citations

Papers that recently cited this model.

Extending Pretrained 10-Second ECG Foundation Models to Longer Horizons
Wei Tang, Jinpei Han, Kangning Cui, et al.
May 2026
0
Can Meta-Learning Address The Challenges of Biosignal Personalization?
Y. Jo, Jong-Hwan Jang, Jaeho Park, et al.
IEEE International Conference on Acoustics, Speech, and Signal Processing · May 2026
0
Position: Evaluation of ECG Representations Must Be Fixed
Zachary Berger, Daniel Prakah-Asante, John Guttag, et al.
arXiv.org · Feb 2026
0

Top citations

The most-cited papers that cite this model.

ECG-FM: An Open Electrocardiogram Foundation Model
Kaden McKeen, Laura Oliva, Sameer Masood, et al.
arXiv.org · Aug 2024
88
From Token to Rhythm: A Multi-Scale Approach for ECG-Language Pretraining
Fuying Wang, Jiacheng Xu, Lequan Yu
International Conference on Machine Learning · Jun 2025
16
OpenECG: Benchmarking ECG Foundation Models with Public 1.2 Million Records
Zhijiang Wan, Qianhao Yu, J. Mao, et al.
arXiv.org · Mar 2025
9
FoundationalECGNet: A Lightweight Foundational Model for ECG-based Multitask Cardiac Analysis
M. Sk., Md Jobayer, M. M. H. Shawon, et al.
arXiv.org · Sep 2025
1
Position: Evaluation of ECG Representations Must Be Fixed
Zachary Berger, Daniel Prakah-Asante, John Guttag, et al.
arXiv.org · Feb 2026
0

Citations

Total Citations7

Influential0

References39

Fields of citing research

Computer Science100%
Medicine86%
Biology14%
Engineering14%

Share of papers citing this model.

Openness

bio.rodeo opennessClosed · low usability and reproducibility

10Closed

Usability — can I run it?7

Reproducibility — can I retrain it?13

Model Openness Framework

Unclassified

Restrictive license on core components

Resources

Research Paper

Key Features

Hybrid self-supervised objective: Combines masked-signal reconstruction with a contrastive regularization term, blending generative and contrastive learning so representations capture both fine waveform detail and discriminative structure.

Signal Transformer backbone: Uses a transformer architecture adapted for physiological signals to model both local waveform morphology and long-range temporal dependencies across the cardiac cycle.

12-lead coverage: Operates on full 12-lead ECGs, the standard diagnostic acquisition used across emergency, inpatient, and outpatient cardiology.

Robustness to distribution shift: Explicitly evaluated on distribution-shifted clinical data to test whether learned representations hold up across heterogeneous care settings rather than only on in-distribution test splits.

Flexible downstream use: Supports both linear probing of frozen features and full fine-tuning, letting users trade off compute and labeled-data requirements against task accuracy.

Technical Details

Applications

Impact

Recent citations

Papers that recently cited this model.

Extending Pretrained 10-Second ECG Foundation Models to Longer Horizons

Wei Tang, Jinpei Han, Kangning Cui, et al.

May 2026

Can Meta-Learning Address The Challenges of Biosignal Personalization?

Y. Jo, Jong-Hwan Jang, Jaeho Park, et al.

IEEE International Conference on Acoustics, Speech, and Signal Processing · May 2026

Position: Evaluation of ECG Representations Must Be Fixed

Zachary Berger, Daniel Prakah-Asante, John Guttag, et al.

arXiv.org · Feb 2026

CREMA

#Key Features

#Technical Details

#Applications

#Impact

Citation

CREMA: A Contrastive Regularized Masked Autoencoder for Robust ECG Diagnostics across Clinical Domains

Recent citations

Extending Pretrained 10-Second ECG Foundation Models to Longer Horizons

Top citations

Related models

Citations

Fields of citing research

Openness

Tags

Resources

CREMA

#Key Features

#Technical Details

#Applications

#Impact

Citation

CREMA: A Contrastive Regularized Masked Autoencoder for Robust ECG Diagnostics across Clinical Domains

Recent citations

Extending Pretrained 10-Second ECG Foundation Models to Longer Horizons

Top citations

Related models

Citations

Fields of citing research

Openness

Tags

Resources

Key Features

Technical Details

Applications

Impact

Key Features

Technical Details

Applications

Impact