BrainWave (Brant-2)

Foundation model spanning invasive SEEG/iEEG and non-invasive EEG in one backbone, with zero- and few-shot transfer across neurological disorders.

Released: February 2024

Parameters: 86 million

BrainWave (developed under the codename Brant-2) is a foundation model for brain electrical signals built by the BrainNet group at Zhejiang University (Zhizhang Yuan and colleagues), first released as an arXiv preprint in February 2024. It is positioned as the first foundation model to span both invasive recordings, namely stereo-EEG (SEEG) and intracranial EEG (iEEG), and non-invasive scalp EEG within a single pretrained backbone, whereas prior efforts typically targeted one modality.

The core problem BrainWave addresses is the extreme heterogeneity of neural recordings: sampling rates, channel counts, electrode montages, recording sites, and patient populations all vary widely, which has historically forced researchers to train bespoke models per dataset and per disorder. By pretraining a single channel-agnostic encoder on a very large, mixed corpus, BrainWave learns representations that transfer across recording conditions and across diseases. This includes zero-shot settings with no task-specific fine-tuning and few-shot settings that use only a handful of labeled examples.

BrainWave is the successor to the authors' earlier Brant model, which focused on intracranial signals. Brant-2/BrainWave broadens the scope to unify invasive and non-invasive modalities and to target clinical diagnosis of neurological disorders.

Key Features

Cross-modality coverage: A single backbone ingests both invasive (iEEG/SEEG, 48–238 channels) and non-invasive (EEG, 1–64 channels) recordings, rather than specializing in one signal type.
Scale-alignment layer: Each 1-second signal patch is converted to a spectrogram with constant time–frequency resolution, letting the model handle diverse sampling rates (EEG ~100–1024 Hz; iEEG ~1000–4096 Hz) without resampling.
Channel-agnostic attention: Channels are encoded independently, then a bidirectional self-attention stage models inter-channel correlations, so the model accepts variable channel counts and montages.
Zero- and few-shot transfer: BrainWave supports cross-hospital and cross-subtype zero-shot transfer and strong 3-/8-shot classification, reducing reliance on large labeled clinical datasets.
Broad clinical scope: Evaluated on epilepsy/seizure detection, Alzheimer's disease, major depressive disorder, schizophrenia, and ADHD.
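
The scale-alignment idea in the list above can be illustrated with a small NumPy sketch. This is not the authors' layer: it is a minimal example, assuming a fixed analysis window in seconds and a fixed frequency grid in Hz, so that a 1-second patch yields the same time–frequency grid at any sampling rate. The function name `aligned_spectrogram`, the grid `FREQS_HZ`, and the window and frame settings are all hypothetical choices made for illustration.

```python
import numpy as np

# Hypothetical shared frequency grid in Hz (an assumption, not from the paper).
# The top bin stays below 50 Hz, the Nyquist limit of a 100 Hz recording.
FREQS_HZ = np.arange(2.0, 50.0, 4.0)


def aligned_spectrogram(patch, fs, freqs_hz=FREQS_HZ, n_frames=8, win_sec=0.25):
    """Power spectrogram of one patch on a grid that does not depend on fs.

    Windows and frequencies are defined in physical units (seconds, Hz),
    so the output shape is always (n_frames, len(freqs_hz)).
    """
    patch = np.asarray(patch, dtype=float)
    t = np.arange(patch.size) / fs            # sample times in seconds
    duration = patch.size / fs
    # Frame centers placed so every window lies inside the patch.
    centers = win_sec / 2 + np.linspace(0.0, duration - win_sec, n_frames)
    # Hann taper evaluated at the actual sample times of each frame.
    u = (t[None, :] - centers[:, None]) / win_sec
    taper = np.where(np.abs(u) <= 0.5, 0.5 * (1.0 + np.cos(2.0 * np.pi * u)), 0.0)
    # Direct Fourier projection onto the shared frequency grid.
    basis = np.exp(-2j * np.pi * np.outer(t, freqs_hz))      # (samples, freqs)
    coeff = (taper * patch) @ basis / taper.sum(axis=1, keepdims=True)
    return np.abs(coeff) ** 2                                # (frames, freqs)


def demo(tone_hz=10.0):
    """Run the same 10 Hz tone through the sketch at several sampling rates."""
    results = {}
    for fs in (100, 256, 1000, 4096):
        t = np.arange(fs) / fs                # one second of samples
        spec = aligned_spectrogram(np.sin(2.0 * np.pi * tone_hz * t), fs)
        peak_hz = float(FREQS_HZ[spec.mean(axis=0).argmax()])
        results[fs] = (spec.shape, peak_hz)
    return results
```

Calling `demo()` returns the spectrogram shape and the strongest frequency bin for each sampling rate. Because the grid is defined in seconds and Hz rather than in samples, a downstream encoder would see inputs of one fixed size whether the patch came from scalp EEG or iEEG, with no resampling step.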

Technical Details

BrainWave is a RoBERTa-style transformer encoder with roughly 86 million parameters: 10 layers, 16 attention heads, hidden size 768, and a maximum sequence of 61 tokens (60 one-second signal patches plus a [CLS] token). It is pretrained by masked reconstruction of time–frequency representations over ~3.16 billion signal patches.

The pretraining corpus totals 13.79 TB across 40,907 hours from roughly 16,000 individuals: 10.63 TB of iEEG (5,231 hours, 91 subjects) and 3.16 TB of EEG (35,675 hours, 15,906 subjects). It draws on public datasets such as TUEG, Sleep-EDF, CAP, HMC, Siena, SRM, and CCEP alongside private corpora.

Across 13 downstream datasets and 20+ tasks, the authors report average gains of roughly 11.9% AUROC for cross-subject diagnosis over the next-best baseline, zero-shot cross-hospital seizure transfer around 93.8% AUROC, and 8-shot results such as ~91.9% AUROC on absence-seizure data and ~89.8% AUROC for major depressive disorder.
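
The masked-reconstruction objective mentioned above can be sketched in a few lines of NumPy. This is a generic illustration of the technique, not the authors' training code: the 40% mask ratio, the 96-value patch representation, zero-filling of masked patches, and the names `make_mask` and `masked_reconstruction_loss` are all assumptions. Only the 60-patch sequence length comes from the text.

```python
import numpy as np

N_PATCHES = 60  # one-second patches per sequence, as stated in the text


def make_mask(n_patches, mask_ratio, rng):
    """Pick a random subset of patches to hide (True = masked)."""
    n_masked = max(1, int(round(mask_ratio * n_patches)))
    mask = np.zeros(n_patches, dtype=bool)
    mask[rng.choice(n_patches, size=n_masked, replace=False)] = True
    return mask


def masked_reconstruction_loss(target, prediction, mask):
    """Mean squared error computed over the masked patches only."""
    per_patch = ((prediction - target) ** 2).mean(axis=-1)
    return float(per_patch[mask].mean())


rng = np.random.default_rng(0)
# Hypothetical time-frequency features: 96 values per patch (an assumption).
target = rng.standard_normal((N_PATCHES, 96))
mask = make_mask(N_PATCHES, mask_ratio=0.4, rng=rng)  # assumed ratio

# The encoder would see this corrupted input, with masked patches zeroed out.
corrupted = target.copy()
corrupted[mask] = 0.0

# A stand-in "model" that just copies its input recovers nothing that was hidden.
copy_loss = masked_reconstruction_loss(target, corrupted, mask)
# A perfect reconstruction drives the loss to zero.
perfect_loss = masked_reconstruction_loss(target, target, mask)
```

Restricting the loss to masked positions is the key design choice: errors on visible patches do not count, so the encoder is rewarded only for inferring hidden signal content from its context.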

Applications

BrainWave targets clinical neurophysiology workflows where labeled data is scarce and recording setups differ across sites. Use cases demonstrated in the paper include seizure detection and seizure-onset-zone localization, cross-hospital and cross-subtype seizure transfer, and screening or biomarker prediction for Alzheimer's disease, depression, schizophrenia, and ADHD. Because the same encoder serves both intracranial monitoring (e.g., presurgical epilepsy evaluation) and routine scalp EEG, it is of interest to epileptologists, sleep and psychiatric researchers, and machine-learning groups building diagnostic tools that must generalize across hospitals and patient populations.

Impact

BrainWave is one of the first efforts to unify invasive and non-invasive brain recordings under a single foundation model, extending the authors' earlier Brant work and contributing to the rapid emergence of biosignal foundation models. Its emphasis on zero- and few-shot clinical transfer is significant for settings where collecting large labeled corpora is impractical. Important caveats remain: the strongest results are on internal benchmarks reported in a preprint (peer review of the latest version was ongoing), and as of this writing the authors state that code and model weights will be released upon publication, so independent reproduction and broad adoption are still pending.

Citation

BrainWave: A Brain Signal Foundation Model for Clinical Applications

Preprint

Yuan, Z., et al. (2024) BrainWave: A Brain Signal Foundation Model for Clinical Applications.

DOI: 10.48550/arXiv.2402.10251

Recent citations

Papers that recently cited this model.

RECTOR: Masked Region-Channel-Temporal Modeling for Affective and Cognitive Representation Learning
Jinhan Liu, Mahsa Shoaran
Jun 2026
0
ST-CoG-XAI: A Spectrotemporal Contrastive Generation Foundation Model for Explainable EEG Decoding
Jiaqi Ding, He Wang, Changsheng Wu, et al.
IEEE Journal on Flexible Electronics · Jun 2026
0
NeuroAtlas: Benchmarking Foundation Models for Clinical EEG and Brain-Computer Interfaces
Konstantinos Kontras, Trui Osselaer, Stylianos G Mouslech, et al.
May 2026
0

Top citations

The most-cited papers that cite this model.

CSBrain: A Cross-scale Spatiotemporal Brain Foundation Model for EEG Decoding
Yuchen Zhou, Jiamin Wu, Zichen Ren, et al.
arXiv.org · Jun 2025
35
Brain Foundation Models: A survey on advancements in neural signal processing and brain discovery
Xin-qiu Zhou, Chenyu Liu, Zhisheng Chen, et al.
IEEE Signal Processing Magazine · Mar 2025
34
BrainOmni: A Brain Foundation Model for Unified EEG and MEG Signals
Qinfan Xiao, Ziyun Cui, Chi Zhang, et al.
arXiv.org · May 2025
24
The Brain's Bitter Lesson: Scaling Speech Decoding With Self-Supervised Learning
D. Jayalath, Gilad Landau, Brendan Shillingford, et al.
International Conference on Machine Learning · Jun 2024
22
EEG foundation models: a critical review of current progress and future directions
Gayal Kuruppu, Neeraj Wagh, V. Kremen, et al.
Journal of Neural Engineering · Jul 2025
20 · Influential

Citations

Total Citations: 31

Influential: 2

References: 69

GitHub

Stars: 47

Forks: 2

Open Issues: 1

Contributors: 1

Last Push: 1y ago

Fields of citing research

Computer Science: 100%
Engineering: 55%
Medicine: 52%
Biology: 23%
Physics: 3%
Psychology: 3%

Share of papers citing this model.

Openness

bio.rodeo openness: Closed · low usability and reproducibility

10 · Closed

Usability (can I run it?): 7

Reproducibility (can I retrain it?): 12

Model Openness Framework

Unclassified

Restrictive license on core components

Resources

GitHub Repository Research Paper

