An EEG foundation model that learns generic brain-signal representations via a vector-quantized neural tokenizer and masked-transformer pretraining on ~2,500 hours of data.
LaBraM (Large Brain Model) is a foundation model for electroencephalography (EEG) that learns generic, transferable representations of brain activity rather than being tuned to a single dataset or task. EEG research has long been fragmented: recordings differ in the number and placement of electrodes, sampling rates, and recording duration, which forces most deep-learning approaches to train narrow, dataset-specific models that fail to exploit the growing body of available data. LaBraM addresses this by treating raw EEG channels as patchable signals that can be tokenized and modeled with a transformer, enabling pretraining across many heterogeneous datasets and montages.
Developed by Wei-Bang Jiang, Li-Ming Zhao, and Bao-Liang Lu at Shanghai Jiao Tong University, LaBraM was introduced in a paper submitted to arXiv on 29 May 2024 and presented as a Spotlight at ICLR 2024. It is one of the first EEG foundation models to combine a learned discrete tokenizer with large-scale masked pretraining, and it became the predecessor to NeuroLM, the authors' later multimodal EEG-language model.
The core idea is to decouple representation learning from any single downstream objective. By segmenting EEG into channel patches and learning a neural codebook that captures spectral structure, LaBraM can be pretrained self-supervised on unlabeled data and then fine-tuned on diverse brain-computer interface (BCI) tasks with minimal architectural changes.
LaBraM uses a two-stage recipe. First, a vector-quantized neural spectrum prediction
(VQ-NSP) tokenizer is trained to reconstruct the Fourier amplitude and phase of EEG
patches, yielding a codebook of 8192 discrete embeddings (each 64-dimensional). Second,
a neural transformer is pretrained with a masked-modeling objective: a subset of
channel patches is masked and the model predicts their codebook indices. Pretraining
used about 2,500 hours of EEG aggregated from around 20 public datasets. The released
checkpoints include the VQ-NSP tokenizer (vqnsp.pth) and the pretrained
labram-base transformer. On the TUAB abnormal-detection benchmark, LaBraM-Base
reaches 0.814 balanced accuracy and 0.902 AUROC; on the six-class TUEV event-type
task it reports 0.641 balanced accuracy, 0.664 Cohen's kappa, and 0.831 weighted F1,
outperforming prior task-specific baselines.
LaBraM targets BCI and clinical EEG workflows where labeled data is scarce but unlabeled recordings are plentiful. After self-supervised pretraining, it can be fine-tuned for pathology screening (e.g., abnormal-EEG detection), seizure and event-type classification, affective computing (emotion recognition), and motor or gait decoding. Because the tokenizer accepts arbitrary channel configurations, researchers can apply a single pretrained backbone to recordings from different hardware without redesigning the model, lowering the barrier to building EEG classifiers in neurology, sleep research, and human-machine interaction.
LaBraM helped establish the foundation-model paradigm for EEG, showing that a single self-supervised backbone can match or beat bespoke models across several benchmarks. Its ICLR 2024 Spotlight and openly released code and weights (MIT license, 600+ GitHub stars) made it a widely used reference point for subsequent EEG and biosignal foundation models, and it directly seeded the authors' follow-up NeuroLM. Its main limitations mirror the field's: the largest released checkpoint is Base, evaluation centers on a handful of public benchmarks, and cross-subject and cross-montage robustness in deployment remain active research questions.
Jiang, W., et al. (2024) Large Brain Model for Learning Generic Representations with Tremendous EEG Data in BCI. International Conference on Learning Representations.
DOI: 10.48550/arXiv.2405.18765Papers that recently cited this model.
The most-cited papers that cite this model.
Not enough data