bio.rodeo
ModelsOrganizationsLeaderboardAbout
bio.rodeo

The authoritative source for evaluating biological foundation models. No hype, just honest analysis.

AboutFAQSubmit a modelContact
© 2026 Pulsatance. All rights reserved. ~
Built by Pulsatance
Biosignals foundation models
Biosignals

BrainOmni

Tsinghua University / Shanghai AI Laboratory / University of Cambridge / University College London

A brain foundation model that unifies EEG and MEG signals via a shared discrete tokenizer and self-supervised pretraining on 2,653 hours of recordings.

Released: May 2025
Parameters: 33 Million

Electroencephalography (EEG) and magnetoencephalography (MEG) both measure the electromagnetic activity of populations of neurons, but they are typically modeled in isolation: EEG and MEG use different sensors, different channel layouts, and different recording montages, so a decoder trained on one configuration rarely transfers to another. This fragmentation has limited brain-signal modeling to small, dataset-specific networks, even as foundation models have transformed genomics, proteins, and natural language. BrainOmni addresses this gap by learning a single representation that spans both modalities and arbitrary sensor arrangements.

Introduced in May 2025 and accepted to NeurIPS 2025, BrainOmni is presented by its authors as the first brain foundation model to unify EEG and MEG. It was developed by a collaboration led by Chao Zhang's group at Tsinghua University (the OpenTSLab team), together with Shanghai AI Laboratory, the MRC Cognition and Brain Sciences Unit at the University of Cambridge, and University College London. The model is released openly with code and pretrained weights under an MIT license.

The core idea is a two-stage design: a tokenizer first converts heterogeneous raw recordings into a shared vocabulary of discrete brain tokens, and a transformer is then pretrained on these tokens with a self-supervised objective. This decouples the model from any specific electrode or sensor layout and lets EEG and MEG contribute to the same representation space.

#Key Features

  • Unified EEG and MEG modeling: A single pretrained model ingests both EEG and MEG recordings, rather than requiring separate modality-specific decoders.
  • BrainTokenizer with sensor encoding: A residual vector-quantization tokenizer (4 codebook levels, 16 latent source variables) discretizes spatiotemporal brain activity, while a dedicated Sensor Encoder handles varying channel counts and montages.
  • Criss-cross transformer: BrainOmni applies separate spatial and temporal attention pathways so that both the sensor geometry and the time dynamics of brain signals are captured.
  • Self-supervised pretraining at scale: The model is pretrained on 1,997 hours of EEG and 656 hours of MEG curated from public datasets, with no task labels required.
  • Open release in two sizes: Tiny (8.4M parameters) and base (33M parameters) checkpoints, plus the tokenizer, are available on Hugging Face under an MIT license.

#Technical Details

BrainOmni separates signal quantization from semantic learning. BrainTokenizer encodes raw multi-channel recordings into discrete tokens using residual vector quantization with four codebook levels and 16 latent source variables, and a Sensor Encoder maps differing electrode or sensor configurations into a common space so heterogeneous datasets can be tokenized consistently. The BrainOmni backbone is a criss-cross transformer with distinct spatial and temporal attention, pretrained self-supervised over 2,653 total hours (1,997 EEG, 656 MEG) drawn from sources such as TUAB and numerous OpenNeuro datasets. Two sizes are released: 8.4M-parameter tiny and 33M-parameter base. On downstream benchmarks the base model reports balanced accuracies of 0.828 on AD65 (vs. 0.711 for LaBraM) and 0.819 on TUAB for EEG, 0.651 on the ASD74 MEG task, and 0.832 on a multimodal SomatoMotor benchmark, an improvement of roughly 20% over specialized baselines.

#Applications

BrainOmni is intended as a general-purpose backbone for neural decoding across both EEG and MEG. Researchers can fine-tune it for clinical and cognitive tasks such as detecting Alzheimer's disease or autism spectrum signatures, abnormal-EEG screening, motor-imagery classification for brain-computer interfaces, and somatomotor decoding. Because the tokenizer normalizes across montages and sensors, the model is especially useful when combining datasets recorded on different hardware, or when a target task has too little labeled data to train a decoder from scratch. The open MIT-licensed weights make it practical to adopt in academic neuroscience and BCI pipelines.

#Impact

By demonstrating that EEG and MEG can share a single tokenized representation, BrainOmni extends the foundation-model paradigm to multimodal neurophysiology and provides a route to pooling the field's fragmented public datasets. Its reported gains over modality-specific foundation models such as LaBraM and CBraMod, together with strong multimodal transfer, suggest that cross-modality pretraining is a productive direction for brain-signal modeling. As a recent NeurIPS 2025 contribution, its broader influence is still emerging, and the released checkpoints are modest in size (8.4M-33M parameters) compared with foundation models in other domains, leaving headroom for scaling in future work.

Citation

BrainOmni: A Brain Foundation Model for Unified EEG and MEG Signals

Preprint

Xiao, Q., et al. (2025) BrainOmni: A Brain Foundation Model for Unified EEG and MEG Signals. arXiv.org.

DOI: 10.48550/arXiv.2505.18185

Recent citations

Papers that recently cited this model.

Not enough citation data yet.

Top citations

The most-cited papers that cite this model.

Not enough citation data yet.

Citations

Total Citations23
Influential2
References77

GitHub

Stars67
Forks8
Open Issues1
Contributors1
Last Push6mo ago
LanguageJupyter Notebook

HuggingFace

Downloads0
Likes2
Last Modified7mo ago

Fields of citing research

Not enough data

Openness

bio.rodeo opennessFully open · usable and reproducible
80Open
Usability — can I run it?95
Reproducibility — can I retrain it?66
Model Openness Framework
Unclassified
Missing required components

Tags

brain_signal_classificationeegfoundation_modelmegmultimodalneural_decodingrepresentation_learningself_supervisedtransformervector_quantization

Resources

GitHub RepositoryResearch PaperHuggingFace Model