BrainLM

Yale University / Baylor College of Medicine / Princeton University

fMRI foundation model pretrained with masked autoencoding on roughly 6,700 hours of recordings for clinical prediction and network discovery.

Released: September 2023

BrainLM (Brain Language Model) is a foundation model for functional magnetic resonance imaging (fMRI) recordings, designed to learn a general-purpose representation of human brain activity dynamics. Rather than training a bespoke model for each neuroimaging task, BrainLM follows the self-supervised pretraining paradigm that transformed natural language and protein modeling: it is trained once on a large corpus of unlabeled brain recordings and then adapted, through fine-tuning or zero-shot inference, to a range of downstream problems. It was developed by researchers in David van Dijk's lab at Yale University, with collaborators at Baylor College of Medicine and Princeton University, and presented at ICLR 2024.

The model addresses a long-standing bottleneck in computational neuroscience: fMRI datasets are individually small and heterogeneous, making it difficult to train deep models that generalize across cohorts, scanners, and tasks. By pretraining on roughly 6,700 hours of fMRI from large population studies, BrainLM learns spatiotemporal structure that transfers to new datasets it never saw during training, including external clinical cohorts.

BrainLM is notable as one of the first large-scale foundation models built directly on whole-brain fMRI time series rather than on task-specific labels or static connectivity matrices. It demonstrates that brain recordings, like text or protein sequences, contain enough self-supervisory signal to support a single reusable backbone for neuroscience.

Key Features

Masked-prediction pretraining: BrainLM is trained to reconstruct masked segments of parcel time series (at masking ratios up to 90%), forcing the model to learn the underlying dynamics of brain activity without any labels.
Clinical variable prediction: After fine-tuning, the model predicts metadata and clinical variables such as age, neuroticism, anxiety, and PTSD scores directly from fMRI recordings.
Brain state forecasting: BrainLM can extrapolate future brain activity from a window of past recordings, treating fMRI as a sequence-modeling problem.
Zero-shot functional network discovery: Without any network-level supervision, the model's attention recovers intrinsic functional networks from raw fMRI, recapitulating known resting-state organization.
Cross-cohort generalization: Pretraining on large population data lets BrainLM transfer to entirely new external cohorts not seen during training.

Technical Details

BrainLM uses a Vision Transformer masked autoencoder (ViTMAE) architecture applied to fMRI time series parcellated with the AAL-424 atlas, yielding 424 regional signals sampled at roughly 1 Hz. The model is trained with a mean-squared-error reconstruction objective over masked spatiotemporal patches. Pretraining used approximately 6,700 hours of recordings: about 6,450 hours (76,296 recordings) from the UK Biobank and about 250 hours (1,002 recordings) from the Human Connectome Project, with motion correction, normalization, temporal filtering, and ICA denoising, split 80/10/10 into train/validation/test. Two checkpoints are released, with 111 million and 650 million parameters; the larger model uses flash attention. Training ran for 100 epochs with a batch size of 512 using the Adam optimizer.

Applications

BrainLM is aimed at neuroscientists and clinical researchers working with fMRI who want a pretrained backbone rather than training models from scratch on small studies. Practical uses include decoding cognitive and mental-health variables, forecasting future brain states, simulating the effects of interventions on brain dynamics through prompting, and discovering functional networks in an unsupervised way. Because pretrained weights are publicly available on HuggingFace, groups with limited labeled data can fine-tune for their own biomarkers or diagnostic targets.

Impact

BrainLM helped establish the foundation-model paradigm for brain activity recordings, showing that large-scale self-supervised pretraining on fMRI produces representations that transfer across cohorts and tasks. Its public 111M- and 650M-parameter checkpoints and ICLR 2024 publication have made it a reference point for subsequent neuroimaging foundation models. Important limitations remain: pretraining used only healthy adults, so generalization to clinical populations is uncertain; the approach is currently specific to fMRI and untested on other modalities; and BOLD fMRI is itself an indirect proxy for neural activity. The pretrained weights are released under a CC BY-NC-ND 4.0 license, and the UK Biobank training data requires a separate access application.

Citation

BrainLM: A foundation model for brain activity recordings

Preprint

Caro, J. O., et al. (2024) BrainLM: A foundation model for brain activity recordings. bioRxiv.

DOI: 10.1101/2023.09.12.557460

Recent citations

Papers that recently cited this model.

BrainFIBRE: A Foundation Model via Information Decomposition for Brain Microstructure
Zijian Dong, Yi Lin, Jixiang Fang, et al.
Jul 2026
0
BrainJanus: A Unified Model for Understanding and Generation across Brain, Vision, and Language
Haitao Wu, Qirui Zhang, Zhouheng Yao, et al.
Jun 2026
0
Beyond Single-Source Cognitive Taskonomy:Multi-Source Task Relations through fMRI Transfer Learning
Junfeng Xia, Wendu Li, Mengjiao Zhang, et al.
Jun 2026
0

Top citations

The most-cited papers that cite this model.

Neuro-GPT: Towards A Foundation Model For EEG
Wenhui Cui, Woojae Jeong, Philipp Tholke, et al.
IEEE International Symposium on Biomedical Imaging · Nov 2023
93
Brain-JEPA: Brain Dynamics Foundation Model with Gradient Positioning and Spatiotemporal Masking
Zijian Dong, Ruilin Li, Yilei Wu, et al.
Neural Information Processing Systems · Sep 2024
69Influential
BrainMass: Advancing Brain Network Analysis for Diagnosis With Large-Scale Self-Supervised Learning
Yanwu Yang, Chenfei Ye, Guinan Su, et al.
IEEE Transactions on Medical Imaging · Mar 2024
50
Data-Centric Foundation Models in Computational Healthcare: A Survey
Yunkun Zhang, Jin Gao, Zheling Tan, et al.
ACM Computing Surveys · Jan 2024
42Influential
Brain Foundation Models: A survey on advancements in neural signal processing and brain discovery
Xin-qiu Zhou, Chenyu Liu, Zhisheng Chen, et al.
IEEE Signal Processing Magazine · Mar 2025
34

Citations

Total Citations131

Influential22

References34

GitHub

Stars18

Forks2

Open Issues17

Contributors4

Last Push8mo ago

LanguageJupyter Notebook

HuggingFace

Downloads0

Likes20

Last Modified2y ago

Fields of citing research

Computer Science96%
Medicine64%
Biology39%
Engineering22%
Physics4%
Psychology4%
Mathematics4%
Linguistics2%

Share of papers citing this model.

Openness

bio.rodeo opennessClosed · low usability and reproducibility

25Closed

Usability — can I run it?20

Reproducibility — can I retrain it?15

Model Openness Framework

Unclassified

Restrictive license on core components

Resources

GitHub Repository Research Paper HuggingFace Model

Key Features

Masked-prediction pretraining: BrainLM is trained to reconstruct masked segments of parcel time series (at masking ratios up to 90%), forcing the model to learn the underlying dynamics of brain activity without any labels.

Clinical variable prediction: After fine-tuning, the model predicts metadata and clinical variables such as age, neuroticism, anxiety, and PTSD scores directly from fMRI recordings.

Brain state forecasting: BrainLM can extrapolate future brain activity from a window of past recordings, treating fMRI as a sequence-modeling problem.

Zero-shot functional network discovery: Without any network-level supervision, the model's attention recovers intrinsic functional networks from raw fMRI, recapitulating known resting-state organization.

Cross-cohort generalization: Pretraining on large population data lets BrainLM transfer to entirely new external cohorts not seen during training.

Technical Details

Applications

Impact

Recent citations

Papers that recently cited this model.

BrainFIBRE: A Foundation Model via Information Decomposition for Brain Microstructure

Zijian Dong, Yi Lin, Jixiang Fang, et al.

Jul 2026

BrainJanus: A Unified Model for Understanding and Generation across Brain, Vision, and Language

Haitao Wu, Qirui Zhang, Zhouheng Yao, et al.

Jun 2026

Beyond Single-Source Cognitive Taskonomy:Multi-Source Task Relations through fMRI Transfer Learning

Junfeng Xia, Wendu Li, Mengjiao Zhang, et al.

Jun 2026

BrainLM

#Key Features

#Technical Details

#Applications

#Impact

Citation

BrainLM: A foundation model for brain activity recordings

Recent citations

BrainFIBRE: A Foundation Model via Information Decomposition for Brain Microstructure

BrainJanus: A Unified Model for Understanding and Generation across Brain, Vision, and Language

Beyond Single-Source Cognitive Taskonomy:Multi-Source Task Relations through fMRI Transfer Learning

Top citations

Related models

Citations

GitHub

HuggingFace

Fields of citing research

Openness

Tags

Resources

BrainLM

#Key Features

#Technical Details

#Applications

#Impact

Citation

BrainLM: A foundation model for brain activity recordings

Recent citations

BrainFIBRE: A Foundation Model via Information Decomposition for Brain Microstructure

BrainJanus: A Unified Model for Understanding and Generation across Brain, Vision, and Language

Beyond Single-Source Cognitive Taskonomy:Multi-Source Task Relations through fMRI Transfer Learning

Top citations

Related models

Citations

GitHub

HuggingFace

Fields of citing research

Openness

Tags

Resources

Key Features

Technical Details

Applications

Impact

Key Features

Technical Details

Applications

Impact