NeuroSTORM

Chinese University of Hong Kong / Massachusetts General Hospital / Yonsei University / University of Sydney / Peking University / University of Georgia / Lehigh University / Emory University

Spatiotemporal foundation model that learns representations directly from 4D functional MRI volumes for disease diagnosis and phenotype prediction.

Released: June 2025

Functional MRI (fMRI) measures brain activity through blood-oxygen-level-dependent (BOLD) signals, producing 4D data (three spatial dimensions over time) that is central to cognitive neuroscience and clinical neuroimaging. Yet most fMRI analysis pipelines remain bespoke: studies typically reduce the raw 4D volumes to handcrafted features such as region-of-interest time series or functional connectivity matrices, then train narrow, single-task models that transfer poorly across cohorts, scanners, and sites. This fragmentation contributes to well-documented reproducibility and generalization problems in brain imaging.

NeuroSTORM (Neuroimaging Foundation Model with Spatial-Temporal Optimized Representation Modeling) addresses this by learning generalizable representations directly from full 4D fMRI volumes, rather than from pre-extracted summaries. The model was developed by a multi-institution collaboration led by the Chinese University of Hong Kong with Massachusetts General Hospital, Yonsei University, the University of Sydney, Peking University, the University of Georgia, Lehigh University, and Emory University, first released as a preprint in June 2025 and published in Nature Biomedical Engineering in 2026.

Its central contribution is a scalable pretraining recipe paired with lightweight task adaptation, allowing a single pretrained backbone to be transferred efficiently to a wide range of downstream brain-imaging problems instead of training a new network for each study.

Key Features

Direct 4D volume modeling: NeuroSTORM operates on raw voxel-based 4D fMRI rather than ROI time series or connectivity matrices, preserving spatial and temporal structure that handcrafted features discard.
Shifted-Window Mamba backbone: A Shifted-Window Mamba (SWM) design combines linear-time state-space sequence modeling with a shifted scanning strategy, making it computationally feasible to process long, high-dimensional 4D volumes.
Large-scale, diverse pretraining: The model is pretrained on 28.65 million fMRI frames from more than 50,000 subjects across multiple centers and ages spanning 5 to 100 years.
Lightweight task adaptation: In addition to full fine-tuning, NeuroSTORM supports task-specific prompt learning, enabling fast transfer to new tasks with modest compute and data.
Broad task coverage: A single backbone supports five evaluation tasks — demographic prediction, phenotype prediction, disease diagnosis, re-identification (fMRI retrieval), and task-state classification.

Technical Details

NeuroSTORM is a self-supervised foundation model built on a Shifted-Window Mamba backbone, released in several configurations (including Base and Large variants, as well as low-resolution and long-sequence variants tuned for different compute and input regimes). Pretraining uses self-supervised objectives — masked autoencoding together with contrastive learning — applied directly to 4D volumes, drawing on 28.65 million frames from over 50,000 subjects across multiple sites and a wide age range. Downstream evaluation spans large open cohorts such as UK Biobank, the Adolescent Brain Cognitive Development (ABCD) study, the Human Connectome Project, ABIDE, and ADHD-200, alongside multi-hospital clinical datasets covering 17 diagnoses. Across these benchmarks NeuroSTORM reports consistent improvements over prior fMRI methods, with strong results on disease diagnosis and cognitive phenotype prediction and demonstrated clinical utility on data from hospitals in the United States, South Korea, and Australia.

Applications

NeuroSTORM is intended as a reusable backbone for both neuroscience research and clinical neuroimaging. Researchers can fine-tune or prompt-adapt the pretrained model for demographic and cognitive phenotype prediction, brain-state decoding, and subject re-identification, reducing the per-study engineering and labeling burden. Its demonstrated performance on multi-hospital cohorts across three countries points toward clinical decision-support applications such as psychiatric and neurological disease diagnosis from fMRI, where consistent, transferable representations across sites and scanners are particularly valuable.

Impact

NeuroSTORM extends the foundation-model paradigm to 4D functional neuroimaging, a modality that has lagged behind protein, genomic, and natural-image domains in part because of the cost of modeling full spatiotemporal volumes. By showing that a single backbone pretrained on tens of millions of frames can transfer across demographic, phenotypic, diagnostic, retrieval, and state-classification tasks, it offers a concrete template for addressing the reproducibility and transferability challenges that have limited fMRI machine learning. The code is released under the Apache-2.0 license at the CUHK-AIM-Group repository, supporting both pretraining and fine-tuning workflows; broad reuse will depend on the continued availability and documentation of the pretrained checkpoints.

Citation

Towards a general-purpose foundation model for functional MRI analysis.

Wang, C., et al. (2026) Towards a general-purpose foundation model for functional MRI analysis.. Nature Biomedical Engineering.

DOI: 10.1038/s41551-026-01666-y

Recent citations

Papers that recently cited this model.

Foundation Models in Cancer Pathology: Techniques, Applications, and Future Directions
Bo Zhang, Victor Cui, Tong Wu, et al.
Research · May 2026
0Influential
Same Brain, Different Prediction: How Preprocessing Choices Undermine EEG Decoding Reliability
Dengzhe Hou, Zihao Wu, Lin Jiang, et al.
May 2026
1
Rhamba: Region-Aware Hybrid Attention-Mamba Framework for Self-Supervised Learning in Resting-State fMRI
Ruthwik Reddy Doodipala, Pankaj Pandey, Pratheek Eranki, et al.
May 2026
0

Top citations

The most-cited papers that cite this model.

Position: A Dynamical Systems Perspective is Needed to Advance Time Series Modeling
D. Durstewitz, Christoph Jurgen Hemmer, Florian Hess, et al.
Feb 2026
3
Same Brain, Different Prediction: How Preprocessing Choices Undermine EEG Decoding Reliability
Dengzhe Hou, Zihao Wu, Lin Jiang, et al.
May 2026
1
Rhamba: Region-Aware Hybrid Attention-Mamba Framework for Self-Supervised Learning in Resting-State fMRI
Ruthwik Reddy Doodipala, Pankaj Pandey, Pratheek Eranki, et al.
May 2026
0
Foundation Models in Cancer Pathology: Techniques, Applications, and Future Directions
Bo Zhang, Victor Cui, Tong Wu, et al.
Research · May 2026
0Influential

Citations

Total Citations4

Influential1

References32

GitHub

Stars116

Forks21

Open Issues0

Contributors1

Last Push9d ago

LanguagePython

LicenseApache-2.0

Fields of citing research

Computer Science100%
Engineering50%
Medicine25%
Mathematics25%
Physics25%

Share of papers citing this model.

Openness

bio.rodeo opennessFully open · usable and reproducible

78Open

Usability — can I run it?94

Reproducibility — can I retrain it?57

Model Openness Framework

Unclassified

Missing required components

Resources

GitHub Repository Research Paper Research Paper

Key Features

Direct 4D volume modeling: NeuroSTORM operates on raw voxel-based 4D fMRI rather than ROI time series or connectivity matrices, preserving spatial and temporal structure that handcrafted features discard.

Shifted-Window Mamba backbone: A Shifted-Window Mamba (SWM) design combines linear-time state-space sequence modeling with a shifted scanning strategy, making it computationally feasible to process long, high-dimensional 4D volumes.

Large-scale, diverse pretraining: The model is pretrained on 28.65 million fMRI frames from more than 50,000 subjects across multiple centers and ages spanning 5 to 100 years.

Lightweight task adaptation: In addition to full fine-tuning, NeuroSTORM supports task-specific prompt learning, enabling fast transfer to new tasks with modest compute and data.

Broad task coverage: A single backbone supports five evaluation tasks — demographic prediction, phenotype prediction, disease diagnosis, re-identification (fMRI retrieval), and task-state classification.

Technical Details

Applications

Impact

Recent citations

Papers that recently cited this model.

Foundation Models in Cancer Pathology: Techniques, Applications, and Future Directions

Bo Zhang, Victor Cui, Tong Wu, et al.

Research · May 2026

0Influential

Same Brain, Different Prediction: How Preprocessing Choices Undermine EEG Decoding Reliability

Dengzhe Hou, Zihao Wu, Lin Jiang, et al.

May 2026

Rhamba: Region-Aware Hybrid Attention-Mamba Framework for Self-Supervised Learning in Resting-State fMRI

Ruthwik Reddy Doodipala, Pankaj Pandey, Pratheek Eranki, et al.

May 2026

Top citations

The most-cited papers that cite this model.

Position: A Dynamical Systems Perspective is Needed to Advance Time Series Modeling

D. Durstewitz, Christoph Jurgen Hemmer, Florian Hess, et al.

Feb 2026

Same Brain, Different Prediction: How Preprocessing Choices Undermine EEG Decoding Reliability

Dengzhe Hou, Zihao Wu, Lin Jiang, et al.

May 2026

Rhamba: Region-Aware Hybrid Attention-Mamba Framework for Self-Supervised Learning in Resting-State fMRI

Ruthwik Reddy Doodipala, Pankaj Pandey, Pratheek Eranki, et al.

May 2026

Foundation Models in Cancer Pathology: Techniques, Applications, and Future Directions

Bo Zhang, Victor Cui, Tong Wu, et al.

Research · May 2026

0Influential

NeuroSTORM

#Key Features

#Technical Details

#Applications

#Impact

Citation

Towards a general-purpose foundation model for functional MRI analysis.

Recent citations

Foundation Models in Cancer Pathology: Techniques, Applications, and Future Directions

Same Brain, Different Prediction: How Preprocessing Choices Undermine EEG Decoding Reliability

Rhamba: Region-Aware Hybrid Attention-Mamba Framework for Self-Supervised Learning in Resting-State fMRI

Top citations

Position: A Dynamical Systems Perspective is Needed to Advance Time Series Modeling

Same Brain, Different Prediction: How Preprocessing Choices Undermine EEG Decoding Reliability

Rhamba: Region-Aware Hybrid Attention-Mamba Framework for Self-Supervised Learning in Resting-State fMRI

Foundation Models in Cancer Pathology: Techniques, Applications, and Future Directions

Related models

Citations

GitHub

Fields of citing research

Openness

Tags

Resources

NeuroSTORM

#Key Features

#Technical Details

#Applications

#Impact

Citation

Towards a general-purpose foundation model for functional MRI analysis.

Recent citations

Foundation Models in Cancer Pathology: Techniques, Applications, and Future Directions

Same Brain, Different Prediction: How Preprocessing Choices Undermine EEG Decoding Reliability

Rhamba: Region-Aware Hybrid Attention-Mamba Framework for Self-Supervised Learning in Resting-State fMRI

Top citations

Position: A Dynamical Systems Perspective is Needed to Advance Time Series Modeling

Same Brain, Different Prediction: How Preprocessing Choices Undermine EEG Decoding Reliability

Rhamba: Region-Aware Hybrid Attention-Mamba Framework for Self-Supervised Learning in Resting-State fMRI

Foundation Models in Cancer Pathology: Techniques, Applications, and Future Directions

Related models

Citations

GitHub

Fields of citing research

Openness

Tags

Resources

Key Features

Technical Details

Applications

Impact

Key Features

Technical Details

Applications

Impact