bio.rodeo
ModelsOrganizationsLeaderboardAbout
bio.rodeo

The authoritative source for evaluating biological foundation models. No hype, just honest analysis.

AboutFAQSubmit a modelContact
© 2026 Pulsatance. All rights reserved. ~
Built by Pulsatance
Biosignals foundation models
Biosignals

BrainMass

Harbin Institute of Technology (Shenzhen) / Peng Cheng Laboratory

Self-supervised foundation model for functional brain network analysis from fMRI, pretrained on ~70K samples across 30 datasets for disorder diagnosis.

Released: March 2024

BrainMass is a self-supervised foundation model for analyzing functional brain networks derived from resting-state functional MRI (fMRI). Brain network analysis — modeling the brain as a graph of regions connected by functional correlations — is a powerful tool for studying neurological and psychiatric disorders, but individual studies are typically small, heterogeneous, and expensive to label. This data scarcity has historically limited the generalizability of deep learning models in the field. BrainMass addresses this by pretraining a single Transformer encoder on a large, pooled corpus and transferring it to many downstream diagnostic tasks with little or no task-specific supervision.

Developed by researchers at the Harbin Institute of Technology (Shenzhen) and the Peng Cheng Laboratory, the model was introduced in a March 2024 preprint and published in IEEE Transactions on Medical Imaging later that year. The authors position BrainMass as the first foundation model dedicated to brain network analysis, aggregating 70,781 samples from 46,686 participants across 30 datasets to learn broadly transferable representations of functional connectivity.

By learning general-purpose embeddings of brain networks, BrainMass aims to streamline a wide range of clinical and neuroscience workflows that would otherwise each require their own annotated training set, while also surfacing interpretable biomarkers tied to specific disorders.

#Key Features

  • Brain network foundation model: Pretrains a Transformer encoder on functional connectivity matrices, producing representations that transfer across disorders, sites, and acquisition protocols.
  • Mask-ROI Modeling (MRM): A masked-modeling objective that hides regions of interest and reconstructs them, strengthening intra-network dependencies and regional specificity within each brain graph.
  • Latent Representation Alignment (LRA): An alignment objective that regularizes augmented views of the same brain network, encouraging consistent, robust embeddings.
  • Pseudo-functional connectivity (pFC): A data-augmentation scheme that randomly drops BOLD-signal timepoints to synthesize millions of additional brain networks, vastly expanding effective training scale.
  • Few-shot and zero-shot capability: Learned representations support diagnosis with minimal or no labeled fine-tuning data, lowering the annotation burden for new clinical questions.

#Technical Details

BrainMass operates on functional connectivity matrices computed from parcellated resting-state fMRI, where each node is a region of interest (ROI) and edges encode BOLD-signal correlations. The model is a Transformer encoder pretrained with two complementary self-supervised objectives: Mask-ROI Modeling, which masks and reconstructs ROI-level features to capture intra-network structure, and Latent Representation Alignment, which aligns embeddings of augmented network views. The pseudo-functional connectivity augmentation generates millions of synthetic networks by randomly dropping timepoints from the underlying BOLD time series, supplying the diversity needed for large-scale pretraining. Pretraining draws on 70,781 samples from 46,686 participants pooled across 30 public and in-house datasets. The authors evaluate on eight internal tasks and seven external brain-disorder diagnosis tasks, reporting superior performance over task-specific baselines along with strong few-shot and zero-shot transfer.

#Applications

BrainMass is aimed at computational neuroscientists and clinical researchers working with resting-state fMRI to study and diagnose brain disorders such as autism, Parkinson's disease, and other neurological and psychiatric conditions. Because the pretrained encoder transfers across sites and cohorts, it is especially useful for small studies that lack the data to train a model from scratch, and its few/zero-shot behavior allows rapid prototyping of new diagnostic tasks. Beyond classification, the model's attention over ROIs can highlight candidate biomarkers, supporting interpretability-focused analyses of disorder-specific connectivity patterns.

#Impact

BrainMass demonstrates that the foundation-model paradigm — large-scale self-supervised pretraining followed by lightweight transfer — extends naturally to functional brain networks, a domain long hampered by fragmented, small datasets. By pooling tens of thousands of scans and releasing code and pretrained weights, it provides a reusable starting point that lowers the barrier to building robust fMRI-based diagnostic models and contributes to a growing wave of neuroimaging foundation models (alongside efforts such as BrainLM). Its emphasis on interpretable biomarkers and zero-shot generalization is particularly relevant for translational applications where labeled clinical data are scarce.

Citation

BrainMass: Advancing Brain Network Analysis for Diagnosis With Large-Scale Self-Supervised Learning

Yang, Y., et al. (2024) BrainMass: Advancing Brain Network Analysis for Diagnosis With Large-Scale Self-Supervised Learning. IEEE Transactions on Medical Imaging.

DOI: 10.1109/TMI.2024.3414476

Recent citations

Papers that recently cited this model.

Not enough citation data yet.

Top citations

The most-cited papers that cite this model.

Not enough citation data yet.

Citations

Total Citations50
Influential12
References94

GitHub

Stars25
Forks4
Open Issues1
Contributors1
Last Push1y ago
LanguagePython

Fields of citing research

Not enough data

Openness

bio.rodeo opennessClosed · low usability and reproducibility
18Closed
Usability — can I run it?21
Reproducibility — can I retrain it?16
Model Openness Framework
Unclassified
Restrictive license on core components

Tags

brain_networkdisease_diagnosisfmrifoundation_modelrepresentation_learningself_supervisedtransformerzero_shot

Resources

GitHub RepositoryResearch PaperResearch Paper