Mass General Brigham / Dana-Farber Cancer Institute / Brigham and Women's Hospital / Harvard Medical School / Boston Children's Hospital
A self-supervised vision foundation model for structural brain MRI that produces general-purpose features adaptable to diverse downstream clinical and neuroscience tasks.
BrainIAC (Brain Imaging Adaptive Core) is a vision foundation model that learns generalized representations from unlabeled structural brain MRI and serves as a reusable backbone for a wide range of downstream clinical and neuroscience prediction tasks. Rather than training a separate bespoke network for each application, BrainIAC provides a single pretrained encoder that can be adapted — often with only a small amount of labeled data — to problems spanning radiology, neuro-oncology, and aging research.
The model was developed by investigators in the Artificial Intelligence in Medicine (AIM) program at Mass General Brigham, together with collaborators at Dana-Farber Cancer Institute, Brigham and Women's Hospital, Harvard Medical School, and Boston Children's Hospital. It was first released as a medRxiv preprint in December 2024 and subsequently published in Nature Neuroscience in 2026. BrainIAC addresses a persistent gap in medical imaging AI: most brain MRI models are narrow, task-specific, and data-hungry, whereas a foundation-model approach amortizes the cost of representation learning across many applications.
By demonstrating that self-supervised pretraining on large unlabeled MRI collections transfers effectively to clinically meaningful endpoints, BrainIAC extends the foundation-model paradigm — already established in protein and genomics modeling — into structural neuroimaging.
BrainIAC uses a Vision Transformer backbone (ViT-B/16) pretrained with the SimCLR contrastive self-supervised objective. The authors compared multiple pretraining strategies and selected SimCLR-ViT-B for its consistent performance under limited labeled data. Pretraining and validation drew on a corpus of 48,965 diverse brain MRI scans. The model was evaluated across several downstream tasks, including MR sequence classification, brain age prediction, isocitrate dehydrogenase (IDH) mutation classification in low-grade glioma, mild cognitive impairment classification, diffuse glioma overall-survival prediction, time-to-stroke prediction, and tumor segmentation. Reported results include a brain-age mean absolute error of 6.55 years, a time-to-stroke MAE of 38.87 days, and an AUC of 0.79 for IDH mutation prediction — outperforming conventional task-specific frameworks, with the largest gains where labeled data were scarce or task complexity was high.
BrainIAC is intended as a shared starting point for teams building brain MRI analysis tools across radiology, neuro-oncology, neurology, and aging research. Clinicians and researchers can adapt the pretrained encoder to estimate brain age, flag tumor genotypes such as IDH mutation status, stratify glioma survival, identify mild cognitive impairment, classify MR sequences, or segment lesions — typically with far less labeled data than a from-scratch model would require. A hosted platform lowers the barrier for groups without deep machine-learning infrastructure.
BrainIAC demonstrates that the foundation-model approach generalizes to structural brain MRI, providing a single pretrained backbone that matches or exceeds purpose-built models while dramatically reducing the labeled data needed for new tasks. Its public release of code and weights, together with a hosted platform, makes it a practical baseline and starting point for neuroimaging AI. Key limitations include a license restricting use to non-commercial academic research, a focus on structural (rather than functional or diffusion) MRI, and the need for site-specific validation before any clinical deployment.
Tak, D., et al. (2024) A foundation model for generalized brain MRI analysis. medRxiv.
DOI: 10.1101/2024.12.02.24317992Tak, D., et al. (2026) A generalizable foundation model for analysis of human brain MRI. Nature Neuroscience.
DOI: 10.1038/s41593-026-02202-6Papers that recently cited this model.
The most-cited papers that cite this model.
Not enough data