LaMIM

West China Hospital of Sichuan University / NVIDIA

Brain MRI foundation model pretrained with masked image modeling on roughly 57,000 multi-contrast head scans for brain tumor diagnosis.

Released: April 2024

LaMIM (Large Medical Image foundation Model) is a self-supervised foundation model for multi-contrast brain MRI, developed by researchers at West China Hospital of Sichuan University (with a collaborator from NVIDIA) and published in European Radiology in 2024. It addresses a persistent bottleneck in medical imaging AI: high-quality labels are scarce and expensive, while unlabeled scans are abundant. By pretraining on a large pool of unlabeled head MRIs, LaMIM learns general-purpose volumetric representations that can be transferred to specific diagnostic tasks with comparatively little labeled data.

The work is framed as a pilot study demonstrating that the masked-image-modeling paradigm—which reshaped natural-image and language pretraining—translates to 3D whole-brain MRI. Rather than training a task-specific network from scratch, the authors pretrain a vision transformer autoencoder to reconstruct deliberately corrupted MRI volumes, then attach lightweight classifiers for downstream brain tumor applications. This positions LaMIM within the broader wave of medical imaging foundation models that seek to amortize the cost of annotation across many clinical tasks.

The model is notable for operating directly on multi-contrast 3D volumes (T1w, T1c, T2w, and FLAIR) and for releasing pretrained weights, making it a practical starting point for neuroimaging researchers building tumor-related classifiers.

Key Features

Self-supervised pretraining at scale: Trained on 57,621 enhanced head MRI scans without manual labels, learning brain anatomy and contrast structure from raw data.
Masked image modeling for 3D MRI: Uses cross-contrast context restoration with content-dropout (masking) schemes, requiring the model to reconstruct hidden volume blocks from surrounding context.
Two complementary masking granularities: Released checkpoints use either 16×16×16 blocks (86 blocks masked) or fine-grained 4×4×4 blocks (6,000 blocks), capturing both coarse and fine spatial structure.
Strong label efficiency: Fine-tuned classifiers initialized from pretrained weights outperformed convolutional models trained from scratch by roughly 10% in accuracy and AUC across all evaluated tasks.
Open pretrained weights: Both ViT checkpoints (block-16 and block-4) are distributed through the project repository for downstream reuse.

Technical Details

LaMIM is built on a vision transformer autoencoder (ViTAutoEnc) that ingests multi-contrast 3D brain MRI volumes. Pretraining follows a masked-image-modeling objective: input volumes are corrupted with content-dropout, and the model is trained to restore the missing regions using cross-contrast context, forcing it to learn anatomical and tissue-contrast priors. The authors release two variants distinguished by masking granularity—coarse 16×16×16 blocks versus fine 4×4×4 blocks. For downstream evaluation, classifiers were attached to the pretrained encoder and fine-tuned on labeled tumor data. On independent test sets, the pretrained models reached 94.9% accuracy (AUC 0.981) for brain tumor detection, 92.3% accuracy (AUC 0.972) for tumor discrimination, and 80.4% accuracy (AUC 0.852) for molecular status prediction, consistently surpassing from-scratch convolutional baselines.

Applications

LaMIM targets neuro-oncology imaging workflows, where it can serve as a pretrained backbone for brain tumor detection, tumor-type discrimination, and molecular status prediction from multi-contrast MRI. By transferring features learned from tens of thousands of unlabeled scans, it lets radiology and neuroimaging researchers build accurate classifiers from modest labeled cohorts, lowering the annotation burden for new diagnostic tasks. The released checkpoints provide a practical initialization for groups developing volumetric MRI classifiers without access to large in-house labeled datasets.

Impact

As an early demonstration that self-supervised masked image modeling can be applied to large-scale 3D multi-contrast brain MRI, LaMIM contributes to the growing evidence that foundation-model pretraining improves label efficiency and performance in neuroimaging. Its consistent ~10% gains over from-scratch baselines underscore the value of unlabeled clinical archives for building diagnostic models. As a pilot study with openly released weights, LaMIM is best viewed as a proof of concept and reusable starting point rather than a clinically validated tool; downstream results were obtained on the authors' tumor cohorts, and broader external validation across sites and scanners remains future work.

Citation

Medical image foundation models in assisting diagnosis of brain tumors: a pilot study

Chen, M., et al. (2024) Medical image foundation models in assisting diagnosis of brain tumors: a pilot study. European Radiology.

DOI: 10.1007/s00330-024-10728-1

Recent citations

Papers that recently cited this model.

Vision Foundation Models in Radiology: A Scoping Review of Data, Methodology, Evaluation and Clinical Translation
A. Vergara-Richart, Xavier Rafael-Palou, A. Fuster-Matanzo, et al.
Jul 2026
0Influential
Advanced Deep Learning Architectures in MRI-Based Brain Tumor Classification: A Systematic Review Focused on Meningiomas.
Naima Noor, C. Turner, Samantha J. Holdsworth, et al.
Journal of imaging informatics in medicine · Jul 2026
0
Machine learning and deep learning for neurological disease analysis: A systematic review across five major disorders.
Kazi Nur Uddin, Partho Ghose, Ebrima Njie, et al.
Neuroscience · Jun 2026
0

Top citations

The most-cited papers that cite this model.

Foundation models for radiology: fundamentals, applications, opportunities, challenges, risks, and prospects
Tugba Akinci D’Antonoli, Christian Bluethgen, Renato Cuocolo, et al.
Diagnostic and Interventional Radiology · Jul 2025
17
State-of-the-Art in Responsible, Explainable, and Fair AI for Medical Image Analysis
Soheyla Amirian, Fengyi Gao, Nickolas Littlefield, et al.
IEEE Access · 2025
16
Artificial intelligence in neurosurgery: a systematic review of applications, model comparisons, and ethical implications
J. Reyes, Vanshika N. Lohia, Timoteo Almeida, et al.
Neurosurgical review · May 2025
15
Building a General SimCLR Self-Supervised Foundation Model Across Neurological Diseases to Advance 3D Brain MRI Diagnoses
E. Kaczmarek, Justin Szeto, B. Nichyporuk, et al.
2025 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW) · Sep 2025
9Influential
Automatic identification of Parkinsonism using clinical multi-contrast brain MRI: a large self-supervised vision foundation model strategy
Xueling Suo, Mengyao Chen, Li Chen, et al.
EBioMedicine · May 2025
8

Citations

Total Citations22

Influential2

References32

GitHub

Stars19

Forks2

Open Issues1

Contributors1

Last Push2y ago

LanguagePython

Fields of citing research

Medicine100%
Computer Science81%
Engineering19%

Share of papers citing this model.

Openness

bio.rodeo opennessClosed · low usability and reproducibility

15Closed

Usability — can I run it?20

Reproducibility — can I retrain it?11

Model Openness Framework

Unclassified

Restrictive license on core components

Resources

GitHub Repository Research Paper

Key Features

Self-supervised pretraining at scale: Trained on 57,621 enhanced head MRI scans without manual labels, learning brain anatomy and contrast structure from raw data.

Masked image modeling for 3D MRI: Uses cross-contrast context restoration with content-dropout (masking) schemes, requiring the model to reconstruct hidden volume blocks from surrounding context.

Two complementary masking granularities: Released checkpoints use either 16×16×16 blocks (86 blocks masked) or fine-grained 4×4×4 blocks (6,000 blocks), capturing both coarse and fine spatial structure.

Strong label efficiency: Fine-tuned classifiers initialized from pretrained weights outperformed convolutional models trained from scratch by roughly 10% in accuracy and AUC across all evaluated tasks.

Open pretrained weights: Both ViT checkpoints (block-16 and block-4) are distributed through the project repository for downstream reuse.

Technical Details

Applications

Impact

Recent citations

Papers that recently cited this model.

Vision Foundation Models in Radiology: A Scoping Review of Data, Methodology, Evaluation and Clinical Translation

A. Vergara-Richart, Xavier Rafael-Palou, A. Fuster-Matanzo, et al.

Jul 2026

0Influential

Advanced Deep Learning Architectures in MRI-Based Brain Tumor Classification: A Systematic Review Focused on Meningiomas.

Naima Noor, C. Turner, Samantha J. Holdsworth, et al.

Journal of imaging informatics in medicine · Jul 2026

Machine learning and deep learning for neurological disease analysis: A systematic review across five major disorders.

Kazi Nur Uddin, Partho Ghose, Ebrima Njie, et al.

Neuroscience · Jun 2026

LaMIM

#Key Features

#Technical Details

#Applications

#Impact

Citation

Medical image foundation models in assisting diagnosis of brain tumors: a pilot study

Recent citations

Vision Foundation Models in Radiology: A Scoping Review of Data, Methodology, Evaluation and Clinical Translation

Top citations

Related models

Citations

GitHub

Fields of citing research

Openness

Tags

Resources

LaMIM

#Key Features

#Technical Details

#Applications

#Impact

Citation

Medical image foundation models in assisting diagnosis of brain tumors: a pilot study

Recent citations

Vision Foundation Models in Radiology: A Scoping Review of Data, Methodology, Evaluation and Clinical Translation

Top citations

Related models

Citations

GitHub

Fields of citing research

Openness

Tags

Resources

Key Features

Technical Details

Applications

Impact

Key Features

Technical Details

Applications

Impact