Beijing Institute of Technology / Imperial College London / Beijing Tiantan Hospital / Capital Medical University
A universal foundation model for brain lesion segmentation on multi-modal brain MRI, using a Mixture of Modality Experts to handle diverse modalities and lesion types.
Brain lesions—including tumors, strokes, white matter hyperintensities, and multiple sclerosis plaques—appear with markedly different characteristics depending on the MRI modality used to image them. As a result, automated segmentation has historically relied on task-specific models trained for a single lesion type on a single set of modalities, limiting their reuse and generalization. MoME (Mixture of Modality Experts) addresses this fragmentation by providing a single universal foundation model capable of segmenting many lesion types across heterogeneous multi-modal brain MRI.
Introduced in May 2024 by researchers at the Beijing Institute of Technology, Imperial College London, and Beijing Tiantan Hospital (Capital Medical University), MoME was early-accepted to MICCAI 2024. Its central idea borrows from the Mixture of Experts paradigm: rather than forcing one network to learn every modality at once, MoME assembles a team of expert networks, each specialized for a particular imaging modality, and combines their predictions through a learned gating mechanism. This design lets the model exploit modality-specific cues while still producing a unified segmentation.
By treating lesion segmentation as a universal rather than narrow task, MoME fits into the broader move toward generalist medical-imaging foundation models—models intended to serve as reusable backbones across many clinical datasets and acquisition protocols rather than being retrained from scratch for each new study.
MoME is built on the widely used nnU-Net segmentation framework, with the mixture-of-experts structure layered on top of convolutional U-Net backbones. Each modality expert is a segmentation network, and a hierarchical gating network fuses their outputs; a curriculum learning schedule maintains expert specialization during joint training. The model was developed and evaluated using nine public and in-house brain lesion datasets—including BraTS, ATLAS, OASIS, ISLES, WMH2017, and MSSEG—spanning five MRI modalities and eight lesion types across 17 segmentation tasks. The authors report that MoME outperforms state-of-the-art universal segmentation models across different modalities and lesion types, while also generalizing to unseen datasets. Pretrained checkpoints for the modality experts and the full MoME model are released, and the code is available under an Apache-2.0 license.
MoME targets neuroimaging research and clinical workflows where brain lesions must be delineated across diverse MRI protocols, such as tumor volumetry, stroke lesion quantification, multiple sclerosis lesion load tracking, and white matter hyperintensity assessment. Because one model handles multiple modalities and lesion types, it is well suited to multi-site studies and heterogeneous clinical archives where acquisition protocols vary, reducing the need to build and maintain a separate segmentation pipeline for each lesion type or scanner. Radiology researchers and medical image analysis groups benefit from a reusable backbone that can be applied or fine-tuned across many lesion segmentation tasks.
MoME contributes to the growing effort to build universal foundation models for medical image segmentation, demonstrating that a modality-aware mixture of experts can outperform single-network universal models on brain MRI. Its public Apache-2.0 code and released checkpoints lower the barrier for groups working on brain lesion analysis to adopt or extend the approach. The work was subsequently extended into a peer-reviewed journal version, "A Foundation Model for Lesion Segmentation on Brain MRI With Mixture of Modality Experts," published in IEEE Transactions on Medical Imaging (vol. 44, no. 6, pp. 2594–2604, 2025), which adds handling of combined multi-modality inputs via a soft-assignment dispatch network. Checkpoints for this extended model (MoME+) are distributed alongside the original on the project's GitHub and Hugging Face repositories, signaling continued development of the MoME line beyond the original MICCAI conference paper.
Zhang, X., et al. (2025) A Foundation Model for Lesion Segmentation on Brain MRI With Mixture of Modality Experts. IEEE Transactions on Medical Imaging.
DOI: 10.1109/TMI.2025.3540809Zhang, X., et al. (2024) A Foundation Model for Brain Lesion Segmentation with Mixture of Modality Experts. International Conference on Medical Image Computing and Computer-Assisted Intervention.
DOI: 10.48550/arXiv.2405.10246Papers that recently cited this model.
The most-cited papers that cite this model.
Not enough data