SAM-Med2D

Medical imaging adaptation of the Segment Anything Model, fine-tuned on 4.6M images and 19.7M masks for promptable segmentation across 10 modalities.

Released: August 2023

SAM-Med2D adapts Meta AI's Segment Anything Model (SAM), a promptable, general-purpose image segmentation foundation model, to the medical imaging domain. The original SAM was trained on natural images and performs worse on medical scans, where boundaries are subtle, contrast is low, and objects differ markedly from everyday photographs. SAM-Med2D narrows this domain gap through large-scale fine-tuning on curated medical data. It was released in August 2023 by researchers at Shanghai AI Laboratory (OpenGVLab) together with academic collaborators.

The core contribution is the assembly of one of the largest medical segmentation corpora to date and a fine-tuning recipe that injects medical domain knowledge into SAM while keeping inference promptable. The authors collected and curated approximately 4.6 million images and 19.7 million masks (the SA-Med2D-20M dataset) spanning 10 imaging modalities, 4 anatomical structure groups plus lesions, and 31 major human organs. Rather than retraining from scratch, they freeze the heavy image encoder and insert lightweight learnable adapter layers, then fine-tune the prompt encoder and mask decoder interactively.
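The freeze-and-adapt idea can be illustrated with a small sketch. The code below is a generic residual bottleneck adapter written in NumPy, not the authors' actual adapter module; the class name `BottleneckAdapter`, the helper `frozen_block`, the ReLU activation, and the dimensions (768 and 64) are all assumptions chosen for illustration. It shows two properties that make adapters attractive: a zero-initialized up-projection leaves the frozen model's output unchanged at the start of fine-tuning, and the adapter adds far fewer parameters than the layer it wraps.

```python
import numpy as np


class BottleneckAdapter:
    """Generic residual bottleneck adapter (illustrative only)."""

    def __init__(self, dim, bottleneck, seed=0):
        rng = np.random.default_rng(seed)
        # Down-projection: small random initialization.
        self.w_down = rng.normal(0.0, 0.02, size=(dim, bottleneck))
        self.b_down = np.zeros(bottleneck)
        # Up-projection starts at zero, so the adapter begins as an identity map.
        self.w_up = np.zeros((bottleneck, dim))
        self.b_up = np.zeros(dim)

    def __call__(self, x):
        hidden = np.maximum(x @ self.w_down + self.b_down, 0.0)  # ReLU
        return x + hidden @ self.w_up + self.b_up  # residual connection

    def num_params(self):
        params = (self.w_down, self.b_down, self.w_up, self.b_up)
        return sum(p.size for p in params)


def frozen_block(x, w_frozen):
    """Stand-in for a frozen Transformer sub-layer: a fixed linear map."""
    return x @ w_frozen


dim, bottleneck, tokens = 768, 64, 16
rng = np.random.default_rng(1)
w_frozen = rng.normal(0.0, 0.02, size=(dim, dim))  # pretrained weights, never updated
adapter = BottleneckAdapter(dim, bottleneck)

x = rng.normal(size=(tokens, dim))
y = adapter(frozen_block(x, w_frozen))

trainable = adapter.num_params()  # only these would receive gradient updates
frozen = w_frozen.size
print(f"trainable adapter params: {trainable}, frozen layer params: {frozen}")
```

In a real training setup the same pattern would be expressed in a deep learning framework by disabling gradients on the encoder weights and optimizing only the adapter, prompt encoder, and mask decoder parameters.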

SAM-Med2D fits into the rapidly growing family of medical segmentation foundation models (alongside efforts such as MedSAM) that aim to provide a single, promptable model usable across organs and modalities, reducing the need to train a bespoke network for every new segmentation task.

Key Features

Promptable medical segmentation: Accepts point, bounding-box, and mask prompts, supporting both fully interactive and semi-automatic clinical annotation workflows.
Adapter-based fine-tuning: Freezes SAM's image encoder and learns adapter layers within each Transformer block, efficiently capturing domain-specific medical knowledge.
Broad modality coverage: Trained across 10 modalities (including CT, MR, and X-ray), 31 major organs, and lesion classes for wide anatomical generality.
Large-scale medical mask corpus: Built on the SA-Med2D-20M dataset of ~4.6M images and ~19.7M masks aggregated from public and private sources, one of the largest medical segmentation corpora at the time of release.
Open and reproducible: Code released under Apache-2.0 with public checkpoints and the training dataset card hosted on Hugging Face.
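To make the prompt types listed above concrete, here is a minimal sketch of how a bounding-box prompt and a point prompt could be derived from a ground-truth mask, which is a common way to simulate user input when training or evaluating promptable models. The function names `box_prompt` and `point_prompt` are hypothetical, and the (x, y) point order and (x_min, y_min, x_max, y_max) box layout follow the convention of the original SAM predictor; whether SAM-Med2D's own scripts use exactly this layout is an assumption.

```python
import random


def foreground_coords(mask):
    """List (x, y) positions of all non-zero pixels in a nested-list mask."""
    return [(x, y) for y, row in enumerate(mask) for x, v in enumerate(row) if v]


def box_prompt(mask):
    """Tightest (x_min, y_min, x_max, y_max) box around the foreground."""
    coords = foreground_coords(mask)
    if not coords:
        raise ValueError("mask has no foreground pixels")
    xs = [x for x, _ in coords]
    ys = [y for _, y in coords]
    return (min(xs), min(ys), max(xs), max(ys))


def point_prompt(mask, rng):
    """Sample one positive click (x, y) uniformly from the foreground."""
    coords = foreground_coords(mask)
    if not coords:
        raise ValueError("mask has no foreground pixels")
    return rng.choice(coords)


mask = [
    [0, 0, 0, 0, 0],
    [0, 1, 1, 0, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 0, 0, 0],
]

box = box_prompt(mask)  # (1, 1, 3, 2)
point = point_prompt(mask, random.Random(0))
print("box prompt:", box)
print("point prompt:", point)
```

A box prompt carries more information about object extent than a single click, which is consistent with box-prompted segmentation typically scoring higher than single-point segmentation.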

Technical Details

SAM-Med2D retains SAM's ViT-based image encoder, prompt encoder, and lightweight mask decoder. During adaptation the image encoder is frozen and augmented with learnable adapter modules in each Transformer block, while the prompt encoder and mask decoder are fine-tuned through interactive, multi-prompt training at a default 256×256 input resolution. On evaluation, the authors report roughly 79.3% Dice with bounding-box prompts and about 70.0% Dice with a single point prompt, and they validate generalization on 9 MICCAI 2023 challenge datasets. The released checkpoints run at interactive speeds (around 35 FPS). The accompanying SA-Med2D-20M dataset is distributed under CC-BY-NC-SA-4.0, while the model code and weights are released under Apache-2.0.
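The Dice scores quoted above measure the overlap between a predicted mask P and a reference mask T as 2|P∩T| / (|P| + |T|). The following is a minimal sketch of that standard formula for binary masks; it is not the authors' evaluation code, and the function name `dice_score` and the convention of returning 1.0 when both masks are empty are assumptions.

```python
def dice_score(pred, target):
    """Dice coefficient for two binary masks given as nested lists of 0/1."""
    p = [v for row in pred for v in row]
    t = [v for row in target for v in row]
    if len(p) != len(t):
        raise ValueError("masks must have the same number of pixels")

    intersection = sum(1 for a, b in zip(p, t) if a and b)
    total = sum(1 for a in p if a) + sum(1 for b in t if b)
    if total == 0:
        return 1.0  # assumed convention: two empty masks agree perfectly
    return 2.0 * intersection / total


pred = [
    [1, 1, 0],
    [0, 1, 0],
]
target = [
    [1, 0, 0],
    [0, 1, 1],
]

score = dice_score(pred, target)  # 2 * 2 / (3 + 3) = 0.666...
print(f"Dice: {score:.3f}")
```

Here the two masks share 2 foreground pixels and each contains 3, so the score is 4/6, or about 66.7%.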

Applications

SAM-Med2D is suited to interactive and semi-automatic annotation of 2D medical images, helping radiologists, pathologists, and biomedical researchers segment organs, lesions, and structures across CT, MR, X-ray, and other modalities with minimal prompting. It can accelerate the creation of labeled datasets for downstream supervised models, serve as a strong initialization for task-specific segmentation, and act as a general-purpose backbone for medical image analysis pipelines where training a dedicated model per task is impractical.

Impact

By pairing SAM with one of the largest curated medical mask collections, SAM-Med2D became a widely referenced baseline for evaluating how segmentation foundation models transfer to medical imaging, and a practical starting point for groups building promptable annotation tools. Its open code, public checkpoints, and the SA-Med2D-20M dataset lowered the barrier to research on medical segmentation foundation models. Key limitations include its 2D-only scope (it does not natively model 3D volumetric context), its reliance on user prompts for best performance, and the non-commercial license on the underlying dataset, which constrains some downstream uses.

Citation

SAM-Med2D

Preprint

Cheng, J., et al. (2023) SAM-Med2D.

DOI: 10.48550/arXiv.2308.16184

Recent citations

Papers that recently cited this model.

XCT-SAM: Sequential Parameter-Efficient Domain Adaptation of SAM for Industrial XCT Defect Segmentation
Mahamudul Hasan, Md. Mushfiqur Rahaman, Alan Pachkovskiy, et al.
Jul 2026
Cited by: 0
MedDistill: Multilevel Distilled Co-Learning for Unsupervised Medical Image Segmentation
Te Guo, Tianyu Shen, Hongwei Yu, et al.
IEEE Sensors Journal · Jul 2026
Cited by: 0
Decoupling Language Guidance from Backbones for Text-Guided Medical Segmentation
Yung-Hsing Liu, Xuan Fang, Haijin Zeng, et al.
Jul 2026
Cited by: 0

Top citations

The most-cited papers that cite this model.

Segment Anything Model for Medical Image Segmentation: Current Applications and Future Directions
Yichi Zhang, Zhenrong Shen, Rushi Jiao
Comput. Biol. Medicine · Jan 2024
Cited by: 331
Medical SAM 2: Segment medical images as video via Segment Anything Model 2
Jiayuan Zhu, Yunli Qi, Junde Wu
arXiv.org · Aug 2024
Cited by: 230
SAM-Med3D: Towards General-Purpose Segmentation Models for Volumetric Medical Images
Haoyu Wang, Sizheng Guo, Jin Ye, et al.
ECCV Workshops · Oct 2023
Cited by: 173
Foundation Model for Advancing Healthcare: Challenges, Opportunities and Future Directions
Yuting He, Fuxiang Huang, Xinrui Jiang, et al.
IEEE Reviews in Biomedical Engineering · Apr 2024
Cited by: 134
I-MedSAM: Implicit Medical Image Segmentation with Segment Anything
Xiaobao Wei, Jiajun Cao, Yizhu Jin, et al.
European Conference on Computer Vision · Nov 2023
Cited by: 47

Citations

Total Citations: 258

Influential: 28

References: 42

GitHub

Stars: 1.1K

Forks: 109

Open Issues: 51

Contributors: 5

Last Push: 2y ago

Language: Jupyter Notebook

License: Apache-2.0

Fields of citing research

Computer Science: 98%
Medicine: 89%
Engineering: 34%
Biology: 3%
Environmental Science: 2%
Physics: 0%
Materials Science: 0%
Linguistics: 0%

Share of papers citing this model.

Openness

bio.rodeo openness: Fully open (usable and reproducible)

Score: 82 (Open)

Usability (can I run it?): 100

Reproducibility (can I retrain it?): 57

Model Openness Framework: Class III (Open Model)

Resources

GitHub Repository, Research Paper, Dataset
