SAM-Med3D

Fully 3D promptable segmentation foundation model for volumetric CT and MR, encoding whole volumes so anatomy can be segmented from one prompt point.

Released: October 2023

SAM-Med3D adapts the promptable segmentation paradigm of Meta AI's Segment Anything Model (SAM) to volumetric medical imaging by rebuilding the architecture to be fully 3D. The original SAM and its medical 2D derivative SAM-Med2D operate slice-by-slice, processing each axial plane independently and requiring a prompt on every slice to segment a 3D structure. This discards the rich spatial context that runs through CT and MR volumes. SAM-Med3D instead encodes whole volumes natively, capturing inter-slice context and allowing a clinician to segment an entire 3D anatomy from as few as one prompt point.
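The prompting difference described above can be made concrete with some simple counting. The sketch below is illustrative only: the function names are hypothetical, and it assumes one point per slice for the slice-wise workflow and one point per volume for the volumetric one. It does not reproduce the authors' evaluation protocol.

```python
def slicewise_prompt_count(slices_with_target: int, points_per_slice: int = 1) -> int:
    """Prompts needed when every slice containing the target is prompted separately."""
    if slices_with_target < 0 or points_per_slice < 1:
        raise ValueError("slice count must be >= 0 and points_per_slice >= 1")
    return slices_with_target * points_per_slice


def volumetric_prompt_count(points_per_volume: int = 1) -> int:
    """Prompts needed when a single 3D model segments the whole structure at once."""
    if points_per_volume < 1:
        raise ValueError("points_per_volume must be >= 1")
    return points_per_volume


def prompt_reduction(slices_with_target: int,
                     points_per_slice: int = 1,
                     points_per_volume: int = 1) -> float:
    """Ratio of slice-wise prompts to volumetric prompts for the same structure."""
    return (slicewise_prompt_count(slices_with_target, points_per_slice)
            / volumetric_prompt_count(points_per_volume))


if __name__ == "__main__":
    # Hypothetical structures spanning 10, 40, and 100 axial slices.
    for n_slices in (10, 40, 100):
        print(n_slices,
              slicewise_prompt_count(n_slices),
              volumetric_prompt_count(),
              prompt_reduction(n_slices))
```

Under these assumptions, a structure visible on 40 slices needs 40 slice-wise prompts against a single volumetric one. The saving grows with the number of slices the structure spans.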

The model was released in October 2023 by researchers at Shanghai AI Laboratory (General Vision Group) and academic collaborators, with code and weights distributed through the uni-medical GitHub organization. Its central contribution is twofold: a fully volumetric re-implementation of SAM's image encoder, prompt encoder, and mask decoder using 3D operations, and the assembly of one of the largest volumetric medical segmentation corpora to date for training it.
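One way to picture what a volumetric encoder does differently is to count the patch tokens it attends over. The sketch below is a rough illustration, not the repository's code: the helper names are hypothetical, and the input and patch sizes are assumed for the example rather than taken from this page.

```python
from math import prod
from typing import Tuple


def patch_grid(shape: Tuple[int, ...], patch: Tuple[int, ...]) -> Tuple[int, ...]:
    """Number of non-overlapping patches along each axis of an image or volume."""
    if len(shape) != len(patch):
        raise ValueError("shape and patch must have the same number of dimensions")
    for size, step in zip(shape, patch):
        if step < 1 or size % step != 0:
            raise ValueError("each dimension must be a positive multiple of its patch size")
    return tuple(size // step for size, step in zip(shape, patch))


def token_count(shape: Tuple[int, ...], patch: Tuple[int, ...]) -> int:
    """Total number of patch tokens the encoder would attend over."""
    return prod(patch_grid(shape, patch))


if __name__ == "__main__":
    # Assumed 2D setting: one 1024x1024 slice split into 16x16 patches.
    print(patch_grid((1024, 1024), (16, 16)), token_count((1024, 1024), (16, 16)))
    # Assumed 3D setting: one 128x128x128 crop split into 16x16x16 patches.
    print(patch_grid((128, 128, 128), (16, 16, 16)), token_count((128, 128, 128), (16, 16, 16)))
```

In the 2D case every token comes from a single slice, so attention cannot relate neighbouring slices. In the 3D case each token covers a small cube, so the same attention mechanism mixes information along the depth axis as well.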

SAM-Med3D sits within the family of medical segmentation foundation models (alongside MedSAM and SAM-Med2D) that aim to replace bespoke per-task networks with a single promptable model. By moving from 2D to genuine 3D, it targets the dominant data modalities in radiology, where volumetric reasoning is essential.

Key Features

Fully 3D architecture: Re-engineers SAM's encoder, prompt encoder, and decoder with volumetric operations so it processes entire scans natively rather than slice-by-slice.
Prompt efficiency: Segments a 3D structure from as few as one prompt point per volume, requiring roughly 10–100x fewer prompts than slice-wise SAM/SAM-Med2D for comparable results.
Large-scale volumetric training: Trained on the SA-Med3D-140K corpus of ~22,000 3D images and ~143,000 3D masks spanning 245 anatomical categories.
Broad zero-shot generalization: Validated across 16 volumetric datasets covering diverse organs, lesions, and modalities, including held-out zero-shot transfer.
Open release: Code and checkpoints are distributed under Apache-2.0, with the SAM-Med3D-turbo variant fine-tuned on 44 datasets for stronger general-purpose performance.

Technical Details

SAM-Med3D mirrors SAM's three-component design (image encoder, prompt encoder, and mask decoder) but replaces 2D operations with 3D counterparts, so the network reasons over volumetric patches and produces 3D masks directly. It is trained in two stages on the SA-Med3D-140K dataset, which aggregates roughly 22K 3D scans and 143K masks across 245 categories from public and licensed private sources. Because a single point prompt propagates spatially through the volume, the model needs far fewer interactions than slice-wise methods. The authors report substantial Dice improvements over SAM and SAM-Med2D at matched or lower prompt budgets, at a fraction of their inference time for 3D targets. A subsequent SAM-Med3D-turbo checkpoint, fine-tuned on 44 datasets, further improves general-purpose accuracy and is the recommended weight in the repository.
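The Dice score used in these comparisons is the standard overlap measure, 2|A ∩ B| / (|A| + |B|), computed over voxels. The sketch below is a minimal pure-Python version that treats a 3D mask as a set of (z, y, x) coordinates. The `box` helper and the example masks are hypothetical, and scoring two empty masks as 1.0 is a convention assumed here rather than one stated by the authors.

```python
from itertools import product
from typing import Set, Tuple

Voxel = Tuple[int, int, int]


def box(z0: int, z1: int, y0: int, y1: int, x0: int, x1: int) -> Set[Voxel]:
    """Voxel coordinates of an axis-aligned box, half-open on every axis."""
    return set(product(range(z0, z1), range(y0, y1), range(x0, x1)))


def dice(pred: Set[Voxel], target: Set[Voxel]) -> float:
    """Dice coefficient between two 3D masks given as sets of foreground voxels."""
    if not pred and not target:
        return 1.0  # assumed convention: two empty masks agree perfectly
    return 2 * len(pred & target) / (len(pred) + len(target))


if __name__ == "__main__":
    target = box(0, 4, 0, 4, 0, 4)      # 4x4x4 ground-truth cube, 64 voxels
    prediction = box(1, 5, 0, 4, 0, 4)  # same cube shifted by one slice along z
    print(dice(prediction, target))     # 48 shared voxels -> 2*48 / 128 = 0.75
```

Because the score is computed over the whole volume, an error along the slice axis (here a one-slice shift) lowers it just as an in-plane error would.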

Applications

SAM-Med3D is well suited to interactive and semi-automatic segmentation of CT and MR volumes, helping radiologists and biomedical researchers delineate organs, tumors, and other structures with minimal prompting. Because it operates on whole volumes, it accelerates 3D annotation pipelines, provides strong initialization for task-specific volumetric segmentation models, and can serve as a general-purpose backbone for medical image analysis workflows where training a dedicated 3D network per task is impractical.

Impact

SAM-Med3D demonstrated that a genuinely 3D promptable foundation model can outperform slice-wise adaptations of SAM on volumetric data while drastically reducing the prompting burden, making it a widely cited reference point for medical segmentation foundation models and spawning follow-up work such as SAM-Med3D-MoE. Its open code, public checkpoints, and the SA-Med3D-140K dataset lowered the barrier to research on volumetric promptable segmentation. Key limitations include continued reliance on user prompts for best performance, sensitivity to modalities and structures underrepresented in training, and licensing constraints on portions of the underlying private data.

Citations

SAM-Med3D: Towards General-Purpose Segmentation Models for Volumetric Medical Images

Wang, H., et al. (2023) SAM-Med3D: Towards General-Purpose Segmentation Models for Volumetric Medical Images. ECCV Workshops.

DOI: 10.1007/978-3-031-91721-9_4

Preprint

Wang, H., et al. (2023) SAM-Med3D: Towards General-Purpose Segmentation Models for Volumetric Medical Images. arXiv preprint arXiv:2310.15161.

DOI: 10.48550/arXiv.2310.15161

Recent citations

Papers that recently cited this model.

UniMedSeg: Unified In-Context Learning for Multi-Paradigm 2D/3D Medical Image Segmentation
Yunzhou Li, Jiesi Hu, Yanwu Yang, et al.
Jul 2026 · 0 citations

Correction-aware interactive 3D tumor segmentation with sparse and revisable prompts
Hao Li, Haoxuan Li
The Visual Computer · Jun 2026 · 0 citations · Influential

LETT-NeXt: A Lightweight RECIST-Guided Model for 3D CT Lesion Segmentation
Sebastian Aas, E. Stenhede, A. Ranjbar
Jun 2026 · 0 citations

Top citations

The most-cited papers that cite this model.

Medical Image Analysis
Zongwei Zhou, V. Sodha, Jiaxuan Pang, et al.
458 citations

Segment Anything Model for Medical Image Segmentation: Current Applications and Future Directions
Yichi Zhang, Zhenrong Shen, Rushi Jiao
Comput. Biol. Medicine · Jan 2024 · 331 citations

Medical Image Segmentation: A Comprehensive Review of Deep Learning-Based Methods
Yuxiao Gao, Yang Jiang, Yanhong Peng, et al.
Tomography · Apr 2025 · 96 citations

MedSAM2: Segment Anything in 3D Medical Images and Videos
Jun Ma, Zongxin Yang, Sumin Kim, et al.
arXiv.org · Apr 2025 · 96 citations

Swin-UMamba†: Adapting Mamba-Based Vision Foundation Models for Medical Image Segmentation
Jiarun Liu, Hao Yang, Hong-Yu Zhou, et al.
IEEE Transactions on Medical Imaging · Nov 2024 · 90 citations

Citations

Total Citations: 184
Influential: 29
References: 41

GitHub

Stars: 944
Forks: 121
Open Issues: 0
Contributors: 3
Last Push: 10mo ago
Language: Python
License: Apache-2.0

HuggingFace

Downloads: 0
Likes: 3
Last Modified: 1y ago

Fields of citing research

Computer Science: 96%
Medicine: 95%
Engineering: 39%
Physics: 2%
Biology: 2%
Agricultural and Food Sciences: 1%
Materials Science: 1%
Environmental Science: 1%

Share of papers citing this model.

Openness

bio.rodeo openness: Fully open · usable and reproducible
Score: 95 (Open)
Usability (can I run it?): 100
Reproducibility (can I retrain it?): 95
Model Openness Framework: Class I (Open Science)

Resources

GitHub Repository · Research Paper · HuggingFace Model · Dataset
