bio.rodeo
ModelsOrganizationsLeaderboardAbout
bio.rodeo

The authoritative source for evaluating biological foundation models. No hype, just honest analysis.

AboutFAQSubmit a modelContact
© 2026 Pulsatance. All rights reserved. ~
Built by Pulsatance
Imaging foundation models
Imaging

SAM-Med3D

Shanghai AI Laboratory

A fully 3D, promptable segmentation foundation model for volumetric medical images, trained on ~22K 3D scans and ~143K masks for general-purpose anatomy segmentation.

Released: October 2023

SAM-Med3D adapts the promptable segmentation paradigm of Meta AI's Segment Anything Model (SAM) to volumetric medical imaging by rebuilding the architecture to be fully 3D. The original SAM and its medical 2D derivative SAM-Med2D operate slice-by-slice, processing each axial plane independently and requiring a prompt on every slice to segment a 3D structure. This discards the rich spatial context that runs through CT and MR volumes. SAM-Med3D instead encodes whole volumes natively, capturing inter-slice context and allowing a clinician to segment an entire 3D anatomy from as few as one prompt point.

The model was released in October 2023 by researchers at Shanghai AI Laboratory (General Vision Group) and academic collaborators, with code and weights distributed through the uni-medical GitHub organization. Its central contribution is twofold: a fully volumetric re-implementation of SAM's image encoder, prompt encoder, and mask decoder using 3D operations, and the assembly of one of the largest volumetric medical segmentation corpora to date for training it.

SAM-Med3D sits within the family of medical segmentation foundation models (alongside MedSAM and SAM-Med2D) that aim to replace bespoke per-task networks with a single promptable model. By moving from 2D to genuine 3D, it targets the dominant data modalities in radiology, where volumetric reasoning is essential.

#Key Features

  • Fully 3D architecture: Re-engineers SAM's encoder, prompt encoder, and decoder with volumetric operations so it processes entire scans natively rather than slice-by-slice.
  • Prompt efficiency: Segments a 3D structure from as few as one prompt point per volume, requiring roughly 10–100x fewer prompts than slice-wise SAM/SAM-Med2D for comparable results.
  • Large-scale volumetric training: Trained on the SA-Med3D-140K corpus of ~22,000 3D images and ~143,000 3D masks spanning 245 anatomical categories.
  • Broad zero-shot generalization: Validated across 16 volumetric datasets covering diverse organs, lesions, and modalities, including held-out zero-shot transfer.
  • Open release: Code and checkpoints are distributed under Apache-2.0, with the SAM-Med3D-turbo variant fine-tuned on 44 datasets for stronger general-purpose performance.

#Technical Details

SAM-Med3D mirrors SAM's three-component design — image encoder, prompt encoder, and mask decoder — but replaces 2D operations with 3D counterparts so the network reasons over volumetric patches and produces 3D masks directly. It is trained in two stages on the SA-Med3D-140K dataset, aggregating roughly 22K 3D scans and 143K masks across 245 categories from public and licensed private sources. Because a single point prompt propagates spatially through the volume, the model needs far fewer interactions than slice-wise methods, and the authors report substantial Dice improvements over SAM and SAM-Med2D at matched or lower prompt budgets, while running at a fraction of their inference time for 3D targets. A subsequent SAM-Med3D-turbo checkpoint, fine-tuned on 44 datasets, further improves general-purpose accuracy and is the recommended weight in the repository.

#Applications

SAM-Med3D is well suited to interactive and semi-automatic segmentation of CT and MR volumes, helping radiologists and biomedical researchers delineate organs, tumors, and other structures with minimal prompting. Because it operates on whole volumes, it accelerates 3D annotation pipelines, provides strong initialization for task-specific volumetric segmentation models, and can serve as a general-purpose backbone for medical image analysis workflows where training a dedicated 3D network per task is impractical.

#Impact

SAM-Med3D demonstrated that a genuinely 3D promptable foundation model can outperform slice-wise adaptations of SAM on volumetric data while drastically reducing the prompting burden, making it a widely cited reference point for medical segmentation foundation models and spawning follow-up work such as SAM-Med3D-MoE. Its open code, public checkpoints, and the SA-Med3D-140K dataset lowered the barrier to research on volumetric promptable segmentation. Key limitations include continued reliance on user prompts for best performance, sensitivity to modalities and structures underrepresented in training, and licensing constraints on portions of the underlying private data.

Citations

SAM-Med3D: Towards General-Purpose Segmentation Models for Volumetric Medical Images

Wang, H., et al. (2023) SAM-Med3D: Towards General-Purpose Segmentation Models for Volumetric Medical Images. ECCV Workshops.

DOI: 10.1007/978-3-031-91721-9_4

SAM-Med3D: Towards General-Purpose Segmentation Models for Volumetric Medical Images

Preprint

Wang, H., et al. (2023) SAM-Med3D: Towards General-Purpose Segmentation Models for Volumetric Medical Images. ECCV Workshops.

DOI: 10.48550/arXiv.2310.15161

Recent citations

Papers that recently cited this model.

Not enough citation data yet.

Top citations

The most-cited papers that cite this model.

Not enough citation data yet.

Citations

Total Citations172
Influential28
References41

GitHub

Stars930
Forks120
Open Issues0
Contributors3
Last Push8mo ago
LanguagePython
LicenseApache-2.0

HuggingFace

Downloads0
Likes2
Last Modified1y ago

Fields of citing research

Not enough data

Openness

bio.rodeo opennessFully open · usable and reproducible
95Open
Usability — can I run it?100
Reproducibility — can I retrain it?95
Model Openness Framework
Class I
Open Science

Tags

foundation_modelradiologysegmentationtransfer_learningvision_transformerzero_shot

Resources

GitHub RepositoryResearch PaperHuggingFace ModelDataset