MedLSAM

Shanghai AI Laboratory / Shanghai Jiao Tong University / University of Science and Technology of China / Sichuan University

3D CT localization foundation model that pairs MedLAM with SAM to segment any anatomical structure at a fixed, dataset-independent annotation cost.

Released: June 2023

MedLSAM ("Localize and Segment Anything Model") addresses a persistent bottleneck in 3D medical image segmentation: the Segment Anything Model (SAM) and its medical variants are promptable, but they still require a human to supply bounding boxes or points on every slice of every scan. For volumetric CT, where a single study contains hundreds of slices and a dataset may contain thousands of studies, this prompting cost scales with the data and quickly becomes prohibitive. MedLSAM is presented by its authors as the first complete medical adaptation of SAM that removes this per-image burden.

The system's central contribution is MedLAM, a foundational localization model for 3D medical images, developed by researchers at Shanghai AI Laboratory, Shanghai Jiao Tong University, the University of Science and Technology of China, and West China Hospital of Sichuan University, and released as a preprint in June 2023 (published in Medical Image Analysis, 2025). MedLAM learns to locate any anatomical structure in a CT volume from only a handful of annotated template scans. By pairing MedLAM-generated bounding boxes with SAM (or MedSAM) for the actual mask prediction, MedLSAM segments arbitrary organs across an entire dataset while keeping the manual annotation effort fixed regardless of dataset size.

Key Features

Localize-anything foundation model: MedLAM identifies any anatomical region in a 3D CT volume given only a few template scans, acting as a reusable anatomical localizer rather than a single-organ detector.
Constant annotation cost: The pipeline requires extreme-point annotations in three directions on just a few template images; this cost does not grow as the target dataset grows, unlike slice-by-slice prompting.
Self-supervised pretraining: MedLAM is trained without segmentation labels using two self-supervision tasks — unified anatomical mapping (relative distance regression) and multi-scale similarity.
Plug-in segmentation backend: MedLAM's predicted boxes drive SAM or MedSAM, so the framework benefits directly from improvements to the underlying promptable segmenter.
Fully automated dataset segmentation: Once templates are annotated, the system segments target anatomy across all remaining scans with no further human input.

Technical Details

MedLAM is trained on 14,012 CT scans using two self-supervised objectives: unified anatomical mapping (UAM), which regresses the relative 3D displacement between query and support patches to build a consistent anatomical coordinate frame, and multi-scale similarity (MSS), which refines localization through feature matching across scales. At inference, a few template scans with extreme-point annotations define each target structure, MedLAM predicts a 3D bounding box for that structure in unseen volumes, and the box is converted into prompts for SAM or MedSAM. The authors evaluate on two 3D datasets — StructSeg Task1 and WORD — covering 38 distinct organs. MedLAM's localization reaches accuracy comparable to fully supervised detection models, and MedLSAM matches the segmentation performance of SAM/MedSAM while eliminating the dataset-scaling annotation burden. Code is released under Apache-2.0, with public MedLAM checkpoints distributed alongside SAM ViT-B and MedSAM ViT-B weights.

Applications

MedLSAM is aimed at researchers and clinicians who need to segment organs or structures across large CT cohorts — for example, radiotherapy planning, organ-at-risk delineation, automated dataset labeling, and large-scale anatomical quantification. Because MedLAM is a general localizer, it is reusable across segmentation targets and downstream segmenters, making it useful both as a standalone anatomical localization tool and as a front end that turns interactive promptable segmenters into automated batch-processing pipelines.

Impact

By decoupling annotation effort from dataset size, MedLSAM offered an early and influential template for adapting the foundation-model "segment anything" paradigm to volumetric medical imaging, where naive per-slice prompting is impractical. The accompanying MedLAM localizer is of independent interest as a CT localization foundation model, and the project's public code and weights have made it a frequently cited reference point in the literature on SAM-for-medical-imaging. Its primary limitations are that MedLAM was trained and validated on CT (other modalities are not covered) and that final segmentation quality remains bounded by the underlying SAM/MedSAM backbone.

Citation

MedLSAM: Localize and segment anything model for 3D CT images

Lei, W., et al. (2024) MedLSAM: Localize and segment anything model for 3D CT images. Medical Image Anal..

DOI: 10.1016/j.media.2024.103370

Recent citations

Papers that recently cited this model.

Vision Foundation Models in Radiology: A Scoping Review of Data, Methodology, Evaluation and Clinical Translation
A. Vergara-Richart, Xavier Rafael-Palou, A. Fuster-Matanzo, et al.
Jul 2026
0
SparseMed3D: Foundation Models for Sparse Instance Medical Segmentation.
Erfan Darzidehkalani, Cheng-Huang Hsiao, Rina Bao, et al.
Journal of imaging informatics in medicine · Jul 2026
0
Structure-aware chain-of-thought for interpretable few-shot coronary CTA segmentation
Xinyu Bao, Chun Liu, Zhihui Zhang, et al.
Expert systems with applications · Jun 2026
2

Top citations

The most-cited papers that cite this model.

BrainSegFounder: Towards 3D foundation models for neuroimage segmentation
Joseph Cox, Peng Liu, S. Stolte, et al.
Medical Image Anal. · Jun 2024
67
SAM-Med3D: A Vision Foundation Model for General-Purpose Segmentation on Volumetric Medical Images
Haoyu Wang, Sizheng Guo, Jin Ye, et al.
IEEE Transactions on Neural Networks and Learning Systems · Jul 2025
42
Retrieval-Augmented Few-Shot Medical Image Segmentation With Foundation Models
Lin Zhao, Xiao Chen, Eric Z. Chen, et al.
IEEE Transactions on Neural Networks and Learning Systems · Aug 2024
29
CellSAM: a foundation model for cell segmentation
Markus Marks, Uriah Israel, Rohit Dilip, et al.
Nature Methods · Dec 2025
27
Innovating robot-assisted surgery through large vision models
Zhe Min, Jiewen Lai, Hongliang Ren
Nature Reviews Electrical Engineering · May 2025
25Influential

Citations

Total Citations82

Influential6

References43

GitHub

Stars522

Forks24

Open Issues17

Contributors1

Last Push2y ago

LanguagePython

LicenseApache-2.0

Fields of citing research

Computer Science95%
Medicine94%
Engineering38%
Biology8%
Environmental Science1%
Materials Science1%
Sociology1%

Share of papers citing this model.

Openness

bio.rodeo opennessFully open · usable and reproducible

76Open

Usability — can I run it?94

Reproducibility — can I retrain it?53

Model Openness Framework

Unclassified

Missing required components

Resources

GitHub Repository Research Paper Research Paper

Key Features

Localize-anything foundation model: MedLAM identifies any anatomical region in a 3D CT volume given only a few template scans, acting as a reusable anatomical localizer rather than a single-organ detector.

Constant annotation cost: The pipeline requires extreme-point annotations in three directions on just a few template images; this cost does not grow as the target dataset grows, unlike slice-by-slice prompting.

Self-supervised pretraining: MedLAM is trained without segmentation labels using two self-supervision tasks — unified anatomical mapping (relative distance regression) and multi-scale similarity.

Plug-in segmentation backend: MedLAM's predicted boxes drive SAM or MedSAM, so the framework benefits directly from improvements to the underlying promptable segmenter.

Fully automated dataset segmentation: Once templates are annotated, the system segments target anatomy across all remaining scans with no further human input.

Technical Details

Applications

Impact

Recent citations

Papers that recently cited this model.

Vision Foundation Models in Radiology: A Scoping Review of Data, Methodology, Evaluation and Clinical Translation

A. Vergara-Richart, Xavier Rafael-Palou, A. Fuster-Matanzo, et al.

Jul 2026

SparseMed3D: Foundation Models for Sparse Instance Medical Segmentation.

Erfan Darzidehkalani, Cheng-Huang Hsiao, Rina Bao, et al.

Journal of imaging informatics in medicine · Jul 2026

Structure-aware chain-of-thought for interpretable few-shot coronary CTA segmentation

Xinyu Bao, Chun Liu, Zhihui Zhang, et al.

Expert systems with applications · Jun 2026

MedLSAM

#Key Features

#Technical Details

#Applications

#Impact

Citation

MedLSAM: Localize and segment anything model for 3D CT images

Recent citations

Vision Foundation Models in Radiology: A Scoping Review of Data, Methodology, Evaluation and Clinical Translation

Top citations

Related models

Citations

GitHub

Fields of citing research

Openness

Tags

Resources

MedLSAM

#Key Features

#Technical Details

#Applications

#Impact

Citation

MedLSAM: Localize and segment anything model for 3D CT images

Recent citations

Vision Foundation Models in Radiology: A Scoping Review of Data, Methodology, Evaluation and Clinical Translation

Top citations

Related models

Citations

GitHub

Fields of citing research

Openness

Tags

Resources

Key Features

Technical Details

Applications

Impact

Key Features

Technical Details

Applications

Impact