UltraSam

Promptable ultrasound image segmentation foundation model, a SAM adaptation trained on US-43d, the largest public ultrasound segmentation corpus.

Released: November 2024

Ultrasound is one of the most widely used clinical imaging modalities, yet it remains comparatively underserved by deep learning because of its low signal-to-noise ratio, speckle artifacts, low contrast, and operator-dependent acquisition. General-purpose segmentation foundation models such as the Segment Anything Model (SAM) and its medical adaptation MedSAM transfer poorly to this domain, where boundaries are diffuse and anatomical appearance varies dramatically across probes and body regions. UltraSam addresses this gap with a deliberately data-centric strategy: rather than designing a bespoke architecture, the authors assemble the largest public ultrasound segmentation corpus to date and use it to specialize a promptable SAM-style model for ultrasound.

UltraSam was developed by the CAMMA group at the University of Strasbourg (ICube, CNRS/INSERM) and IHU Strasbourg, and introduced by Adrien Meyer, Aditya Murali, Didier Mutter, and Nicolas Padoy in a preprint released in November 2024 and subsequently published in the International Journal of Computer Assisted Radiology and Surgery (IJCARS) in 2025. The model accepts point and bounding-box prompts to produce segmentation masks, and its pretrained weights serve as a strong initialization for a wide range of downstream ultrasound tasks.

The central contribution is twofold: US-43d, a unified collection of 43 open-access ultrasound segmentation datasets, and the UltraSam checkpoint trained on it. Together they provide the ultrasound community with both a reusable benchmark dataset and a ready-to-fine-tune foundation model, lowering the barrier for building segmentation and classification systems across diverse anatomical applications.

Key Features

Promptable ultrasound segmentation: Adapts the SAM architecture to accept point and bounding-box prompts, producing interactive masks tuned to the appearance of ultrasound rather than natural images.
US-43d training corpus: Aggregates 43 open-access datasets into a single harmonized resource of over 280,000 images and segmentation masks spanning more than 50 anatomical structures and roughly 20 clinical applications.
Strong transfer initialization: Pretrained weights consistently outperform ImageNet-, SAM-, and MedSAM-initialized models when fine-tuned on downstream segmentation and classification benchmarks.
Broad anatomical coverage: Training spans both 2D and 3D scans across organs and regions including cardiac, fetal, breast, thyroid, nerve, and musculoskeletal imaging.
Publicly available release: Code, the assembled dataset pipeline, and a pretrained checkpoint are publicly available under a non-commercial CC-BY-NC-SA-4.0 license, supporting reproducibility and reuse for non-commercial purposes.

Technical Details

UltraSam is built on the Segment Anything Model, which couples a Vision Transformer (ViT) image encoder with a prompt encoder and a lightweight mask decoder. The authors fine-tune this architecture end-to-end on US-43d, retraining the encoder so that learned representations reflect ultrasound-specific statistics rather than the natural-image distribution SAM was originally trained on. US-43d itself comprises 43 publicly available ultrasound segmentation datasets, totaling more than 280,000 images and corresponding masks for over 50 anatomical structures, making it the largest public ultrasound segmentation collection assembled at the time of release. In evaluations, UltraSam delivers substantially higher segmentation accuracy under point- and box-prompted settings than SAM and MedSAM, and when used as a pretrained backbone it improves downstream segmentation and classification performance over ImageNet, SAM, and MedSAM initializations across held-out ultrasound tasks.

Applications

UltraSam is intended as a general-purpose starting point for ultrasound image analysis. Clinicians and researchers can apply it directly for interactive, prompt-driven delineation of anatomy and lesions, or fine-tune it as a backbone for task-specific segmentation and classification pipelines in cardiology, obstetrics, breast and thyroid screening, regional anesthesia (nerve localization), and musculoskeletal imaging. Because it ships with a harmonized multi-dataset corpus assembled from open-access sources, it is also valuable to the methods community as a benchmark for developing and comparing new ultrasound segmentation approaches.

Impact

By demonstrating that a data-centric strategy, aggregating heterogeneous open datasets and specializing a strong general segmentation model, can yield a versatile ultrasound foundation model, UltraSam offers a practical template for under-resourced imaging modalities. The release of US-43d is itself a meaningful community contribution, consolidating fragmented public data into a reusable resource and lowering the cost of entry for ultrasound AI research. As a foundation-model initialization that outperforms widely used SAM and MedSAM baselines, UltraSam is positioned to accelerate development of downstream clinical ultrasound tools. Its principal limitations stem from reliance on assembled public datasets, which may carry uneven annotation quality and demographic or scanner biases, and on prompt-based interaction rather than fully automatic segmentation.

Citation

Ultrasam: a foundation model for ultrasound using large open-access segmentation datasets

Meyer, A., et al. (2024) Ultrasam: a foundation model for ultrasound using large open-access segmentation datasets. International Journal of Computer Assisted Radiology and Surgery.

DOI: 10.1007/s11548-025-03517-8

Recent citations

Papers that recently cited this model.

Learning from Complementary Ultrasound Representations for Liver Disease Classification
Sabahattin Mert Daloglu, Gokce Bekar, Ceren Çoşkun, et al.
Jul 2026
0
Analyzing Image Encoder Choices and Graph Homophily in GCN Frameworks for Breast Ultrasound Classification
Sabahattin Mert Daloglu, Ceren Çoşkun, Harvey Castro, et al.
Jul 2026
0
Vision Foundation Models in Radiology: A Scoping Review of Data, Methodology, Evaluation and Clinical Translation
A. Vergara-Richart, Xavier Rafael-Palou, A. Fuster-Matanzo, et al.
Jul 2026
0Influential

Top citations

The most-cited papers that cite this model.

USF-MAE: Ultrasound Self-Supervised Foundation Model with Masked Autoencoding
Youssef Megahed, R. Ducharme, M. Walker, et al.
arXiv.org · Oct 2025
11Influential
A Fully Open and Generalizable Foundation Model for Ultrasound Clinical Applications
Hongyuan Zhang, Yuheng Wu, Mingyang Zhao, et al.
arXiv.org · Sep 2025
10Influential
Grounding DINO-US-SAM: Text-Prompted Multiorgan Segmentation in Ultrasound With LoRA-Tuned Vision–Language Models
Hamza Rasaee, Taha Koleilat, Hassan Rivaz
IEEE Transactions on Ultrasonics, Ferroelectrics and Frequency Control · Jun 2025
4
Adapting Vision-Language Foundation Model for Next Generation Medical Ultrasound Image Analysis
Jingguo Qu, Xinyang Han, Tonghuan Xiao, et al.
Expert systems with applications · Jun 2025
4
US-JEPA: A Joint Embedding Predictive Architecture for Medical Ultrasound
A. Radhachandran, Vedrana Ivezi'c, Shreeram Athreya, et al.
arXiv.org · Feb 2026
2

Citations

Total Citations33

Influential4

References50

GitHub

Stars136

Forks22

Open Issues12

Contributors1

Last Push3mo ago

LanguagePython

Fields of citing research

Medicine100%
Computer Science84%
Engineering84%
Mathematics3%
Physics3%

Share of papers citing this model.

Openness

bio.rodeo opennessClosed · low usability and reproducibility

26Closed

Usability — can I run it?22

Reproducibility — can I retrain it?13

Model Openness Framework

Unclassified

Restrictive license on core components

Resources

GitHub Repository Research Paper

Key Features

Promptable ultrasound segmentation: Adapts the SAM architecture to accept point and bounding-box prompts, producing interactive masks tuned to the appearance of ultrasound rather than natural images.

US-43d training corpus: Aggregates 43 open-access datasets into a single harmonized resource of over 280,000 images and segmentation masks spanning more than 50 anatomical structures and roughly 20 clinical applications.

Strong transfer initialization: Pretrained weights consistently outperform ImageNet-, SAM-, and MedSAM-initialized models when fine-tuned on downstream segmentation and classification benchmarks.

Broad anatomical coverage: Training spans both 2D and 3D scans across organs and regions including cardiac, fetal, breast, thyroid, nerve, and musculoskeletal imaging.

Publicly available release: Code, the assembled dataset pipeline, and a pretrained checkpoint are publicly available under a non-commercial CC-BY-NC-SA-4.0 license, supporting reproducibility and reuse for non-commercial purposes.

Technical Details

Applications

Impact

Citation

Ultrasam: a foundation model for ultrasound using large open-access segmentation datasets

Meyer, A., et al. (2024) Ultrasam: a foundation model for ultrasound using large open-access segmentation datasets. International Journal of Computer Assisted Radiology and Surgery.

DOI: 10.1007/s11548-025-03517-8

Recent citations

Papers that recently cited this model.

Learning from Complementary Ultrasound Representations for Liver Disease Classification

Sabahattin Mert Daloglu, Gokce Bekar, Ceren Çoşkun, et al.

Jul 2026

Analyzing Image Encoder Choices and Graph Homophily in GCN Frameworks for Breast Ultrasound Classification

Sabahattin Mert Daloglu, Ceren Çoşkun, Harvey Castro, et al.

Jul 2026

Vision Foundation Models in Radiology: A Scoping Review of Data, Methodology, Evaluation and Clinical Translation

A. Vergara-Richart, Xavier Rafael-Palou, A. Fuster-Matanzo, et al.

Jul 2026

0Influential

UltraSam

#Key Features

#Technical Details

#Applications

#Impact

Citation

Ultrasam: a foundation model for ultrasound using large open-access segmentation datasets

Recent citations

Learning from Complementary Ultrasound Representations for Liver Disease Classification

Analyzing Image Encoder Choices and Graph Homophily in GCN Frameworks for Breast Ultrasound Classification

Vision Foundation Models in Radiology: A Scoping Review of Data, Methodology, Evaluation and Clinical Translation

Top citations

Related models

Citations

GitHub

Fields of citing research

Openness

Tags

Resources

UltraSam

#Key Features

#Technical Details

#Applications

#Impact

Citation

Ultrasam: a foundation model for ultrasound using large open-access segmentation datasets

Recent citations

Learning from Complementary Ultrasound Representations for Liver Disease Classification

Analyzing Image Encoder Choices and Graph Homophily in GCN Frameworks for Breast Ultrasound Classification

Vision Foundation Models in Radiology: A Scoping Review of Data, Methodology, Evaluation and Clinical Translation

Top citations

Related models

Citations

GitHub

Fields of citing research

Openness

Tags

Resources

Key Features

Technical Details

Applications

Impact

Key Features

Technical Details

Applications

Impact