USFM

Ultrasound foundation model pretrained on over two million multi-organ images, transferring to segmentation, classification, and image enhancement.

Released: August 2024

USFM (Universal Ultrasound Foundation Model) is a self-supervised foundation model for medical ultrasound image analysis, developed by the Laboratory of Medical Imaging and Artificial Intelligence at Fudan University and published in Medical Image Analysis in 2024. Ultrasound is among the most widely used clinical imaging modalities, but deep-learning models for it have historically been narrow: trained organ-by-organ and task-by-task, requiring large annotated datasets that are costly to acquire because ultrasound interpretation depends on specialist expertise. USFM aims to break this bottleneck with a single pretrained backbone that transfers across organs, diseases, and task types.

The model addresses two challenges that make ultrasound harder to model than natural images or other medical scans. First, ultrasound images are noisy and low-contrast, with speckle and operator-dependent acquisition that obscure anatomical structure. Second, pretraining data must span many organs and devices to generalize. To handle these, USFM is pretrained on a large multi-organ, multi-center, multi-device database of over two million ultrasound images using a novel spatial-frequency dual masked image modeling objective designed to learn robust features despite degraded image quality.

By learning general-purpose ultrasound representations once and fine-tuning them on small labeled datasets, USFM positions itself as a label-efficient backbone for the full spectrum of downstream ultrasound tasks rather than a single-purpose classifier or segmenter.

Key Features

Spatial-frequency dual masked image modeling: USFM extends masked image modeling with a frequency band-stop masking branch alongside spatial masking, forcing the model to reconstruct information in both the image and frequency domains for more robust feature learning on noisy ultrasound.
Multi-organ, multi-device pretraining: Self-supervised pretraining on 2M+ ultrasound images from many organs, centers, and devices, with organ-balanced sampling to avoid bias toward data-rich anatomies.
Task-agnostic transfer: A single backbone supports segmentation, classification, and image enhancement/restoration across diverse organs and diseases without task-specific architectural redesign.
Label efficiency: Achieves strong downstream performance with as little as 20% of the annotations required by training from scratch, reducing the expert-labeling burden that limits clinical ultrasound AI.
Open weights and code: Pretrained weights and fine-tuning code are released publicly, including a toy dataset for reproducing the segmentation and classification pipelines.

Technical Details

USFM uses a Vision Transformer (ViT) backbone pretrained with a self-supervised spatial-frequency dual masked image modeling scheme. The spatial branch follows masked image modeling with a noise addition-and-recovery formulation suited to ultrasound speckle, while the frequency branch applies band-stop masking so the model must recover suppressed frequency components. Pretraining draws on a curated database of more than two million ultrasound images spanning multiple organs, clinical centers, and ultrasound devices, with organ-balanced sampling to promote generalizability. For downstream evaluation, the pretrained encoder is paired with standard task heads (for example SegViT or UperNet for segmentation), and the authors report that USFM matches or exceeds competing approaches across segmentation, classification, and image-enhancement benchmarks while using substantially fewer labeled examples and fewer training epochs. Released weights (USFM_latest.pth) are distributed under a CC-BY-NC 4.0 license.

Applications

USFM serves as a transferable backbone for clinical and research ultrasound analysis: segmenting lesions and anatomical structures, classifying disease (such as benign-versus-malignant assessment), and enhancing low-quality scans. Researchers and clinical AI developers benefit most, because the pretrained model lets them build accurate task-specific systems from small annotated datasets, lowering the barrier for ultrasound applications across organs—breast, thyroid, liver, cardiac, obstetric, and others—where assembling large expert-labeled corpora is impractical.

Impact

USFM is among the first general-purpose foundation models targeted specifically at medical ultrasound, a modality long underserved relative to CT, MRI, and histopathology in foundation-model research. By demonstrating that a single self-supervised backbone can generalize across organs and tasks while cutting annotation requirements, it provides a practical template for label-efficient ultrasound AI and has been incorporated into the OpenMedLab ecosystem of open medical foundation models. Its main limitations are those common to the class: the pretraining corpus, while large and diverse, is not fully described publicly, the weights are restricted to non-commercial use, and downstream performance still depends on quality fine-tuning data for each clinical target.

Citation

USFM: A universal ultrasound foundation model generalized to tasks and organs towards label efficient image analysis

Jiao, J., et al. (2023) USFM: A universal ultrasound foundation model generalized to tasks and organs towards label efficient image analysis. Medical Image Anal..

DOI: 10.1016/j.media.2024.103202

Recent citations

Papers that recently cited this model.

Fractal-guided multi-scale contrastive learning for robust liver tumor classification in ultrasound
Xuping Zhang, Qingyu Zhang, Tao Zhang, et al.
Engineering applications of artificial intelligence · Aug 2026
0
Vision Foundation Models in Radiology: A Scoping Review of Data, Methodology, Evaluation and Clinical Translation
A. Vergara-Richart, Xavier Rafael-Palou, A. Fuster-Matanzo, et al.
Jul 2026
0Influential
Wavelet-inspired diffusion model with near-field constraint for real-time echocardiography dehazing.
Xue Gao, Peng Huang, Fangyan Tian, et al.
Medical Image Analysis · Jul 2026
0

Top citations

The most-cited papers that cite this model.

A Comprehensive Survey of Foundation Models in Medicine
Wasif Khan, Seowung Leem, Kyle B. See, et al.
IEEE Reviews in Biomedical Engineering · Jun 2024
115
Generative artificial intelligence in medicine
Zhen Ling Teo, A. Thirunavukarasu, Kabilan Elangovan, et al.
Nature Medicine · Oct 2025
43
SAM-Med3D: A Vision Foundation Model for General-Purpose Segmentation on Volumetric Medical Images
Haoyu Wang, Sizheng Guo, Jin Ye, et al.
IEEE Transactions on Neural Networks and Learning Systems · Jul 2025
42
Ultrasam: a foundation model for ultrasound using large open-access segmentation datasets
A. Meyer, Aditya Murali, Farahdiba Zarin, et al.
International Journal of Computer Assisted Radiology and Surgery · Nov 2024
29
Visual–language foundation models in medicine
Chunyu Liu, Yixiao Jin, Zhouyu Guan, et al.
The Visual Computer · Jul 2024
29

Citations

Total Citations112

Influential12

References45

GitHub

Stars350

Forks34

Open Issues21

Contributors1

Last Push3mo ago

LanguagePython

Fields of citing research

Medicine97%
Computer Science91%
Engineering53%
Physics2%
Biology2%
Materials Science2%
Mathematics1%
Linguistics1%

Share of papers citing this model.

Openness

bio.rodeo opennessClosed · low usability and reproducibility

21Closed

Usability — can I run it?14

Reproducibility — can I retrain it?13

Model Openness Framework

Unclassified

Restrictive license on core components

Resources

GitHub Repository Research Paper Official Website

Key Features

Spatial-frequency dual masked image modeling: USFM extends masked image modeling with a frequency band-stop masking branch alongside spatial masking, forcing the model to reconstruct information in both the image and frequency domains for more robust feature learning on noisy ultrasound.

Multi-organ, multi-device pretraining: Self-supervised pretraining on 2M+ ultrasound images from many organs, centers, and devices, with organ-balanced sampling to avoid bias toward data-rich anatomies.

Task-agnostic transfer: A single backbone supports segmentation, classification, and image enhancement/restoration across diverse organs and diseases without task-specific architectural redesign.

Label efficiency: Achieves strong downstream performance with as little as 20% of the annotations required by training from scratch, reducing the expert-labeling burden that limits clinical ultrasound AI.

Open weights and code: Pretrained weights and fine-tuning code are released publicly, including a toy dataset for reproducing the segmentation and classification pipelines.

Technical Details

Applications

Impact

Citation

USFM: A universal ultrasound foundation model generalized to tasks and organs towards label efficient image analysis

Jiao, J., et al. (2023) USFM: A universal ultrasound foundation model generalized to tasks and organs towards label efficient image analysis. Medical Image Anal..

DOI: 10.1016/j.media.2024.103202

Recent citations

Papers that recently cited this model.

Fractal-guided multi-scale contrastive learning for robust liver tumor classification in ultrasound

Xuping Zhang, Qingyu Zhang, Tao Zhang, et al.

Engineering applications of artificial intelligence · Aug 2026

Vision Foundation Models in Radiology: A Scoping Review of Data, Methodology, Evaluation and Clinical Translation

A. Vergara-Richart, Xavier Rafael-Palou, A. Fuster-Matanzo, et al.

Jul 2026

0Influential

Wavelet-inspired diffusion model with near-field constraint for real-time echocardiography dehazing.

Xue Gao, Peng Huang, Fangyan Tian, et al.

Medical Image Analysis · Jul 2026

Top citations

The most-cited papers that cite this model.

A Comprehensive Survey of Foundation Models in Medicine

Wasif Khan, Seowung Leem, Kyle B. See, et al.

IEEE Reviews in Biomedical Engineering · Jun 2024

115

USFM

#Key Features

#Technical Details

#Applications

#Impact

Citation

USFM: A universal ultrasound foundation model generalized to tasks and organs towards label efficient image analysis

Recent citations

Vision Foundation Models in Radiology: A Scoping Review of Data, Methodology, Evaluation and Clinical Translation

Top citations

Related models

Citations

GitHub

Fields of citing research

Openness

Tags

Resources

USFM

#Key Features

#Technical Details

#Applications

#Impact

Citation

USFM: A universal ultrasound foundation model generalized to tasks and organs towards label efficient image analysis

Recent citations

Vision Foundation Models in Radiology: A Scoping Review of Data, Methodology, Evaluation and Clinical Translation

Top citations

Related models

Citations

GitHub

Fields of citing research

Openness

Tags

Resources

Key Features

Technical Details

Applications

Impact

Key Features

Technical Details

Applications

Impact