bio.rodeo
ModelsOrganizationsLeaderboardAbout
bio.rodeo

The authoritative source for evaluating biological foundation models. No hype, just honest analysis.

AboutFAQSubmit a modelContact
© 2026 Pulsatance. All rights reserved. ~
Built by Pulsatance
Imaging foundation models
Imaging

MedicoSAM

Computational Cell Analytics

A Segment Anything Model (SAM) variant finetuned on diverse medical images, delivering a reusable promptable checkpoint for interactive and automatic medical image segmentation.

Released: January 2025

MedicoSAM adapts the Segment Anything Model (SAM), Meta AI's prompt-driven vision foundation model, to the medical imaging domain. SAM revolutionized natural-image segmentation by producing high-quality masks from simple prompts such as points or bounding boxes, but its zero-shot performance degrades on medical data, where contrast, modality, and anatomical structures differ sharply from everyday photographs. MedicoSAM closes much of that gap by finetuning SAM on large, diverse collections of publicly available medical images, yielding a reusable promptable checkpoint that practitioners can drop into existing SAM-based annotation tools.

The model was developed by Anwai Archit, Luca Freckmann, and Constantin Pape of the Computational Cell Analytics group at the University of Göttingen, and introduced in a January 2025 preprint that was subsequently published in IEEE Transactions on Medical Imaging in 2025. Beyond releasing weights, the paper is a systematic study of how best to transfer SAM to medical imaging: it compares finetuning strategies across many datasets and evaluates them on both interactive (prompt-based) and automatic semantic segmentation.

A central, candid finding is that medical pretraining yields large, consistent improvements for interactive segmentation but only modest gains for automatic semantic segmentation. This nuance positions MedicoSAM as a practical accelerator for human-in-the-loop annotation rather than a turnkey replacement for task-specific segmentation networks.

#Key Features

  • Promptable interactive segmentation: Built on SAM's point- and box-prompt interface, MedicoSAM produces accurate masks from minimal user input, substantially outperforming the base SAM on medical data.
  • 2D and 3D support: The released model handles both 2D slices and volumetric (3D) medical data, covering modalities such as CT, MRI, and X-ray.
  • Semantic segmentation decoder: An additional pretrained decoder enables automatic semantic segmentation in addition to the interactive workflow.
  • Tool compatibility: The checkpoint is designed to slot directly into the group's micro-sam annotator, so existing SAM-based pipelines can adopt it with minimal change.
  • Open and reusable: Code and the vit_b_medicosam.pt checkpoint are released under the permissive MIT license for downstream finetuning and deployment.

#Technical Details

MedicoSAM is built on SAM's ViT-B image encoder paired with SAM's lightweight mask decoder, and adds a separately pretrained semantic-segmentation decoder. Finetuning is performed on a large, heterogeneous corpus of publicly available medical images spanning multiple modalities and anatomies, using the interactive training scheme that simulates user prompts during optimization. The authors benchmark several transfer strategies and report that the resulting interactive model improves markedly over both vanilla SAM and prior medical SAM variants on held-out interactive segmentation tasks, while gains on automatic semantic segmentation remain comparatively limited. The released artifact is a single vit_b_medicosam.pt checkpoint distributed via the group's owncloud server.

#Applications

MedicoSAM is most valuable as an annotation accelerator for radiologists, clinical researchers, and imaging scientists who must label large volumes of CT, MRI, or X-ray data. By turning a few clicks into accurate masks, it reduces the manual effort of building segmentation ground truth, supports rapid dataset curation for downstream model training, and integrates into interactive tools for both 2D and 3D workflows. Its permissive license also makes it a convenient starting point for groups finetuning their own modality- or task-specific medical segmentation models.

#Impact

By rigorously characterizing where SAM-style pretraining helps in medicine — strongly for interactive use, weakly for fully automatic semantic segmentation — MedicoSAM provides the community with both a practical checkpoint and an honest evaluation that tempers expectations around medical foundation-model segmentation. Released openly under MIT and compatible with the widely adopted micro-sam ecosystem, it lowers the barrier to high-quality medical image annotation. Its main limitation is the modest improvement it offers for automatic semantic segmentation, where dedicated task-specific architectures still tend to lead.

Citation

MedicoSAM: Robust Improvement of SAM for Medical Imaging

Archit, A., et al. (2025) MedicoSAM: Robust Improvement of SAM for Medical Imaging. IEEE Transactions on Medical Imaging.

DOI: 10.1109/TMI.2025.3644811

Recent citations

Papers that recently cited this model.

Not enough citation data yet.

Top citations

The most-cited papers that cite this model.

Not enough citation data yet.

Citations

Total Citations6
Influential0
References85

GitHub

Stars31
Forks2
Open Issues1
Contributors3
Last Push4d ago
LanguagePython
LicenseMIT

Fields of citing research

Not enough data

Openness

bio.rodeo opennessFully open · usable and reproducible
77Open
Usability — can I run it?95
Reproducibility — can I retrain it?58
Model Openness Framework
Unclassified
Missing required components

Tags

foundation_modelinteractive_annotationmedical_imagingradiologysegmentationtransfer_learningvision_transformer

Resources

GitHub RepositoryResearch PaperResearch Paper