Med3D

Pretrained 3D-ResNet backbones for volumetric medical image analysis, co-trained across eight CT and MRI segmentation datasets for transfer learning.

Released: April 2019

Med3D is a family of pretrained 3D convolutional neural networks designed to bring ImageNet-style transfer learning to volumetric medical image analysis. Whereas 2D natural-image backbones can be pretrained on millions of labeled photographs, 3D medical datasets are small, expensive to annotate, and fragmented across imaging modalities (CT, MRI), anatomical regions, and pathologies. Med3D addresses this scarcity by co-training a single heterogeneous 3D network across many segmentation datasets at once, then releasing the resulting backbones as reusable weights for downstream tasks.

Introduced in the 2019 paper "Med3D: Transfer Learning for 3D Medical Image Analysis" by Sihong Chen, Kai Ma, and Yefeng Zheng at Tencent's healthcare AI group, the work aggregates eight public 3D segmentation challenges into a combined corpus the authors call 3DSeg-8. A shared encoder learns general-purpose volumetric features, while dataset-specific decoder branches handle the differing label spaces of each source. The pretrained encoders are distributed through the companion MedicalNet repository as a series of 3D-ResNet checkpoints.

Med3D sits at the foundation-model end of medical imaging: rather than a single task-specific predictor, it provides transferable representations that practitioners fine-tune for segmentation, classification, and detection on their own scans, substantially reducing the data and compute needed to reach strong performance.

Key Features

Multi-domain co-training: A single heterogeneous 3D network is trained jointly across the 3DSeg-8 corpus, sharing an encoder while using per-dataset decoder branches to accommodate heterogeneous modalities, organs, and label sets.
Family of 3D-ResNet backbones: Pretrained weights are provided for ResNet-10, 18, 34, 50, 101, 152, and 200, letting users trade off capacity against available data and compute.
Released transfer-learning weights: The MedicalNet repository ships ready-to-use checkpoints (including updated 23-dataset variants) under an MIT license, with training and testing code for fine-tuning.
Faster convergence: Initializing from Med3D weights converges roughly 2x faster than Kinetics video pretraining and about 10x faster than training from scratch.
Accuracy gains on scarce data: Fine-tuning the pretrained backbones yields reported improvements of 3-20% over from-scratch training across downstream medical tasks.

Technical Details

The Med3D backbones are 3D-ResNets with configurable shortcut connections (type A and type B), operating directly on volumetric inputs. Pretraining uses the 3DSeg-8 dataset, an aggregation of eight public 3D segmentation challenges spanning multiple modalities, organs, and pathologies; later released checkpoints extend co-training to 23 datasets. The shared encoder is optimized across all source segmentation tasks simultaneously, with separate decoder heads per dataset, so the encoder learns modality- and organ-agnostic features that transfer well. On downstream evaluation, attaching Med3D-pretrained encoders to segmentation heads produced strong results, for example a ResNet-50 backbone reaching 93.31% Dice on lung segmentation (versus 71.75% from scratch), and the authors report a 94.6% Dice coefficient on the LiTS liver challenge when combined with a DenseASPP segmentation network. The reference implementation targets PyTorch.

Applications

Med3D backbones serve as drop-in feature extractors for 3D medical imaging pipelines, where labeled data is typically limited. Researchers and clinical-AI developers fine-tune the released checkpoints for organ and lesion segmentation (lung, liver, and other structures), nodule and disease classification, and detection tasks on CT and MRI volumes. Because the weights are openly available with training code, smaller labs and groups without large annotated cohorts can reach competitive accuracy without pretraining their own 3D networks, making Med3D a common starting point for volumetric medical-imaging projects.

Impact

Med3D was an early and influential demonstration that transfer learning, long standard in 2D natural-image vision, can be made to work for 3D medical imaging despite fragmented, modality-diverse data. The MedicalNet release of pretrained 3D-ResNet weights has been widely reused as initialization for downstream CT and MRI tasks and is frequently cited as a baseline in volumetric medical-imaging research. Its main limitations follow from its design: the backbones are convolutional rather than transformer-based, the pretraining corpus is modest by modern standards, and the learned features are biased toward the organs and modalities present in the source segmentation datasets, so performance on out-of-distribution modalities may require additional adaptation.

Citation

Med3D: Transfer Learning for 3D Medical Image Analysis

Preprint

Chen, S., et al. (2019) Med3D: Transfer Learning for 3D Medical Image Analysis. arXiv.org.

DOI: 10.48550/arXiv.1904.00625

Recent citations

Papers that recently cited this model.

Multimodal Semantic-Aware Contrastive Learning For False Negative Mitigation in 3D Medical Imaging
Sara Ketabi, M. Wagner, Cynthia E. Hawkins, et al.
Jul 2026
0
APLG-Net: an anatomy-guided local-global hybrid network with progression-aware supervision for structural MRI-based NC/MCI/AD classification
Bin Shi, Zhimin Wang, Jing Lian, et al.
Frontiers in Neurology · Jul 2026
0
CHM-Net: Center Heatmap-driven Macro-Micro Modeling Network for MRI-based Microbial Density Stratification
Jiaming Liang, Hao Chen, Ting Li, et al.
Jul 2026
0

Top citations

The most-cited papers that cite this model.

A review of deep learning in medical imaging: Imaging traits, technology trends, case studies with progress highlights, and future promises
S. K. Zhou, H. Greenspan, C. Davatzikos, et al.
Proceedings of the IEEE · Aug 2020
961
Recent advances and clinical applications of deep learning in medical image analysis
Xuxin Chen, Xi-ming Wang, Kecheng Zhang, et al.
Medical Image Anal. · May 2021
944
Self-Supervised Pre-Training of Swin Transformers for 3D Medical Image Analysis
Yucheng Tang, Dong Yang, Wenqi Li, et al.
Computer Vision and Pattern Recognition · Nov 2021
836
Deep Learning for Cardiac Image Segmentation: A Review
Chen Chen, C. Qin, Huaqi Qiu, et al.
Frontiers in Cardiovascular Medicine · Nov 2019
834
Big Self-Supervised Models Advance Medical Image Classification
Shekoofeh Azizi, Basil Mustafa, Fiona Ryan, et al.
IEEE International Conference on Computer Vision · Jan 2021
715

Citations

Total Citations682

Influential66

References43

GitHub

Stars2.2K

Forks462

Open Issues76

Contributors1

Last Push8mo ago

LanguagePython

Fields of citing research

Computer Science58%
Medicine57%
Engineering27%
Biology3%
Physics2%
Mathematics1%
Environmental Science1%
Materials Science0%

Share of papers citing this model.

Openness

bio.rodeo opennessFully open · usable and reproducible

75Open

Usability — can I run it?94

Reproducibility — can I retrain it?57

Model Openness Framework

Class III

Open Model

Resources

GitHub Repository Research Paper

Key Features

Multi-domain co-training: A single heterogeneous 3D network is trained jointly across the 3DSeg-8 corpus, sharing an encoder while using per-dataset decoder branches to accommodate heterogeneous modalities, organs, and label sets.

Family of 3D-ResNet backbones: Pretrained weights are provided for ResNet-10, 18, 34, 50, 101, 152, and 200, letting users trade off capacity against available data and compute.

Released transfer-learning weights: The MedicalNet repository ships ready-to-use checkpoints (including updated 23-dataset variants) under an MIT license, with training and testing code for fine-tuning.

Faster convergence: Initializing from Med3D weights converges roughly 2x faster than Kinetics video pretraining and about 10x faster than training from scratch.

Accuracy gains on scarce data: Fine-tuning the pretrained backbones yields reported improvements of 3-20% over from-scratch training across downstream medical tasks.

Technical Details

Applications

Impact

Recent citations

Papers that recently cited this model.

Multimodal Semantic-Aware Contrastive Learning For False Negative Mitigation in 3D Medical Imaging

Sara Ketabi, M. Wagner, Cynthia E. Hawkins, et al.

Jul 2026

APLG-Net: an anatomy-guided local-global hybrid network with progression-aware supervision for structural MRI-based NC/MCI/AD classification

Bin Shi, Zhimin Wang, Jing Lian, et al.

Frontiers in Neurology · Jul 2026

CHM-Net: Center Heatmap-driven Macro-Micro Modeling Network for MRI-based Microbial Density Stratification

Jiaming Liang, Hao Chen, Ting Li, et al.

Jul 2026

Med3D

#Key Features

#Technical Details

#Applications

#Impact

Citation

Med3D: Transfer Learning for 3D Medical Image Analysis

Recent citations

Multimodal Semantic-Aware Contrastive Learning For False Negative Mitigation in 3D Medical Imaging

CHM-Net: Center Heatmap-driven Macro-Micro Modeling Network for MRI-based Microbial Density Stratification

Top citations

Related models

Citations

GitHub

Fields of citing research

Openness

Tags

Resources

Med3D

#Key Features

#Technical Details

#Applications

#Impact

Citation

Med3D: Transfer Learning for 3D Medical Image Analysis

Recent citations

Multimodal Semantic-Aware Contrastive Learning For False Negative Mitigation in 3D Medical Imaging

CHM-Net: Center Heatmap-driven Macro-Micro Modeling Network for MRI-based Microbial Density Stratification

Top citations

Related models

Citations

GitHub

Fields of citing research

Openness

Tags

Resources

Key Features

Technical Details

Applications

Impact

Key Features

Technical Details

Applications

Impact