bio.rodeo
ModelsOrganizationsLeaderboardAbout
bio.rodeo

The authoritative source for evaluating biological foundation models. No hype, just honest analysis.

AboutFAQSubmit a modelContact
© 2026 Pulsatance. All rights reserved. ~
Built by Pulsatance
Pathology foundation models
Pathology

BEPH

Shanghai Jiao Tong University

A BEiT-based self-supervised foundation model pretrained on 11M+ histopathology image tiles for cancer diagnosis, subtyping, and survival prediction.

Released: March 2025
Parameters: 86 Million

BEPH (BEiT-based Pre-training on Histopathological images) is a self-supervised foundation model for computational pathology that learns transferable visual representations from hematoxylin and eosin (H&E) stained tissue images. Developed by the Yu Lab at Shanghai Jiao Tong University and published in Nature Communications in March 2025, BEPH addresses a central bottleneck in digital pathology: most clinically relevant tasks have only modest amounts of labeled data, making it difficult to train accurate models from scratch. By pretraining on millions of unlabeled image tiles, BEPH provides a general-purpose encoder that can be efficiently fine-tuned for a wide range of cancer-related tasks.

The model is notable for adapting the BEiT v2 masked image modeling paradigm, originally developed for natural images, to gigapixel histopathology. Rather than predicting raw pixels, BEiT-style pretraining reconstructs discrete visual tokens for masked patches, which encourages the encoder to learn high-level semantic structure relevant to tissue morphology. BEPH demonstrates that this approach yields representations that transfer well across cancer types and across the patch, whole-slide, and prognostic levels of analysis.

BEPH fits alongside other pathology foundation models such as UNI, CONCH, and Virchow, but distinguishes itself with a deliberately lightweight design (an ~86M-parameter ViT-Base backbone) that lowers the barrier to local deployment and fine-tuning on commodity hardware.

#Key Features

  • BEiT v2 masked image modeling: BEPH pretrains a Vision Transformer by reconstructing discrete visual tokens for masked image patches, learning semantic tissue features without any manual annotation.
  • Large-scale pretraining: The encoder is trained on roughly 11.77 million image tiles extracted from 11,760 TCGA whole-slide images spanning 32 cancer types, giving broad morphological coverage.
  • Multi-level task support: Learned features are adapted to patch-level cancer detection, weakly supervised whole-slide-image (WSI) classification, and patient-level survival prediction.
  • Lightweight and deployable: With an ~86M-parameter ViT-Base backbone, BEPH is small enough to fine-tune and run on standard workstation GPUs.
  • Strong transfer performance: BEPH improves on ResNet and DINO baselines by up to 8.8% and 7.2% on WSI classification, and by 6.44% and 3.28% on average for survival prediction.

#Technical Details

BEPH uses a ViT-Base backbone (~86M parameters) pretrained with the BEiT v2 self-supervised objective, in which a VQ-KD visual tokenizer supplies discrete target tokens for masked patches. Pretraining data consist of 224×224 tiles sampled at 40× magnification with at least 75% tissue content, drawn from TCGA diagnostic slides. For downstream WSI tasks, tile features are aggregated using the CLAM attention-based multiple-instance-learning framework. Across reported benchmarks, BEPH reaches AUCs of 0.994 for renal cell carcinoma subtyping, 0.970 for non-small cell lung cancer subtyping, and 0.946 for breast cancer subtyping, and achieves concordance indices in the 0.59–0.71 range for survival prediction across six TCGA cohorts (BRCA, CRC, CCRCC, PRCC, LUAD, STAD). Code and pretrained weights are released under the GPL-3.0 license.

#Applications

BEPH is designed for computational pathology researchers and clinical AI developers who need a strong starting point for cancer-image analysis. Typical use cases include detecting malignancy in tissue patches, classifying cancer subtypes from whole-slide images, and stratifying patients by predicted survival risk to support prognosis. Because the backbone is lightweight and the weights are openly available, smaller labs can fine-tune BEPH on their own annotated datasets without the compute demands of larger pathology foundation models, making it suitable for both methods research and translational pipeline development.

#Impact

By showing that a BEiT-based masked-image-modeling backbone can rival or exceed contrastive and supervised baselines across diagnosis, subtyping, and survival tasks, BEPH reinforced masked image modeling as a viable pretraining strategy for histopathology. Its publication in Nature Communications, paired with an openly released GPL-3.0 codebase and pretrained weights, lowered the practical barrier to adopting foundation models in pathology research. The model's emphasis on a compact, deployable architecture offers a useful counterpoint to the trend toward ever-larger pathology encoders, with the main limitation being that its pretraining draws from TCGA, so generalization to other scanners, stains, and populations warrants further external validation.

Citation

A foundation model for generalizable cancer diagnosis and survival prediction from histopathological images

Yang, Z., et al. (2024) A foundation model for generalizable cancer diagnosis and survival prediction from histopathological images. bioRxiv.

DOI: 10.1038/s41467-025-57587-y

Recent citations

Papers that recently cited this model.

Not enough citation data yet.

Top citations

The most-cited papers that cite this model.

Not enough citation data yet.

Citations

Total Citations81
Influential3
References65

GitHub

Stars75
Forks7
Open Issues7
Contributors1
Last Push1y ago
LanguagePython
LicenseGPL-3.0

Fields of citing research

Not enough data

Openness

bio.rodeo opennessFully open · usable and reproducible
71Open
Usability — can I run it?82
Reproducibility — can I retrain it?56
Model Openness Framework
Class III
Open Model

Tags

cancer_diagnosiscancer_subtypingfoundation_modelhistologyself_supervisedsurvival_predictiontransfer_learningtransformervision_transformerwhole_slide_imaging

Resources

GitHub RepositoryResearch Paper