bio.rodeo
ModelsOrganizationsLeaderboardAbout
bio.rodeo

The authoritative source for evaluating biological foundation models. No hype, just honest analysis.

AboutFAQSubmit a modelContact
© 2026 Pulsatance. All rights reserved. ~
Built by Pulsatance
Pathology foundation models
Pathology

GPFM

Hong Kong University of Science and Technology / Sun Yat-sen University / Southern Medical University / Chinese University of Hong Kong

A generalizable computational-pathology foundation model trained on ~190M histopathology patches via unified knowledge distillation from UNI, Phikon, and CONCH.

Released: November 2025

GPFM (Generalizable Pathology Foundation Model) is a self-supervised tile encoder for computational pathology that produces general-purpose visual features from hematoxylin and eosin (H&E) and other histopathology image patches. Developed by Jiabo Ma, Hao Chen, and colleagues at the Hong Kong University of Science and Technology in collaboration with pathology departments at Sun Yat-sen University, Southern Medical University, and the Chinese University of Hong Kong, GPFM was published in Nature Biomedical Engineering in 2025. It addresses a recurring problem in digital pathology: individual foundation models often excel on the narrow task they were tuned for but generalize unevenly across the broad spectrum of slide-level, region-level, and multimodal pathology tasks encountered in practice.

The model's central innovation is a unified knowledge distillation pretraining framework. Rather than learning solely from raw images, GPFM distills knowledge from several existing expert pathology encoders — UNI, Phikon, and CONCH — while simultaneously applying self-distillation, combining expert guidance with masked-image-modeling and self-supervised objectives. This lets a single student network inherit complementary strengths (UNI's classification accuracy, Phikon's prognostic signal, and CONCH's vision-language alignment) instead of specializing narrowly.

GPFM sits alongside pathology foundation models such as UNI, Virchow, CONCH, and Phikon, but distinguishes itself by being explicitly optimized and benchmarked for breadth, with the authors assembling one of the largest task suites used to evaluate a pathology encoder to date.

#Key Features

  • Unified knowledge distillation: GPFM combines expert knowledge distillation from UNI, Phikon, and CONCH with self-distillation, transferring complementary capabilities into one student encoder.
  • Massive, diverse pretraining: The encoder is trained on roughly 190 million image tiles drawn from about 72,000 whole-slide images spanning 34 major tissue types, giving broad morphological and organ coverage.
  • Broad task generalization: GPFM is evaluated across 72 clinical tasks covering slide-level classification, survival prediction, ROI tissue classification, ROI retrieval, visual question answering, and report generation.
  • Strong benchmark standing: Across the benchmark it attains an average rank of 1.6 with 42 first-place finishes, ahead of the next-best model (UNI, average rank 3.7).
  • Open weights and code: Pretrained weights are released on HuggingFace under an MIT license, with training and evaluation code available on GitHub.

#Technical Details

GPFM uses a ViT-L/14 Vision Transformer backbone in a DINOv2 configuration, producing 1024-dimensional feature embeddings from 224×224 RGB tiles (sampled from larger tiles at 40× magnification). Pretraining couples a masked image modeling loss with a DINO self-distillation loss, and adds an expert distillation objective that aligns the student's [CLS] and [PATCH] tokens to those of the UNI, Phikon, and CONCH teacher models using cosine similarity and smooth L1 distance. The training corpus comprises approximately 190 million patches extracted from roughly 72,000 publicly available slides covering 34 tissue types. On the assembled benchmark of 72 tasks across six task families, GPFM achieves an average rank of 1.6 and ranks first on 42 tasks, compared with an average rank of 3.7 and six first-place finishes for UNI.

#Applications

GPFM serves computational-pathology researchers and clinical AI developers who need a single, reliable feature extractor across many downstream tasks. Its embeddings feed multiple-instance-learning pipelines for whole-slide cancer diagnosis and subtyping, patient-level survival and prognosis modeling, region-of-interest tissue classification and retrieval, and multimodal workflows such as pathology visual question answering and report generation. Because the weights are openly available under a permissive license, labs can extract features for their own annotated cohorts without retraining a large encoder from scratch, accelerating both methods research and translational pipeline development.

#Impact

By demonstrating that distilling several specialized pathology encoders into one student can outperform each teacher across a wide task suite, GPFM advances knowledge distillation as a practical strategy for building general-purpose pathology foundation models. Its publication in Nature Biomedical Engineering, paired with openly released MIT-licensed weights and a large, multi-family benchmark, gives the community both a strong off-the-shelf encoder and a more demanding yardstick for evaluating generalization. The main limitation is that pretraining and benchmarking draw heavily on publicly available cohorts, so performance on novel scanners, stains, rare tissue types, and prospective clinical populations warrants further external validation.

Citation

A generalizable pathology foundation model using a unified knowledge distillation pretraining framework.

Ma, J., et al. (2025) A generalizable pathology foundation model using a unified knowledge distillation pretraining framework.. Nature Biomedical Engineering.

DOI: 10.1038/s41551-025-01488-4

Recent citations

Papers that recently cited this model.

Not enough citation data yet.

Top citations

The most-cited papers that cite this model.

Not enough citation data yet.

Citations

Total Citations37
Influential0
References67

GitHub

Stars126
Forks7
Open Issues5
Contributors3
Last Push3mo ago
LanguagePython

HuggingFace

Downloads0
Likes3
Last Modified5mo ago
Pipelinefeature-extraction

Fields of citing research

Not enough data

Openness

bio.rodeo opennessFully open · usable and reproducible
84Open
Usability — can I run it?100
Reproducibility — can I retrain it?66
Model Openness Framework
Class II
Open Tooling

Tags

cancer_diagnosisfeature_extractionfoundation_modelhistologyimage_retrievalknowledge_distillationself_supervisedsurvival_predictiontransformervision_transformerwhole_slide_imaging

Resources

GitHub RepositoryResearch PaperHuggingFace Model