GPFM

Hong Kong University of Science and Technology / Sun Yat-sen University / Southern Medical University / Chinese University of Hong Kong

Histopathology foundation model extracting general-purpose features from H&E patches by distilling the UNI, Phikon, and CONCH pathology encoders.

Released: November 2025

GPFM (Generalizable Pathology Foundation Model) is a self-supervised tile encoder for computational pathology that produces general-purpose visual features from hematoxylin and eosin (H&E) and other histopathology image patches. Developed by Jiabo Ma, Hao Chen, and colleagues at the Hong Kong University of Science and Technology in collaboration with pathology departments at Sun Yat-sen University, Southern Medical University, and the Chinese University of Hong Kong, GPFM was published in Nature Biomedical Engineering in 2025. It addresses a recurring problem in digital pathology: individual foundation models often excel on the narrow task they were tuned for but generalize unevenly across the broad spectrum of slide-level, region-level, and multimodal pathology tasks encountered in practice.

The model's central innovation is a unified knowledge distillation pretraining framework. Rather than learning solely from raw images, GPFM distills knowledge from several existing expert pathology encoders — UNI, Phikon, and CONCH — while simultaneously applying self-distillation, combining expert guidance with masked-image-modeling and self-supervised objectives. This lets a single student network inherit complementary strengths (UNI's classification accuracy, Phikon's prognostic signal, and CONCH's vision-language alignment) instead of specializing narrowly.

GPFM sits alongside pathology foundation models such as UNI, Virchow, CONCH, and Phikon, but distinguishes itself by being explicitly optimized and benchmarked for breadth, with the authors assembling one of the largest task suites used to evaluate a pathology encoder to date.

Key Features

Unified knowledge distillation: GPFM combines expert knowledge distillation from UNI, Phikon, and CONCH with self-distillation, transferring complementary capabilities into one student encoder.
Massive, diverse pretraining: The encoder is trained on roughly 190 million image tiles drawn from about 72,000 whole-slide images spanning 34 major tissue types, giving broad morphological and organ coverage.
Broad task generalization: GPFM is evaluated across 72 clinical tasks covering slide-level classification, survival prediction, ROI tissue classification, ROI retrieval, visual question answering, and report generation.
Strong benchmark standing: Across the benchmark it attains an average rank of 1.6 with 42 first-place finishes, ahead of the next-best model (UNI, average rank 3.7).
Open weights and code: Pretrained weights are released on HuggingFace under an MIT license, with training and evaluation code available on GitHub.

Technical Details

GPFM uses a ViT-L/14 Vision Transformer backbone in a DINOv2 configuration, producing 1024-dimensional feature embeddings from 224×224 RGB tiles (sampled from larger tiles at 40× magnification). Pretraining couples a masked image modeling loss with a DINO self-distillation loss, and adds an expert distillation objective that aligns the student's [CLS] and [PATCH] tokens to those of the UNI, Phikon, and CONCH teacher models using cosine similarity and smooth L1 distance. The training corpus comprises approximately 190 million patches extracted from roughly 72,000 publicly available slides covering 34 tissue types. On the assembled benchmark of 72 tasks across six task families, GPFM achieves an average rank of 1.6 and ranks first on 42 tasks, compared with an average rank of 3.7 and six first-place finishes for UNI.

Applications

GPFM serves computational-pathology researchers and clinical AI developers who need a single, reliable feature extractor across many downstream tasks. Its embeddings feed multiple-instance-learning pipelines for whole-slide cancer diagnosis and subtyping, patient-level survival and prognosis modeling, region-of-interest tissue classification and retrieval, and multimodal workflows such as pathology visual question answering and report generation. Because the weights are openly available under a permissive license, labs can extract features for their own annotated cohorts without retraining a large encoder from scratch, accelerating both methods research and translational pipeline development.

Impact

By demonstrating that distilling several specialized pathology encoders into one student can outperform each teacher across a wide task suite, GPFM advances knowledge distillation as a practical strategy for building general-purpose pathology foundation models. Its publication in Nature Biomedical Engineering, paired with openly released MIT-licensed weights and a large, multi-family benchmark, gives the community both a strong off-the-shelf encoder and a more demanding yardstick for evaluating generalization. The main limitation is that pretraining and benchmarking draw heavily on publicly available cohorts, so performance on novel scanners, stains, rare tissue types, and prospective clinical populations warrants further external validation.

Citation

A generalizable pathology foundation model using a unified knowledge distillation pretraining framework.

Ma, J., et al. (2025) A generalizable pathology foundation model using a unified knowledge distillation pretraining framework.. Nature Biomedical Engineering.

DOI: 10.1038/s41551-025-01488-4

Recent citations

Papers that recently cited this model.

Toward brain magnetic resonance imaging analysis intelligence: A review of federated learning and visual foundation models
Zhen Yu, Yang Liu, Qingchao Chen
Engineering applications of artificial intelligence · Aug 2026
0
Pretraining Multiple Instance Learning Networks with Multi-Teacher Distillation from Pathology Slide Foundation Models
Mingxi Fu, Jiawen Li, Renao Yan, et al.
Jul 2026
0
LaGuadia: Language-Guided Adaptive Distillation from Pathology Foundation Models
Gangsu Kim, Won-Ki Jeong
Jul 2026
0

Top citations

The most-cited papers that cite this model.

PathOrchestra: a comprehensive foundation model for computational pathology with over 100 diverse clinical-grade tasks
Fang Yan, Jianfeng Wu, Jiawen Li, et al.
npj Digital Medicine · Mar 2025
15
Large-scale self-supervised video foundation model for intelligent surgery
Shu Yang, Fengtao Zhou, Leon D. Mayer, et al.
npj Digital Medicine · Jun 2025
14
Uncertainty-aware ensemble of foundation models differentiates glioblastoma from its mimics
Junhan Zhao, Shih-Yen Lin, Raphael Attias, et al.
Nature Communications · Sep 2025
11
A Fully Open and Generalizable Foundation Model for Ultrasound Clinical Applications
Hongyuan Zhang, Yuheng Wu, Mingyang Zhao, et al.
arXiv.org · Sep 2025
10
Emerging AI approaches for cancer spatial omics
Javad Noorbakhsh, Ali Foroughi pour, Jeffrey Chuang
GigaScience · Oct 2025
7

Citations

Total Citations54

Influential1

References67

GitHub

Stars129

Forks7

Open Issues5

Contributors3

Last Push4mo ago

LanguagePython

HuggingFace

Downloads0

Likes3

Last Modified7mo ago

Pipelinefeature-extraction

Fields of citing research

Computer Science98%
Medicine94%
Engineering12%
Biology6%
Mathematics2%
Environmental Science2%

Share of papers citing this model.

Openness

bio.rodeo opennessFully open · usable and reproducible

84Open

Usability — can I run it?100

Reproducibility — can I retrain it?66

Model Openness Framework

Class II

Open Tooling

Resources

GitHub Repository Research Paper HuggingFace Model

Key Features

Unified knowledge distillation: GPFM combines expert knowledge distillation from UNI, Phikon, and CONCH with self-distillation, transferring complementary capabilities into one student encoder.

Massive, diverse pretraining: The encoder is trained on roughly 190 million image tiles drawn from about 72,000 whole-slide images spanning 34 major tissue types, giving broad morphological and organ coverage.

Broad task generalization: GPFM is evaluated across 72 clinical tasks covering slide-level classification, survival prediction, ROI tissue classification, ROI retrieval, visual question answering, and report generation.

Strong benchmark standing: Across the benchmark it attains an average rank of 1.6 with 42 first-place finishes, ahead of the next-best model (UNI, average rank 3.7).

Open weights and code: Pretrained weights are released on HuggingFace under an MIT license, with training and evaluation code available on GitHub.

Technical Details

Applications

Impact

Citation

A generalizable pathology foundation model using a unified knowledge distillation pretraining framework.

Ma, J., et al. (2025) A generalizable pathology foundation model using a unified knowledge distillation pretraining framework.. Nature Biomedical Engineering.

DOI: 10.1038/s41551-025-01488-4

Recent citations

Papers that recently cited this model.

Toward brain magnetic resonance imaging analysis intelligence: A review of federated learning and visual foundation models

Zhen Yu, Yang Liu, Qingchao Chen

Engineering applications of artificial intelligence · Aug 2026

Pretraining Multiple Instance Learning Networks with Multi-Teacher Distillation from Pathology Slide Foundation Models

Mingxi Fu, Jiawen Li, Renao Yan, et al.

Jul 2026

LaGuadia: Language-Guided Adaptive Distillation from Pathology Foundation Models

Gangsu Kim, Won-Ki Jeong

Jul 2026

GPFM

#Key Features

#Technical Details

#Applications

#Impact

Citation

A generalizable pathology foundation model using a unified knowledge distillation pretraining framework.

Recent citations

Pretraining Multiple Instance Learning Networks with Multi-Teacher Distillation from Pathology Slide Foundation Models

LaGuadia: Language-Guided Adaptive Distillation from Pathology Foundation Models

Top citations

Related models

Citations

GitHub

HuggingFace

Fields of citing research

Openness

Tags

Resources

GPFM

#Key Features

#Technical Details

#Applications

#Impact

Citation

A generalizable pathology foundation model using a unified knowledge distillation pretraining framework.

Recent citations

Pretraining Multiple Instance Learning Networks with Multi-Teacher Distillation from Pathology Slide Foundation Models

LaGuadia: Language-Guided Adaptive Distillation from Pathology Foundation Models

Top citations

Related models

Citations

GitHub

HuggingFace

Fields of citing research

Openness

Tags

Resources

Key Features

Technical Details

Applications

Impact

Key Features

Technical Details

Applications

Impact