Hong Kong University of Science and Technology / Sun Yat-sen University / Southern Medical University / Chinese University of Hong Kong
A generalizable computational-pathology foundation model trained on ~190M histopathology patches via unified knowledge distillation from UNI, Phikon, and CONCH.
GPFM (Generalizable Pathology Foundation Model) is a self-supervised tile encoder for computational pathology that produces general-purpose visual features from hematoxylin and eosin (H&E) and other histopathology image patches. Developed by Jiabo Ma, Hao Chen, and colleagues at the Hong Kong University of Science and Technology in collaboration with pathology departments at Sun Yat-sen University, Southern Medical University, and the Chinese University of Hong Kong, GPFM was published in Nature Biomedical Engineering in 2025. It addresses a recurring problem in digital pathology: individual foundation models often excel on the narrow task they were tuned for but generalize unevenly across the broad spectrum of slide-level, region-level, and multimodal pathology tasks encountered in practice.
The model's central innovation is a unified knowledge distillation pretraining framework. Rather than learning solely from raw images, GPFM distills knowledge from several existing expert pathology encoders — UNI, Phikon, and CONCH — while simultaneously applying self-distillation, combining expert guidance with masked-image-modeling and self-supervised objectives. This lets a single student network inherit complementary strengths (UNI's classification accuracy, Phikon's prognostic signal, and CONCH's vision-language alignment) instead of specializing narrowly.
GPFM sits alongside pathology foundation models such as UNI, Virchow, CONCH, and Phikon, but distinguishes itself by being explicitly optimized and benchmarked for breadth, with the authors assembling one of the largest task suites used to evaluate a pathology encoder to date.
GPFM uses a ViT-L/14 Vision Transformer backbone in a DINOv2 configuration, producing 1024-dimensional feature embeddings from 224×224 RGB tiles (sampled from larger tiles at 40× magnification). Pretraining couples a masked image modeling loss with a DINO self-distillation loss, and adds an expert distillation objective that aligns the student's [CLS] and [PATCH] tokens to those of the UNI, Phikon, and CONCH teacher models using cosine similarity and smooth L1 distance. The training corpus comprises approximately 190 million patches extracted from roughly 72,000 publicly available slides covering 34 tissue types. On the assembled benchmark of 72 tasks across six task families, GPFM achieves an average rank of 1.6 and ranks first on 42 tasks, compared with an average rank of 3.7 and six first-place finishes for UNI.
GPFM serves computational-pathology researchers and clinical AI developers who need a single, reliable feature extractor across many downstream tasks. Its embeddings feed multiple-instance-learning pipelines for whole-slide cancer diagnosis and subtyping, patient-level survival and prognosis modeling, region-of-interest tissue classification and retrieval, and multimodal workflows such as pathology visual question answering and report generation. Because the weights are openly available under a permissive license, labs can extract features for their own annotated cohorts without retraining a large encoder from scratch, accelerating both methods research and translational pipeline development.
By demonstrating that distilling several specialized pathology encoders into one student can outperform each teacher across a wide task suite, GPFM advances knowledge distillation as a practical strategy for building general-purpose pathology foundation models. Its publication in Nature Biomedical Engineering, paired with openly released MIT-licensed weights and a large, multi-family benchmark, gives the community both a strong off-the-shelf encoder and a more demanding yardstick for evaluating generalization. The main limitation is that pretraining and benchmarking draw heavily on publicly available cohorts, so performance on novel scanners, stains, rare tissue types, and prospective clinical populations warrants further external validation.
Ma, J., et al. (2025) A generalizable pathology foundation model using a unified knowledge distillation pretraining framework.. Nature Biomedical Engineering.
DOI: 10.1038/s41551-025-01488-4Papers that recently cited this model.
The most-cited papers that cite this model.
Not enough data