Pathology foundation models
Pathology

Pathology Models

Histology and tissue imaging analysis

58 models in this category

What pathology foundation models do

Computational pathology foundation models apply vision transformers and self-supervised learning to whole-slide histology images, learning representations of tissue architecture, cell morphology, and spatial patterns at a scale and consistency no human pathologist can match across a full slide. These models — including UNI, Virchow, and CONCH — are pretrained on millions of histopathology image patches from diverse tissue types, giving them broad generalization across cancer subtypes, staining protocols, and scanning hardware. The self-supervised pretraining strategy means these representations transfer to new tasks without requiring large sets of expert-annotated slides.

Applications: grading, biomarker prediction, and survival analysis

Cancer grading and subtype classification are the most mature applications, with several models matching or exceeding specialist pathologist agreement on standard grading tasks for prostate, colorectal, and breast cancers. Biomarker prediction from H&E stains — inferring genomic features like microsatellite instability, BRCA mutation status, or HER2 amplification directly from routine histology without additional molecular testing — has moved from research into clinical validation studies. Survival analysis and treatment response prediction represent the frontier, where models like CONCH combine pathology image features with clinical text to improve prognostic accuracy.

Notable Models

Top-rated pathology models from our evaluations

LLaVA-Med

Microsoft Research

Released June 1, 2023

1.8K19.7K2.2K

A biomedical vision-language assistant from Microsoft Research, adapted from LLaVA via curriculum learning on PubMed Central figure-caption pairs and GPT-4-generated instructions.

PathologyLanguage model
28Openness

Prov-GigaPath

Microsoft Research

Released May 22, 2024

83861.5K617

Whole-slide pathology foundation model pretrained on 1.3 billion tiles from 171,189 clinical WSIs. Achieves state-of-the-art on 25 of 26 pathology benchmark tasks.

Pathology
58Openness

GPFM

Hong Kong University of Science and Technology +3 others

Released November 1, 2025

37126

A generalizable computational-pathology foundation model trained on ~190M histopathology patches via unified knowledge distillation from UNI, Phikon, and CONCH.

Pathology
84Openness

UNI

Mahmood Lab

Released March 22, 2024

1.4K96K743

Self-supervised pathology foundation model (ViT-L/16, DINOv2) pretrained on 100M+ H&E tiles from 100,000+ whole-slide images. State-of-the-art on 34 pathology tasks.

Pathology
46Openness

Hibou

HistAI

Released June 7, 2024

7750.8K78

DINOv2-based Vision Transformer foundation models for digital pathology, trained on over 1 million whole-slide images. Available as Hibou-B (86M) and Hibou-L (307M) under Apache 2.0.

Pathology
46Openness

MedVInT

Shanghai Jiao Tong University +1 other

Released May 17, 2023

347233

A generative medical visual question answering model that aligns a medical vision encoder with a large language model, trained on the 227k-pair PMC-VQA dataset.

PathologyLanguage model
83Openness

Frequently asked questions

What is a computational pathology foundation model?

A computational pathology foundation model is a vision transformer or similar neural network pretrained on large collections of histopathology images — typically whole-slide images or tiles from H&E, IHC, or other stained tissue sections — to learn visual representations that generalize across tissue types, cancer subtypes, and clinical tasks. These models support downstream applications including cancer grading, biomarker prediction, and survival analysis. UNI, Virchow, and CONCH are well-known examples from academic and industry research groups.

How are pathology foundation models trained?

Most current models use self-supervised learning — typically masked image modeling or contrastive objectives like DINO or SimCLR — on hundreds of thousands to millions of image patches sampled from whole-slide images. This avoids the need for dense pixel-level annotations, which are expensive and slow to produce in pathology. Pretraining on large, diverse slide collections covering many tissue types and diseases produces more generalizable representations than models trained on single-cancer or single-institution datasets.

Can pathology foundation models replace human pathologists?

Not in the general case, and that is not the current goal. These models are designed to augment pathologist workflows — handling high-volume screening tasks, flagging slides for review, and quantifying features that are difficult for humans to measure consistently at scale. In specific, well-defined tasks like CAMELYON lymph node metastasis detection, AI models have demonstrated pathologist-level performance in controlled benchmarks, but deployment in clinical practice requires prospective validation and regulatory approval.

What makes pathology models different from general vision models?

Histopathology images have a highly specific statistical structure — gigapixel whole-slide images processed at multiple magnifications, with diagnostic signals distributed across many small regions rather than concentrated in a single salient object. Effective pathology models must aggregate information across spatial scales and tissue regions, making multi-resolution architectures and efficient attention mechanisms particularly important. Pretraining on general image datasets like ImageNet provides some transfer, but models pretrained specifically on pathology images consistently outperform them on histopathology benchmarks.