RudolfV

Aignostics / TU Berlin / BIFOLD / Charité – Universitätsmedizin Berlin / German Cancer Research Center (DKFZ) / LMU Munich / Korea University / Max Planck Institute for Informatics

Self-supervised pathology foundation model with a 300M-parameter vision transformer tile encoder, trained on a multi-stain whole-slide image corpus.

Released: January 2024

Parameters: 300 Million

RudolfV is a self-supervised foundation model for computational pathology developed by Aignostics together with academic collaborators including TU Berlin, BIFOLD, Charité – Universitätsmedizin Berlin, the German Cancer Research Center (DKFZ), and LMU Munich. Introduced in January 2024, the model is named after Rudolf Virchow, a founder of modern pathology, reflecting its design philosophy: a foundation model built "by pathologists for pathologists," in which domain expertise guided data curation and evaluation rather than relying purely on scale.

Histopathology slides are extremely heterogeneous, spanning many tissue types, disease entities, staining protocols, and scanner vendors, and most computational pathology models struggle to generalize across this variation or to handle rare diseases. RudolfV addresses this by combining a large, deliberately diverse training corpus with self-supervised pretraining, producing a general-purpose tile encoder whose embeddings transfer to a wide range of downstream diagnostic and biomarker tasks.

The model sits alongside other pathology foundation models such as UNI, Virchow, H-optimus-0, and Hibou, and was among the early efforts to emphasize stain and laboratory diversity as a deliberate design axis rather than simply maximizing the number of hematoxylin-and-eosin slides.

Key Features

Pathologist-guided curation: Data selection and the evaluation framework were shaped by practicing pathologists, prioritizing tissue, disease, and staining diversity over raw slide count.
Multi-stain coverage: Training data spans roughly 97 unique staining types, including H&E (about 70%), immunohistochemistry (about 10%), and other histochemical stains, broadening applicability beyond H&E-only models.
Cross-laboratory diversity: Slides were sourced from more than 15 laboratories across the EU and US, covering 58 tissue types to improve robustness to scanner and protocol variation.
Strong downstream transfer: As a frozen feature extractor, RudolfV matches or surpasses contemporary foundation models on tile-level benchmarks and tumor microenvironment and biomarker tasks.

Technical Details

RudolfV is a Vision Transformer (ViT-L/14, roughly 300 million parameters) pretrained with a DINOv2-style self-supervised objective. The training adaptation samples a specific distribution over slide groups and tissue clusters and extends the standard augmentation pipeline with stain variations to encourage stain-invariant representations. The corpus comprises 103,849 whole-slide images from 35,784 cases, from which about 791 million tiles were extracted and roughly 751 million retained after filtering. Pretraining used a batch size of 960 on 16 A100-40GB GPUs for 625,000 iterations. Evaluated as a frozen encoder across benchmarks including PCam, MHIST, CRC-100K, MSI prediction in colorectal and gastric cancer, and tumor-infiltrating-lymphocyte detection, RudolfV reports competitive or state-of-the-art performance relative to other foundation models of its era while using comparatively fewer slides.

Applications

RudolfV serves as a backbone for computational pathology workflows in both clinical research and biopharma. Its embeddings support tasks such as cancer subtyping, nuclear and tissue segmentation, microsatellite-instability and other biomarker prediction, and tumor microenvironment profiling, typically by training lightweight heads on the frozen features. Aignostics has described RudolfV as the base model underlying its histopathology product work, making it relevant to diagnostic-support tooling, translational research, and clinical-trial biomarker analysis.

Impact

RudolfV helped establish data diversity and pathologist-informed curation, rather than slide count alone, as decisive factors for pathology foundation models, demonstrating competitive performance from a curated multi-stain corpus. It serves as the predecessor to Aignostics' later Atlas model (developed with Mayo Clinic and Charité) and is frequently cited in surveys of computational pathology foundation models. Practical adoption outside Aignostics is constrained by access terms: the work is released under a CC BY-NC-ND 4.0 license, and weights are distributed through Aignostics rather than as a fully open release, which limits broad academic reuse compared with openly licensed alternatives.

Citation

RudolfV: A Foundation Model by Pathologists for Pathologists

Preprint

Dippel, J., et al. (2024) RudolfV: A Foundation Model by Pathologists for Pathologists. arXiv.org.

DOI: 10.48550/arXiv.2401.04079

Recent citations

Papers that recently cited this model.

Mitigating Batch Effects in Histopathology via Language-Mediated Robust Embedding Generation
Yishu Zhang, Shushan Wu, Zhen-Ze Zhang, et al.
Jun 2026
0
Towards robust foundation models for digital pathology
Jonah Kömen, Edwin D. de Jong, Julius Hense, et al.
Nature Communications · Jun 2026
20
Atlas H&E-TME: Scalable AI-Based Tissue Profiling at Expert Pathologist-Level Accuracy
K. Standvoss, Miriam Hagele, R. Krupar, et al.
Jun 2026
0Influential

Top citations

The most-cited papers that cite this model.

A foundation model for clinical-grade computational pathology and rare cancers detection
E. Vorontsov, A. Bozkurt, Adam Casson, et al.
Nature Medicine · Jul 2024
485
Virchow2: Scaling Self-Supervised Mixed Magnification Models in Pathology
Eric Zimmermann, E. Vorontsov, Julian Viret, et al.
arXiv.org · Aug 2024
202Influential
Virchow: A Million-Slide Digital Pathology Foundation Model
E. Vorontsov, A. Bozkurt, Adam Casson, et al.
arXiv.org · Sep 2023
142Influential
A clinical benchmark of public self-supervised pathology foundation models
Gabriele Campanella, Shengjia Chen, Ruchika Verma, et al.
Nature Communications · Jul 2024
120
PRISM: A Multi-Modal Generative Foundation Model for Slide-Level Histopathology
George Shaikovski, Adam Casson, Kristen Severson, et al.
arXiv.org · May 2024
92Influential

Citations

Total Citations74

Influential9

References72

Fields of citing research

Computer Science96%
Medicine93%
Biology19%
Engineering13%
Mathematics1%
Philosophy1%
Geology1%
Physics1%

Share of papers citing this model.

Openness

bio.rodeo opennessClosed · low usability and reproducibility

9Closed

Usability — can I run it?9

Reproducibility — can I retrain it?5

Model Openness Framework

Unclassified

Restrictive license on core components

Resources

Research Paper Official Website

Key Features

Pathologist-guided curation: Data selection and the evaluation framework were shaped by practicing pathologists, prioritizing tissue, disease, and staining diversity over raw slide count.

Multi-stain coverage: Training data spans roughly 97 unique staining types, including H&E (about 70%), immunohistochemistry (about 10%), and other histochemical stains, broadening applicability beyond H&E-only models.

Cross-laboratory diversity: Slides were sourced from more than 15 laboratories across the EU and US, covering 58 tissue types to improve robustness to scanner and protocol variation.

Strong downstream transfer: As a frozen feature extractor, RudolfV matches or surpasses contemporary foundation models on tile-level benchmarks and tumor microenvironment and biomarker tasks.

Technical Details

Applications

Impact

Recent citations

Papers that recently cited this model.

Mitigating Batch Effects in Histopathology via Language-Mediated Robust Embedding Generation

Yishu Zhang, Shushan Wu, Zhen-Ze Zhang, et al.

Jun 2026

Towards robust foundation models for digital pathology

Jonah Kömen, Edwin D. de Jong, Julius Hense, et al.

Nature Communications · Jun 2026

Atlas H&E-TME: Scalable AI-Based Tissue Profiling at Expert Pathologist-Level Accuracy

K. Standvoss, Miriam Hagele, R. Krupar, et al.

Jun 2026

0Influential

RudolfV

#Key Features

#Technical Details

#Applications

#Impact

Citation

RudolfV: A Foundation Model by Pathologists for Pathologists

Recent citations

Mitigating Batch Effects in Histopathology via Language-Mediated Robust Embedding Generation

Atlas H&E-TME: Scalable AI-Based Tissue Profiling at Expert Pathologist-Level Accuracy

Top citations

Related models

Citations

Fields of citing research

Openness

Tags

Resources

RudolfV

#Key Features

#Technical Details

#Applications

#Impact

Citation

RudolfV: A Foundation Model by Pathologists for Pathologists

Recent citations

Mitigating Batch Effects in Histopathology via Language-Mediated Robust Embedding Generation

Atlas H&E-TME: Scalable AI-Based Tissue Profiling at Expert Pathologist-Level Accuracy

Top citations

Related models

Citations

Fields of citing research

Openness

Tags

Resources

Key Features

Technical Details

Applications

Impact

Key Features

Technical Details

Applications

Impact