University of Texas at Arlington / St. Jude Children's Research Hospital
Foundation-model-guided virtual staining that generates four IHC markers (HER2, Ki67, ER, PR) from H&E using a single SPADE-UNet conditioned on a frozen UNI encoder.
UNIStainNet is a virtual staining model that computationally generates immunohistochemistry (IHC) stains directly from routine hematoxylin and eosin (H&E) microscopy images. Whereas conventional IHC requires additional tissue sections, antibody reagents, and laboratory turnaround for each marker, UNIStainNet predicts the appearance of stained tissue in silico, offering preliminary molecular insight from a single H&E section. It was developed by the Luber Lab at the University of Texas at Arlington in collaboration with St. Jude Children's Research Hospital, and released as a preprint in March 2026.
The model's central idea is to guide image-to-image translation with the semantic priors of a pathology foundation model. Rather than learning tissue structure from scratch, the generator is conditioned on dense spatial tokens extracted from a frozen UNI encoder (a ViT-L/16 self-supervised pathology foundation model from the Mahmood Lab). These tokens supply tissue-level semantic context that helps the generator produce stains that are spatially coherent and biologically plausible.
A distinguishing feature of UNIStainNet is that a single unified model handles four clinically important breast-cancer biomarkers — HER2, Ki67, ER, and PR — through learned stain embeddings, rather than training a separate network per marker. This consolidates what would otherwise be four models into one, simplifying deployment while maintaining state-of-the-art distributional fidelity across all four stains simultaneously.
The generator is a SPADE-UNet with approximately 42 million trainable parameters (excluding the frozen UNI backbone), trained adversarially against an 11M-parameter multi-scale PatchGAN discriminator. Conditioning combines dense UNI spatial tokens with FiLM-based stain embeddings and an edge encoder. The model was trained and evaluated on the MIST dataset (four IHC markers at 512×512) and the BCI dataset (HER2, paired H&E/IHC at 1024×1024). On the MIST four-stain unified benchmark it reports FID of 27.2–34.5 and SSIM of 0.23–0.28 across stains (e.g., Ki67 FID 27.2, SSIM 0.282), and on BCI it reports FID 34.6 and SSIM 0.541. At 1024×1024 resolution the model achieves a Pearson correlation of 0.961 with a low DAB KL divergence (0.099) on MIST, indicating faithful preservation of stain intensity distributions. Evaluation uses FID, KID, LPIPS, SSIM, and DAB-intensity metrics.
UNIStainNet targets computational pathology workflows in oncology, where it can provide rapid, low-cost preliminary readouts of breast-cancer biomarkers from standard H&E slides without consuming additional tissue or reagents. Potential beneficiaries include pathologists triaging cases for downstream molecular workup, research groups studying biomarker spatial distributions across large slide cohorts, and resource-limited settings where physical IHC is costly or slow. As a research tool it supports method development in virtual staining and benchmarking of foundation-model-guided image translation.
UNIStainNet contributes to a growing line of work showing that pathology foundation models such as UNI can serve not only as feature extractors for classification but also as semantic guidance for generative tasks. Demonstrating that a single conditioned generator can match state-of-the-art per-stain performance across four markers is a practical step toward consolidated, deployable virtual-staining systems. As a recent preprint, its clinical utility remains to be validated; virtual stains are not a substitute for physical IHC in diagnosis, and reported gains are measured against distributional and intensity metrics rather than pathologist-graded scoring. The training code and a public demo are available, though pretrained UNIStainNet weights were not released in the repository at the time of writing, and use of the UNI backbone is subject to its own license.