FOCUS

Generative foundation model that imputes genes and denoises spatial transcriptomics, conditioned on H&E histology, scRNA-seq, and spatial priors.

Released: December 2025

FOCUS is a generative foundation model for enhancing spatial transcriptomics (ST) data, developed by researchers in the Department of Clinical Neurosciences at the University of Cambridge and released as a bioRxiv preprint in December 2025. Spatial transcriptomics measures gene expression while preserving the physical location of cells within a tissue, but real ST data is limited by platform-dependent trade-offs between resolution, gene-panel breadth, capture sensitivity, and dropout. FOCUS addresses these limitations by learning to reconstruct and enhance ST measurements rather than treating each platform's output as a fixed observation.

What distinguishes FOCUS is that it conditions enhancement on three complementary sources of biological signal at once: the paired hematoxylin-and-eosin (H&E) histology image, a single-cell RNA-seq (scRNA-seq) reference of the relevant tissue, and spatial co-expression priors that capture how genes covary across neighboring locations. By fusing morphology, a single-cell expression reference, and spatial structure, the model can impute missing genes, denoise sparse counts, and increase effective resolution in a way that is anchored to both tissue appearance and known cell-state biology.

FOCUS is positioned as a unified, platform-agnostic model. It is trained across many ST technologies and is reported to generalize to unseen platforms and rare disease tissues in a zero-shot setting, making it a general-purpose enhancement layer that sits between raw spatial assays and downstream analyses such as cell typing and spatial domain discovery.

Key Features

Multimodal conditioning: Jointly conditions on H&E images, an scRNA-seq reference, and spatial co-expression priors, integrating tissue morphology, single-cell expression, and neighborhood structure into a single generative process.
Platform-agnostic enhancement: Trained across a wide range of spatial transcriptomics technologies and reported to achieve state-of-the-art results on 10 ST platforms, providing one model rather than platform-specific tools.
Zero-shot generalization: Transfers to an unseen platform (Open-ST) and to rare tumor datasets, including craniopharyngioma and head-and-neck squamous cell carcinoma (HNSCC), without retraining.
Histology-grounded imputation: Uses the paired H&E image as a dense, high-resolution prior, allowing expression enhancement to respect tissue boundaries and morphological context.

Technical Details

FOCUS is a generative model trained on a large multimodal corpus of paired histology and spatial transcriptomics data — more than 1.7 million H&E–ST pairs together with over 5.8 million single-cell expression profiles. This scale of paired supervision lets the model learn cross-modal relationships between tissue appearance and gene expression as well as the spatial co-expression structure that links neighboring measurements. The conditioning design enables enhancement tasks such as gene imputation, denoising, and resolution improvement to draw simultaneously on morphology (from H&E), reference cell-state distributions (from scRNA-seq), and spatial priors. The authors report state-of-the-art performance across 10 spatial transcriptomics platforms and demonstrate zero-shot application to the Open-ST platform and to rare craniopharyngioma and HNSCC datasets that were not part of training. As of the preprint, no public code or trained weights have been released, which currently limits independent benchmarking and deployment.

Applications

FOCUS targets researchers and pathologists working with spatial transcriptomics across basic, translational, and clinical settings. By imputing unmeasured genes and denoising sparse signal, it can extend limited gene panels, recover expression in low-capture regions, and raise effective resolution — improving downstream cell-type annotation, spatial domain delineation, and ligand-receptor analysis. Its histology grounding is especially valuable in oncology and neuropathology, where paired H&E slides are routine and where rare tumor entities (such as craniopharyngioma) provide too little data to train bespoke models. Because it generalizes zero-shot to new platforms, FOCUS can also serve as a common enhancement step in pipelines that combine data from heterogeneous spatial assays.

Impact

FOCUS reflects a growing trend toward large, multimodal foundation models that bridge digital pathology and spatial omics, joining a wave of models that learn jointly from histology images and molecular measurements. Its emphasis on enhancement — rather than de novo prediction — and its reported ability to generalize across platforms and to rare disease tissues address two persistent pain points in the ST field: data sparsity and platform fragmentation. The main limitations are that the work is a preprint, that the strongest claims (state-of-the-art across 10 platforms, zero-shot transfer) await independent confirmation, and that the absence of released code or weights makes the model difficult to reproduce or use in practice at this time.

Citation

A Foundational Generative Model for Cross-platform Unified Enhancement of Spatial Transcriptomics

Wang, X., et al. (2026) A Foundational Generative Model for Cross-platform Unified Enhancement of Spatial Transcriptomics. bioRxiv.

DOI: 10.64898/2025.12.23.696267

Recent citations

Papers that recently cited this model.

Not enough citation data yet.

Top citations

The most-cited papers that cite this model.

Not enough citation data yet.

Citations

Total Citations0

Influential0

References0

Fields of citing research

Not enough data

Openness

bio.rodeo opennessClosed · low usability and reproducibility

4Closed

Usability — can I run it?7

Reproducibility — can I retrain it?0

not reproducible

Model Openness Framework

Unclassified

Restrictive license on core components

Resources

Research Paper

Key Features

Multimodal conditioning: Jointly conditions on H&E images, an scRNA-seq reference, and spatial co-expression priors, integrating tissue morphology, single-cell expression, and neighborhood structure into a single generative process.

Platform-agnostic enhancement: Trained across a wide range of spatial transcriptomics technologies and reported to achieve state-of-the-art results on 10 ST platforms, providing one model rather than platform-specific tools.

Zero-shot generalization: Transfers to an unseen platform (Open-ST) and to rare tumor datasets, including craniopharyngioma and head-and-neck squamous cell carcinoma (HNSCC), without retraining.

Histology-grounded imputation: Uses the paired H&E image as a dense, high-resolution prior, allowing expression enhancement to respect tissue boundaries and morphological context.

Technical Details

Applications

Impact

FOCUS

Key Features

Technical Details

Applications

Impact

Citation

A Foundational Generative Model for Cross-platform Unified Enhancement of Spatial Transcriptomics

Recent citations

Top citations

Citations

Fields of citing research

Openness

Tags

Resources

FOCUS

Key Features

Technical Details

Applications

Impact

Citation

A Foundational Generative Model for Cross-platform Unified Enhancement of Spatial Transcriptomics

Recent citations

Top citations

Citations

Fields of citing research

Openness

Tags

Resources

FOCUS

#Key Features

#Technical Details

#Applications

#Impact

Citation

A Foundational Generative Model for Cross-platform Unified Enhancement of Spatial Transcriptomics

Recent citations

Top citations

Related models

Citations

Fields of citing research

Openness

Tags

Resources

FOCUS

#Key Features

#Technical Details

#Applications

#Impact

Citation

A Foundational Generative Model for Cross-platform Unified Enhancement of Spatial Transcriptomics

Recent citations

Top citations

Related models

Citations

Fields of citing research

Openness

Tags

Resources

Key Features

Technical Details

Applications

Impact

Key Features

Technical Details

Applications

Impact