A spatial-transcriptomics foundation model for the tumor microenvironment that produces TME-aware embeddings and enables in silico perturbation from a fixed pretrained checkpoint.
TMEformer is a spatial-transcriptomics foundation model built specifically for the tumor microenvironment (TME), introduced in a bioRxiv preprint posted in May 2026 by a group led by Sujun Chen at Sichuan University. Rather than treating spatial transcriptomics as a generic spatial-omics modality, TMEformer is pretrained on a curated corpus of cancer spatial-transcriptomics datasets and learns representations that capture the spatial dependencies among malignant cells, immune populations, stroma, and vasculature that define how a tumor is organized.
The central problem the model addresses is that the TME is a system of interacting cell populations whose behavior cannot be read off from any single spot in isolation. By learning TME-aware embeddings across many tumors, TMEformer aims to provide a transferable representation of microenvironmental context. Its most distinctive capability is in silico virtual perturbation: from a single fixed pretrained checkpoint, the model can be queried to predict how the microenvironment shifts under hypothetical interventions, without task-specific retraining.
TMEformer joins a growing class of spatial foundation models, but is unusual in being scoped tightly to oncology and in foregrounding perturbation-style inference as a primary use case rather than a downstream add-on.
TMEformer is described as a transformer-based spatial foundation framework; the preprint does not state a precise architecture configuration or parameter count, so these details remain unspecified here pending fuller disclosure. The model is pretrained on a curated corpus of cancer spatial-transcriptomics datasets and produces embeddings used for downstream analysis and for in silico perturbation. Validation reported in the preprint spans multiple independent tumor cohorts, with the same fixed checkpoint applied across cohorts rather than fine-tuned for each. As of this writing, no public code or model weights have been confirmed, and the preprint is released under a CC BY-NC-ND 4.0 license (non-commercial, no derivatives). Because the work is a non-peer-reviewed preprint, its benchmark claims should be read as preliminary.
TMEformer is aimed at cancer-biology and computational-oncology researchers working with spatial transcriptomics who want a microenvironment-aware representation that transfers across tumor cohorts. The in silico perturbation capability is intended to support hypothesis generation about how the tumor microenvironment might respond to interventions, helping prioritize experiments before committing wet-lab resources. Potential use cases include characterizing immune-stromal-tumor spatial organization, comparing microenvironmental states across patients, and screening candidate perturbations computationally.
By framing the tumor microenvironment itself as the modeling target and pairing TME-aware embeddings with checkpoint-level in silico perturbation, TMEformer reflects a broader shift toward spatial foundation models that support counterfactual reasoning rather than static annotation alone. Its demonstrated transfer across independent tumor cohorts without retraining is a meaningful signal for reusability in oncology spatial analysis. That said, real-world impact is currently constrained: the model is a recent preprint with no confirmed public code or weights and a non-commercial, no-derivatives license, so independent reproduction and adoption remain to be established.
Li, S., et al. (2026) Mapping Tumor-Microenvironment dependencies with TMEformer: A spatial foundation framework enabling in silico perturbation. bioRxiv.
DOI: 10.64898/2026.05.17.725770