TMEformer

Spatial transcriptomics foundation model for the tumor microenvironment, giving TME-aware embeddings and in silico perturbation from one checkpoint.

Released: May 2026

TMEformer is a spatial-transcriptomics foundation model built specifically for the tumor microenvironment (TME), introduced in a bioRxiv preprint posted in May 2026 by a group led by Sujun Chen at Sichuan University. Rather than treating spatial transcriptomics as a generic spatial-omics modality, TMEformer is pretrained on a curated corpus of cancer spatial-transcriptomics datasets and learns representations that capture the spatial dependencies among malignant cells, immune populations, stroma, and vasculature that define how a tumor is organized.

The central problem the model addresses is that the TME is a system of interacting cell populations whose behavior cannot be read off from any single spot in isolation. By learning TME-aware embeddings across many tumors, TMEformer aims to provide a transferable representation of microenvironmental context. Its most distinctive capability is in silico virtual perturbation: from a single fixed pretrained checkpoint, the model can be queried to predict how the microenvironment shifts under hypothetical interventions, without task-specific retraining.

TMEformer joins a growing class of spatial foundation models, but is unusual in being scoped tightly to oncology and in foregrounding perturbation-style inference as a primary use case rather than a downstream add-on.

Key Features

TME-aware embeddings: Pretraining on cancer spatial-transcriptomics data yields representations that encode the spatial relationships among the cell populations that constitute the tumor microenvironment.
In silico perturbation: The model supports virtual perturbation queries from a fixed pretrained checkpoint, letting researchers probe predicted microenvironmental responses computationally rather than experimentally.
Checkpoint reuse without retraining: A single pretrained checkpoint is applied directly to new analyses, avoiding per-dataset or per-task retraining.
Cross-cohort validation: The framework was evaluated across independent tumor cohorts without retraining, providing evidence of transfer beyond the pretraining corpus.
Oncology-focused corpus: Training on a curated collection of cancer spatial-transcriptomics datasets specializes the model for tumor biology rather than general tissue contexts.

Technical Details

TMEformer is described as a transformer-based spatial foundation framework; the preprint does not state a precise architecture configuration or parameter count, so these details remain unspecified here pending fuller disclosure. The model is pretrained on a curated corpus of cancer spatial-transcriptomics datasets and produces embeddings used for downstream analysis and for in silico perturbation. Validation reported in the preprint spans multiple independent tumor cohorts, with the same fixed checkpoint applied across cohorts rather than fine-tuned for each. As of this writing, no public code or model weights have been confirmed, and the preprint is released under a CC BY-NC-ND 4.0 license (non-commercial, no derivatives). Because the work is a non-peer-reviewed preprint, its benchmark claims should be read as preliminary.

Applications

TMEformer is aimed at cancer-biology and computational-oncology researchers working with spatial transcriptomics who want a microenvironment-aware representation that transfers across tumor cohorts. The in silico perturbation capability is intended to support hypothesis generation about how the tumor microenvironment might respond to interventions, helping prioritize experiments before committing wet-lab resources. Potential use cases include characterizing immune-stromal-tumor spatial organization, comparing microenvironmental states across patients, and screening candidate perturbations computationally.

Impact

By framing the tumor microenvironment itself as the modeling target and pairing TME-aware embeddings with checkpoint-level in silico perturbation, TMEformer reflects a broader shift toward spatial foundation models that support counterfactual reasoning rather than static annotation alone. Its demonstrated transfer across independent tumor cohorts without retraining is a meaningful signal for reusability in oncology spatial analysis. That said, real-world impact is currently constrained: the model is a recent preprint with no confirmed public code or weights and a non-commercial, no-derivatives license, so independent reproduction and adoption remain to be established.

Citation

Mapping Tumor-Microenvironment dependencies with TMEformer: A spatial foundation framework enabling in silico perturbation

Li, S., et al. (2026) Mapping Tumor-Microenvironment dependencies with TMEformer: A spatial foundation framework enabling in silico perturbation. bioRxiv.

DOI: 10.64898/2026.05.17.725770

Recent citations

Papers that recently cited this model.

Not enough citation data yet.

Top citations

The most-cited papers that cite this model.

Not enough citation data yet.

Citations

Total Citations0

Influential0

References20

Fields of citing research

Not enough data

Openness

bio.rodeo opennessClosed · low usability and reproducibility

10Closed

Usability — can I run it?7

Reproducibility — can I retrain it?13

Model Openness Framework

Unclassified

Restrictive license on core components

Resources

Research Paper

Key Features

TME-aware embeddings: Pretraining on cancer spatial-transcriptomics data yields representations that encode the spatial relationships among the cell populations that constitute the tumor microenvironment.

In silico perturbation: The model supports virtual perturbation queries from a fixed pretrained checkpoint, letting researchers probe predicted microenvironmental responses computationally rather than experimentally.

Checkpoint reuse without retraining: A single pretrained checkpoint is applied directly to new analyses, avoiding per-dataset or per-task retraining.

Cross-cohort validation: The framework was evaluated across independent tumor cohorts without retraining, providing evidence of transfer beyond the pretraining corpus.

Oncology-focused corpus: Training on a curated collection of cancer spatial-transcriptomics datasets specializes the model for tumor biology rather than general tissue contexts.

Technical Details

Applications

Impact

Citation

Mapping Tumor-Microenvironment dependencies with TMEformer: A spatial foundation framework enabling in silico perturbation

Li, S., et al. (2026) Mapping Tumor-Microenvironment dependencies with TMEformer: A spatial foundation framework enabling in silico perturbation. bioRxiv.

DOI: 10.64898/2026.05.17.725770

TMEformer

Key Features

Technical Details

Applications

Impact

Citation

Mapping Tumor-Microenvironment dependencies with TMEformer: A spatial foundation framework enabling in silico perturbation

Recent citations

Top citations

Citations

Fields of citing research

Openness

Tags

Resources

TMEformer

Key Features

Technical Details

Applications

Impact

Citation

Mapping Tumor-Microenvironment dependencies with TMEformer: A spatial foundation framework enabling in silico perturbation

Recent citations

Top citations

Citations

Fields of citing research

Openness

Tags

Resources

TMEformer

#Key Features

#Technical Details

#Applications

#Impact

Citation

Mapping Tumor-Microenvironment dependencies with TMEformer: A spatial foundation framework enabling in silico perturbation

Recent citations

Top citations

Related models

Citations

Fields of citing research

Openness

Tags

Resources

TMEformer

#Key Features

#Technical Details

#Applications

#Impact

Citation

Mapping Tumor-Microenvironment dependencies with TMEformer: A spatial foundation framework enabling in silico perturbation

Recent citations

Top citations

Related models

Citations

Fields of citing research

Openness

Tags

Resources

Key Features

Technical Details

Applications

Impact

Key Features

Technical Details

Applications

Impact