DoFormer

Columbia University / Biohub / Chan Zuckerberg Biohub New York

Causal multimodal transformer that embeds the do-operator in attention to predict single-cell gene expression under unseen genetic perturbations.

Released: May 2026

DoFormer is a causal multimodal Transformer for predicting how single cells respond to genetic perturbations. Single-cell transcriptomics combined with high-throughput perturbation screens (such as Perturb-seq) can measure the effects of knocking out or activating individual genes, but the combinatorial space of possible perturbations is far larger than any experiment can cover. The central challenge is to predict the transcriptional consequences of perturbations that were never directly measured, including in cell types or contexts outside the training data. DoFormer addresses this by learning a generalizable map from a perturbation to its downstream effect on gene expression.

The model's defining idea is to embed Pearl's causal do-operator directly into the attention mechanism, allowing the network to distinguish observational data (what is seen in unperturbed cells) from interventional data (what happens when a gene is experimentally forced to a new state). Rather than requiring an explicit causal graph (DAG) to be specified or inferred in advance, DoFormer represents interventions natively within its architecture, sidestepping the strong structural assumptions that limit many causal-inference methods in genomics. It is trained once on broad perturbational scRNA-seq data and then applied to new inputs, supporting in-silico perturbation prediction for previously unseen perturbations.

DoFormer was introduced in a 2026 bioRxiv preprint by Karbalayghareh, Paull, and Califano from the Califano lab at Columbia University, affiliated with the Chan Zuckerberg Biohub New York.

Key Features

Causal do-operator in attention: The intervention semantics of the do-operator are built into the attention layers, letting the model reason about interventional regimes rather than only correlational structure.
No DAG assumptions: Unlike many causal methods, DoFormer does not require a predefined or inferred causal graph, avoiding a brittle and often intractable structure-learning step.
Unseen perturbation prediction: The model generalizes to perturbations not present in the training set, enabling in-silico screening across a far larger space than was experimentally measured.
Multimodal design: It integrates observational and interventional signals in a single architecture, distinguishing the two regimes explicitly.
Train-once, apply-broadly: A single model trained on broad perturbational scRNA-seq is applied to new inputs without per-dataset retraining.

Technical Details

DoFormer is a Transformer architecture whose attention is modified to encode the causal do-operator, separating observational from interventional conditioning. It is trained on broad perturbational single-cell RNA-seq data that pairs applied gene perturbations with their measured transcriptional outcomes, and learns to predict expression responses to new perturbations at single-cell resolution. According to the preprint, DoFormer outperforms established single-cell and perturbation-modeling baselines including scGPT, Geneformer, and GEARS on perturbation-response prediction tasks. As a bioRxiv preprint (released 2026-05-04, CC BY-NC 4.0), these results have not yet undergone peer review, and detailed hyperparameters and training-corpus composition should be confirmed against the source.

Applications

DoFormer is aimed at computational and experimental biologists who use perturbation screens to dissect gene-regulatory networks and disease mechanisms. By predicting the effects of perturbations that have not been experimentally tested, it can prioritize candidate targets, guide the design of follow-up Perturb-seq experiments, and support in-silico exploration of intervention effects across cell states. This is particularly valuable for target discovery and mechanistic studies where exhaustive experimental coverage of the perturbation space is infeasible.

Impact

By formalizing interventions through the do-operator inside a Transformer, DoFormer offers a causally grounded alternative to purely correlational single-cell foundation models for perturbation prediction. Its reported gains over scGPT, Geneformer, and GEARS suggest that explicitly modeling the observational/interventional distinction can improve generalization to unseen perturbations. As of this writing the work is a preprint with no public code or weights confirmed, and it is released under a non-commercial license (CC BY-NC 4.0), so independent reproduction and broader adoption remain to be established.

Citation

DoFormer: Causal Transformer for Gene Perturbation

Karbalayghareh, A., et al. (2026) DoFormer: Causal Transformer for Gene Perturbation. bioRxiv.

DOI: 10.64898/2026.05.02.722054

Recent citations

Papers that recently cited this model.

Not enough citation data yet.

Top citations

The most-cited papers that cite this model.

Not enough citation data yet.

Citations

Total Citations168

Influential19

References66

Fields of citing research

Not enough data

Openness

bio.rodeo opennessClosed · low usability and reproducibility

8Closed

Usability — can I run it?7

Reproducibility — can I retrain it?10

Model Openness Framework

Unclassified

Restrictive license on core components

Resources

Research Paper

Key Features

Causal do-operator in attention: The intervention semantics of the do-operator are built into the attention layers, letting the model reason about interventional regimes rather than only correlational structure.

No DAG assumptions: Unlike many causal methods, DoFormer does not require a predefined or inferred causal graph, avoiding a brittle and often intractable structure-learning step.

Unseen perturbation prediction: The model generalizes to perturbations not present in the training set, enabling in-silico screening across a far larger space than was experimentally measured.

Multimodal design: It integrates observational and interventional signals in a single architecture, distinguishing the two regimes explicitly.

Train-once, apply-broadly: A single model trained on broad perturbational scRNA-seq is applied to new inputs without per-dataset retraining.

Technical Details

Applications

Impact

DoFormer

Key Features

Technical Details

Applications

Impact

Citation

DoFormer: Causal Transformer for Gene Perturbation

Recent citations

Top citations

Citations

Fields of citing research

Openness

Tags

Resources

DoFormer

Key Features

Technical Details

Applications

Impact

Citation

DoFormer: Causal Transformer for Gene Perturbation

Recent citations

Top citations

Citations

Fields of citing research

Openness

Tags

Resources

DoFormer

#Key Features

#Technical Details

#Applications

#Impact

Citation

DoFormer: Causal Transformer for Gene Perturbation

Recent citations

Top citations

Related models

Citations

Fields of citing research

Openness

Tags

Resources

DoFormer

#Key Features

#Technical Details

#Applications

#Impact

Citation

DoFormer: Causal Transformer for Gene Perturbation

Recent citations

Top citations

Related models

Citations

Fields of citing research

Openness

Tags

Resources

Key Features

Technical Details

Applications

Impact

Key Features

Technical Details

Applications

Impact