Central Dogma Transformer

Multimodal architecture coupling pretrained DNA, RNA, and protein language models with directional cross-attention into one Virtual Cell Embedding.

Released: January 2026

The Central Dogma Transformer (CDT) is a mechanism-oriented architecture that tries to model cellular information flow the way molecular biology describes it: DNA is transcribed into RNA, and RNA is translated into protein. Rather than training a single monolithic sequence model, CDT integrates three separate pretrained language models—one each for DNA, RNA, and protein—and connects them with directional cross-attention modules that mirror the central dogma. DNA-to- RNA attention is intended to capture transcriptional regulation, while RNA-to- protein attention captures translational relationships, and the combined signal is distilled into a unified representation the author calls a Virtual Cell Embedding.

CDT was developed and released as a single-author preprint by Nobuyuki Ota in January 2026. It is explicitly framed as a proof-of-concept ("CDT v1") and a step toward mechanism-oriented AI for cellular understanding, rather than a production foundation model. The design philosophy contrasts with purely data-driven multimodal models by hard-wiring the directionality of the central dogma into the attention structure, which the author argues yields more interpretable, biologically grounded representations.

The work sits at the intersection of genomic, transcriptomic, and proteomic language modeling, and is positioned as a bridge between single-modality foundation models (such as DNA, RNA, and protein language models) and the emerging goal of integrated "virtual cell" representations.

Key Features

Directional cross-attention along the central dogma: DNA-to-RNA attention models transcriptional regulation and RNA-to-protein attention models translational relationships, encoding biological directionality into the architecture rather than learning it implicitly.
Frozen backbones, trained connectors: The pretrained DNA, RNA, and protein language models are kept frozen; only the cross-attention modules are trained, keeping the approach lightweight and modular.
Unified Virtual Cell Embedding: The three modalities are fused into a single representation intended to summarize a cell's molecular state.
Built-in interpretability: Attention and gradient analyses provide complementary mechanistic insight, including identification of a CTCF binding site corroborated by Hi-C data.
Honest proof-of-concept scope: CDT v1 uses fixed, non-cell-specific RNA and protein embeddings, a limitation the author states explicitly.

Technical Details

CDT is a transformer-based multimodal model that wires together three frozen pretrained language models with trainable directional cross-attention layers. In the v1 proof of concept, the RNA and protein embeddings are fixed rather than cell-state-specific, so the learned coupling is concentrated in the cross- attention connectors. The model was validated on CRISPRi enhancer perturbation data from K562 cells, where it predicted perturbation effects with a Pearson correlation of 0.503—about 63% of an estimated theoretical ceiling of r = 0.797 set by cross-experiment variability. Interpretability analyses combined attention inspection with gradient attribution; the gradient analysis surfaced a CTCF binding site that was consistent with Hi-C chromatin contact evidence, supporting the claim that the architecture captures biologically meaningful regulatory signal.

Applications

CDT is aimed at researchers interested in modeling regulatory information flow across DNA, RNA, and protein within a single framework, particularly for predicting the effects of genomic perturbations such as enhancer CRISPRi screens. Its Virtual Cell Embedding could serve as a feature representation for downstream functional genomics tasks, and its interpretability tooling makes it useful for hypothesis generation about transcriptional regulation, for example locating candidate regulatory elements like CTCF sites. As a v1 prototype it is best suited to methodological exploration rather than turnkey deployment.

Impact

CDT contributes a biologically structured alternative to generic multimodal fusion by encoding the directionality of the central dogma directly into model attention. Its early validation on K562 enhancer perturbation data and its interpretability results are promising signals for mechanism-oriented modeling of the cell. However, the work is a single-author preprint with a clearly stated proof-of-concept scope, fixed non-cell-specific embeddings in v1, and no public code, weights, or license located at the time of writing—so its broader adoption and influence remain to be demonstrated.

Citation

Central Dogma Transformer: Towards Mechanism-Oriented AI for Cellular Understanding

Preprint

Ota, N. (2026) Central Dogma Transformer: Towards Mechanism-Oriented AI for Cellular Understanding. arXiv.org.

DOI: 10.48550/arXiv.2601.01089

Recent citations

Papers that recently cited this model.

Central Dogma Transformer II: An AI Microscope for Understanding Cellular Regulatory Mechanisms
Nobuyuki Ota
arXiv.org · Feb 2026
0

Top citations

The most-cited papers that cite this model.

Central Dogma Transformer II: An AI Microscope for Understanding Cellular Regulatory Mechanisms
Nobuyuki Ota
arXiv.org · Feb 2026
0

Citations

Total Citations1

Influential0

References41

Fields of citing research

Biology100%
Computer Science100%

Share of papers citing this model.

Openness

bio.rodeo opennessClosed · low usability and reproducibility

22Closed

Usability — can I run it?15

Reproducibility — can I retrain it?14

Model Openness Framework

Unclassified

Missing required components

Resources

Research Paper

Key Features

Directional cross-attention along the central dogma: DNA-to-RNA attention models transcriptional regulation and RNA-to-protein attention models translational relationships, encoding biological directionality into the architecture rather than learning it implicitly.

Frozen backbones, trained connectors: The pretrained DNA, RNA, and protein language models are kept frozen; only the cross-attention modules are trained, keeping the approach lightweight and modular.

Unified Virtual Cell Embedding: The three modalities are fused into a single representation intended to summarize a cell's molecular state.

Built-in interpretability: Attention and gradient analyses provide complementary mechanistic insight, including identification of a CTCF binding site corroborated by Hi-C data.

Honest proof-of-concept scope: CDT v1 uses fixed, non-cell-specific RNA and protein embeddings, a limitation the author states explicitly.

Technical Details

Applications

Impact

Central Dogma Transformer

Key Features

Technical Details

Applications

Impact

Citation

Central Dogma Transformer: Towards Mechanism-Oriented AI for Cellular Understanding

Recent citations

Central Dogma Transformer II: An AI Microscope for Understanding Cellular Regulatory Mechanisms

Top citations

Central Dogma Transformer II: An AI Microscope for Understanding Cellular Regulatory Mechanisms

Citations

Fields of citing research

Openness

Tags

Resources

Central Dogma Transformer

Key Features

Technical Details

Applications

Impact

Citation

Central Dogma Transformer: Towards Mechanism-Oriented AI for Cellular Understanding

Recent citations

Central Dogma Transformer II: An AI Microscope for Understanding Cellular Regulatory Mechanisms

Top citations

Central Dogma Transformer II: An AI Microscope for Understanding Cellular Regulatory Mechanisms

Citations

Fields of citing research

Openness

Tags

Resources

Central Dogma Transformer

#Key Features

#Technical Details

#Applications

#Impact

Citation

Central Dogma Transformer: Towards Mechanism-Oriented AI for Cellular Understanding

Recent citations

Central Dogma Transformer II: An AI Microscope for Understanding Cellular Regulatory Mechanisms

Top citations

Central Dogma Transformer II: An AI Microscope for Understanding Cellular Regulatory Mechanisms

Related models

Citations

Fields of citing research

Openness

Tags

Resources

Central Dogma Transformer

#Key Features

#Technical Details

#Applications

#Impact

Citation

Central Dogma Transformer: Towards Mechanism-Oriented AI for Cellular Understanding

Recent citations

Central Dogma Transformer II: An AI Microscope for Understanding Cellular Regulatory Mechanisms

Top citations

Central Dogma Transformer II: An AI Microscope for Understanding Cellular Regulatory Mechanisms

Related models

Citations

Fields of citing research

Openness

Tags

Resources

Key Features

Technical Details

Applications

Impact

Key Features

Technical Details

Applications

Impact