GenoME

Changping Laboratory / Peking University

Mixture-of-Experts generative model turning DNA sequence plus cell-type ATAC-seq into unified epigenomic, transcriptomic, and 3D chromatin profiles.

Released: December 2025

The non-coding genome regulates gene expression through a complex, multiscale system in which cell-type-specific histone modifications, transcription factor binding, and three-dimensional genome conformation all interact. Predicting and interpreting this regulatory logic from sequence alone has remained difficult, because each layer is typically modeled by a separate specialized tool and most predictors do not transfer well to cell types absent from their training data. GenoME, introduced in a December 2025 bioRxiv preprint from Changping Laboratory and Peking University, addresses this by jointly modeling these layers in a single generative framework.

GenoME is a Mixture-of-Experts (MoE) generative model that takes a DNA sequence together with a cell-type-specific chromatin accessibility signal (ATAC-seq or DNase-seq) and produces a unified genomic profile spanning epigenomics, transcriptomics, and chromatin architecture, at resolutions ranging from individual base pairs to kilobases. Because chromatin accessibility is supplied as an input rather than learned only from a fixed set of training cell types, GenoME can predict the full regulatory landscape of an unseen or individualized cell type from a single accessibility experiment, without retraining.

Beyond prediction, GenoME ships with an in silico perturbation framework for causal interrogation of regulatory function, positioning it as an all-in-one platform for generative modeling, cross-cell-type generalization, and mechanistic investigation of the regulatory genome. It was developed by Jiachen Wei, Yue Xue, Hao Chai, and Yi Qin Gao.

Key Features

Multimodal unified output: From DNA plus a single accessibility track, GenoME jointly predicts epigenomic marks, transcriptomic signal, and 3D chromatin conformation at base-pair to kilobase resolution, rather than treating each modality with a separate model.
Mixture-of-Experts architecture: The MoE design learns specialized, reusable expert functions for different regulatory concepts, which the authors credit for robust generalization to new cellular contexts.
Cross-cell-type generalization: By conditioning on cell-type-specific ATAC-seq as input, GenoME extends predictions to unseen or individualized cell types from a single accessibility experiment, without model retraining.
In silico perturbation: A built-in perturbation framework forecasts the multimodal consequences of genetic and accessibility perturbations, enabling causal study of regulatory mechanisms.
Enhancer-promoter mapping: GenoME identifies functional enhancer-promoter connections and, per the preprint, outperforms the specialized Activity-by-Contact model on this task, while also helping decipher the transcription factor grammar of cell-type-specific enhancers.
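
To make the Mixture-of-Experts idea concrete, here is a minimal sketch of generic top-k softmax routing. It is not GenoME's architecture: the number of experts, the gating function, and the routing scheme used in the preprint are not given on this page, so the names `moe_forward`, `gate_weights`, and `top_k` are illustrative assumptions.

```python
import math
from typing import Callable, List, Sequence

Vector = Sequence[float]
Expert = Callable[[Vector], List[float]]


def softmax(scores: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def moe_forward(
    x: Vector,
    gate_weights: Sequence[Vector],
    experts: Sequence[Expert],
    top_k: int = 2,
) -> List[float]:
    """Route x to the top_k experts and mix their outputs.

    Hypothetical generic routing, not the preprint's design.
    """
    # One gate logit per expert: dot product of its gate vector with x.
    logits = [sum(w_i * x_i for w_i, x_i in zip(w, x)) for w in gate_weights]
    # Keep only the top_k experts with the highest logits.
    ranked = sorted(range(len(experts)), key=lambda i: logits[i], reverse=True)
    chosen = ranked[:top_k]
    # Renormalize the gate over the chosen experts only.
    weights = softmax([logits[i] for i in chosen])
    outputs = [experts[i](x) for i in chosen]
    # Weighted sum of the chosen experts' outputs, dimension by dimension.
    return [
        sum(w * out[d] for w, out in zip(weights, outputs))
        for d in range(len(outputs[0]))
    ]
```

With sparse routing of this kind, only the selected experts run for a given input. That is one common way an MoE model can hold specialized sub-functions without evaluating all of them on every example.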

Technical Details

GenoME is a generative model built on a Mixture-of-Experts framework. Its inputs are a DNA sequence and a matched cell-type-specific chromatin accessibility profile (ATAC-seq or DNase-seq); its output is a unified, multiscale prediction covering epigenomic, transcriptomic, and 3D-conformation modalities at native base-pair-to-kilobase resolutions. The model is evaluated on held-out genomic regions to assess sequence-level generalization and on held-out cell types to assess cross-context transfer driven by the accessibility input. For regulatory inference, the authors report that GenoME's perturbation-based enhancer-promoter predictions exceed the performance of Activity-by-Contact, a widely used heuristic for linking enhancers to target genes. Exact parameter counts, the number of experts, training-corpus composition, and full benchmark tables are described in the preprint; specific figures are not reproduced here because they could not be independently verified from the abstract and indexed metadata.
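
For context on the baseline, the Activity-by-Contact (ABC) heuristic scores an enhancer-gene pair as the enhancer's activity times its contact frequency with the gene's promoter, divided by the same product summed over all candidate elements near that gene. The sketch below follows the commonly cited formulation, in which activity is the geometric mean of accessibility and H3K27ac signal. It illustrates the baseline only, not GenoME's perturbation-based method; the function names and toy values are assumptions for illustration.

```python
import math
from typing import Dict, Tuple

# name -> (accessibility signal, H3K27ac signal, contact with the gene promoter)
Element = Tuple[float, float, float]


def activity(accessibility: float, h3k27ac: float) -> float:
    """Enhancer activity as the geometric mean of the two signals."""
    return math.sqrt(accessibility * h3k27ac)


def abc_scores(elements: Dict[str, Element]) -> Dict[str, float]:
    """ABC score of each candidate element for a single target gene."""
    # Numerator for each element: activity x contact.
    weighted = {
        name: activity(acc, ac) * contact
        for name, (acc, ac, contact) in elements.items()
    }
    # Denominator: the same product summed over all candidate elements.
    total = sum(weighted.values())
    if total == 0:
        return {name: 0.0 for name in weighted}
    return {name: value / total for name, value in weighted.items()}


# Toy, made-up values for two candidate enhancers of one gene.
scores = abc_scores({"e1": (4.0, 9.0, 0.5), "e2": (1.0, 1.0, 1.0)})
```

Because the scores are normalized per gene, they sum to 1 across that gene's candidate elements, so each score can be read as an element's estimated share of the regulatory input.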

Applications

GenoME is aimed at researchers studying gene regulation, functional genomics, and the interpretation of non-coding variation. Because it produces a complete regulatory profile for a cell type from a single ATAC-seq input, it is useful for characterizing rare, patient-derived, or otherwise data-sparse cellular contexts where comprehensive multi-omic profiling is impractical. Its perturbation framework supports prioritizing candidate regulatory variants, mapping enhancers to their target genes, and dissecting transcription factor grammar, tasks relevant to disease-variant interpretation, enhancer annotation, and the design of cell-type-specific regulatory hypotheses for downstream experimental validation.
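
The perturbation workflow described above can be sketched as a comparison of two predictions, one for the reference input and one for an edited sequence or accessibility track. No public GenoME code was located, so `predict` below is a hypothetical callable standing in for the model, and `toy_predict` is a deliberately trivial stand-in that exists only to make the example runnable.

```python
from typing import Callable, Dict, List, Optional, Sequence

Profile = Dict[str, List[float]]  # modality name -> predicted track
Predictor = Callable[[str, Sequence[float]], Profile]


def mutate(sequence: str, position: int, alt: str) -> str:
    """Return a copy of the sequence with one base substituted."""
    return sequence[:position] + alt + sequence[position + 1:]


def silence(accessibility: Sequence[float], start: int, end: int) -> List[float]:
    """Return a copy of the track with positions [start, end) set to zero."""
    return [0.0 if start <= i < end else v for i, v in enumerate(accessibility)]


def perturbation_effect(
    predict: Predictor,
    sequence: str,
    accessibility: Sequence[float],
    new_sequence: Optional[str] = None,
    new_accessibility: Optional[Sequence[float]] = None,
) -> Profile:
    """Per-modality difference: perturbed prediction minus baseline."""
    baseline = predict(sequence, accessibility)
    perturbed = predict(
        sequence if new_sequence is None else new_sequence,
        accessibility if new_accessibility is None else new_accessibility,
    )
    return {
        modality: [p - b for p, b in zip(perturbed[modality], baseline[modality])]
        for modality in baseline
    }


def toy_predict(sequence: str, accessibility: Sequence[float]) -> Profile:
    """Stand-in model (NOT GenoME): signal only at accessible G/C bases."""
    signal = [a if base in "GC" else 0.0 for base, a in zip(sequence, accessibility)]
    return {"epigenome": signal, "expression": [sum(signal)]}


seq, acc = "ACGT", [1.0, 1.0, 1.0, 1.0]
# Close chromatin over positions 1-2, then read off the predicted change.
closed = perturbation_effect(toy_predict, seq, acc, new_accessibility=silence(acc, 1, 3))
# Substitute a single base and compare against the same baseline.
variant = perturbation_effect(toy_predict, seq, acc, new_sequence=mutate(seq, 0, "G"))
```

The same delta computation applies to any predictor with this input and output shape, which is why sequence variants and accessibility changes can be ranked with one routine.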

Impact

GenoME contributes to a growing class of sequence-to-function models that move beyond single-modality prediction toward unified, conditionable representations of the regulatory genome. Its central design choice, supplying cell-type chromatin accessibility as an input so the model generalizes to unseen cell types without retraining, addresses a recurring limitation of fixed-cell-type predictors and aligns it conceptually with recent multimodal genomic foundation models. As a December 2025 preprint released under a CC BY-NC-ND license, its long-term influence and independent benchmarking remain to be established; no public code repository or model and data cards were located at the time of writing, which currently limits external reproduction and adoption.

Citation

Wei, J., et al. (2025) GenoME: a MoE-based generative model for individualized, multimodal prediction and perturbation of genomic profiles. bioRxiv.

DOI: 10.64898/2025.12.28.696482

Recent citations

Papers that recently cited this model.

STRAND: Sequence-Conditioned Transport for Single-Cell Perturbations
Bo Fu, George Dasoulas, Sameer Gabbita, et al.
arXiv.org · Feb 2026
2

Citations

Total Citations: 1

Influential: 0

References: 64

Fields of citing research

Biology: 100%
Computer Science: 100%

Share of papers citing this model.

Openness

bio.rodeo openness: Closed · low usability and reproducibility

Overall: 8 · Closed

Usability (can I run it?): 7

Reproducibility (can I retrain it?): 10

Model Openness Framework: Unclassified

Restrictive license on core components

Resources

Research Paper
