MAP

Knowledge-graph-grounded model that predicts single-cell transcriptomic responses to small molecules, with zero-shot prediction for unprofiled drugs.

Released: February 2026

Predicting how individual cells respond to a drug is central to mechanistic pharmacology and the prioritization of candidate compounds. Existing single-cell perturbation models learn well for molecules with abundant measured profiles, but they generalize poorly to unprofiled drugs because they treat each compound as an isolated identifier, ignoring the mechanistic relationships that connect drugs, their targets, and downstream genes. MAP, introduced in a February 2026 bioRxiv preprint from Shanghai Jiao Tong University, addresses this gap by injecting structured biological knowledge into the perturbation-modeling problem.

The core idea is to ground drug and gene representations in a large knowledge graph before they are used to predict expression changes. MAP first constructs MAP-KG, a purpose-built knowledge graph for cellular perturbation modeling, then pre-trains multimodal embeddings that align a compound's chemical structure, its target proteins, and textual descriptions of its mechanism of action. These knowledge-grounded embeddings let the model reason about a new drug by analogy to mechanistically related compounds it has seen, enabling zero-shot response prediction for molecules with scarce or absent profiles.

Key Features

Knowledge-grounded drug and gene embeddings: Compound and gene representations are pre-trained on a dedicated knowledge graph rather than learned from perturbation data alone, encoding mechanistic context that supports generalization.
Zero-shot prediction for unprofiled drugs: Because embeddings capture mechanism, MAP can predict single-cell responses for small molecules that have no measured perturbation profile in the training data.
Multimodal alignment: A contrastive pre-training strategy aligns molecular structure, protein-sequence features of targets, and natural-language mechanistic descriptions into a shared embedding space.
Purpose-built knowledge graph (MAP-KG): Unifies 14 public resources spanning roughly 187,000 drugs, 23,000 genes, and 694,000 mechanistic relationships.

Technical Details

MAP is built around MAP-KG, a knowledge graph assembled from 14 public databases that covers approximately 187k drugs, 23k genes, and 694k mechanistic relationships (such as drug-target and gene-gene interactions). The framework uses a knowledge-driven pre-training stage in which contrastive learning aligns three modalities for each compound—molecular structure, protein-sequence features of its targets, and textual mechanistic descriptions—into a unified embedding space. These embeddings are then used to predict single-cell expression responses, with the graph-derived context allowing the model to extrapolate to drugs absent from the perturbation training set. As a recent preprint, full architectural hyperparameters, the exact benchmark suite, and code/weight availability should be confirmed against the paper; reported emphasis is on improved zero-shot generalization to unprofiled compounds relative to identifier-based baselines.

Applications

MAP is aimed at computational pharmacology and early drug discovery, where researchers want to anticipate the transcriptional consequences of a candidate compound before committing to expensive single-cell perturbation experiments. By supporting zero-shot prediction, it is particularly useful for triaging large chemical libraries or novel scaffolds for which no Perturb-seq or sci-Plex data exist, and for generating mechanistic hypotheses that connect a drug to specific gene programs.

Impact

MAP reflects a broader shift in single-cell perturbation modeling toward knowledge-informed representations, arguing that mechanistic priors—not just larger expression datasets—are key to generalizing beyond profiled compounds. If its zero-shot gains hold under independent evaluation, the framework could reduce the experimental burden of screening unprofiled drugs. As a February 2026 preprint, its adoption, released resources, and benchmark standing remain to be established through peer review and community use.

Citation

MAP: A Knowledge-driven Framework for Predicting Single-cell Responses for Unprofiled Drugs

Feng, J., et al. (2026) MAP: A Knowledge-driven Framework for Predicting Single-cell Responses for Unprofiled Drugs. bioRxiv.

DOI: 10.64898/2026.02.25.708091

Recent citations

Papers that recently cited this model.

Not enough citation data yet.

Top citations

The most-cited papers that cite this model.

Not enough citation data yet.

Citations

Total Citations0

Influential0

References63

Fields of citing research

Not enough data

Openness

bio.rodeo opennessClosed · low usability and reproducibility

12Closed

Usability — can I run it?9

Reproducibility — can I retrain it?13

Model Openness Framework

Unclassified

Missing required components

Resources

Research Paper

Key Features

Knowledge-grounded drug and gene embeddings: Compound and gene representations are pre-trained on a dedicated knowledge graph rather than learned from perturbation data alone, encoding mechanistic context that supports generalization.

Zero-shot prediction for unprofiled drugs: Because embeddings capture mechanism, MAP can predict single-cell responses for small molecules that have no measured perturbation profile in the training data.

Multimodal alignment: A contrastive pre-training strategy aligns molecular structure, protein-sequence features of targets, and natural-language mechanistic descriptions into a shared embedding space.

Purpose-built knowledge graph (MAP-KG): Unifies 14 public resources spanning roughly 187,000 drugs, 23,000 genes, and 694,000 mechanistic relationships.

Technical Details

Applications

Impact

MAP

Key Features

Technical Details

Applications

Impact

Citation

MAP: A Knowledge-driven Framework for Predicting Single-cell Responses for Unprofiled Drugs

Recent citations

Top citations

Citations

Fields of citing research

Openness

Tags

Resources

MAP

Key Features

Technical Details

Applications

Impact

Citation

MAP: A Knowledge-driven Framework for Predicting Single-cell Responses for Unprofiled Drugs

Recent citations

Top citations

Citations

Fields of citing research

Openness

Tags

Resources

MAP

#Key Features

#Technical Details

#Applications

#Impact

Citation

MAP: A Knowledge-driven Framework for Predicting Single-cell Responses for Unprofiled Drugs

Recent citations

Top citations

Related models

Citations

Fields of citing research

Openness

Tags

Resources

MAP

#Key Features

#Technical Details

#Applications

#Impact

Citation

MAP: A Knowledge-driven Framework for Predicting Single-cell Responses for Unprofiled Drugs

Recent citations

Top citations

Related models

Citations

Fields of citing research

Openness

Tags

Resources

Key Features

Technical Details

Applications

Impact

Key Features

Technical Details

Applications

Impact