University of Cambridge / AstraZeneca
A transformer that predicts protein-RNA binding affinity from Boltz-2 pre-structural embeddings via cross-modal attention, without requiring predicted structures.
ZeroFold addresses a long-standing challenge in structural biology: predicting how tightly a protein binds an RNA molecule. Accurate protein-RNA affinity prediction matters for understanding post-transcriptional gene regulation and for designing RNA-targeting therapeutics, yet it has remained largely unsolved. A central obstacle is the conformational flexibility of RNA, which—unlike most proteins—exists as a dynamic ensemble rather than a single dominant fold. Committing to one predicted structure discards information that is relevant to binding, limiting the usefulness of structure-first prediction pipelines for this problem.
The model's core idea is to bypass explicit structure prediction altogether. Rather than decoding a 3D structure and scoring it, ZeroFold extracts pre-structural embeddings—the intermediate representations produced by a biomolecular foundation model just before its structure-decoding step. The authors argue these embeddings implicitly encode conformational-ensemble information, making them a natural representation for flexible biomolecules such as RNA. ZeroFold builds a trained transformer on top of these embeddings to map a protein-RNA pair directly to a binding-affinity prediction from sequence alone.
ZeroFold was developed by Josef Hanke, Sebastian Pujalte Ojeda, Shengyu Zhang, Werngard Czechtizky, Leonardo De Maria, and Michele Vendruscolo at the Yusuf Hamied Department of Chemistry, University of Cambridge, in collaboration with AstraZeneca's medicinal chemistry group, and was posted to arXiv in March 2026.
ZeroFold is a transformer-based predictor that consumes pre-structural embeddings from Boltz-2 for both the protein and the RNA, fuses them with cross-modal attention, and runs as a fixed checkpoint at inference on new sequences. Training and evaluation use PRADB, a curated set of 2,621 unique protein-RNA pairs with experimental affinities aggregated from four databases. On a held-out test set built with a 40% sequence-identity threshold to limit train-test leakage, ZeroFold reaches a Spearman correlation of 0.65, which the authors describe as approaching the ceiling imposed by experimental measurement noise. Under progressively stricter evaluation that accounts for overlap with competitor training sets, ZeroFold compares favourably with both leading structure-based and leading sequence-based affinity predictors. The preprint does not report a parameter count or full architectural hyperparameters.
ZeroFold targets researchers and drug-discovery teams working on RNA biology and RNA-targeting modalities. Because it estimates binding affinity from sequence without requiring a predicted or experimental complex structure, it is well suited to prioritising candidate protein-RNA pairs at scale, screening RNA-binding proteins, and exploring interactions for which no structural data exist—settings where structure-first pipelines stall. The pre-structural-embedding strategy is also of methodological interest to teams building affinity or interaction models for other flexible biomolecules.
ZeroFold contributes to a growing line of work that repurposes the internal representations of biomolecular foundation models such as Boltz-2, rather than only their final structural outputs, and argues that these intermediate features are particularly valuable for conformationally flexible systems like RNA. By framing protein-RNA affinity prediction as a problem solvable from pre-structural embeddings, it offers a route to a class of predictions that has been difficult for structure-based methods. As of this writing the work is a preprint, and no public code or trained weights have been released; the AstraZeneca collaboration may also constrain weight distribution. Independent benchmarking and a public implementation would help establish how broadly the approach generalises.