Boltz-2

Open model that jointly predicts biomolecular structure and small-molecule binding affinity, approaching FEP+ accuracy in seconds on a single GPU.

Released: June 2025

Boltz-2 is an open-source model developed by researchers at MIT CSAIL (Barzilay lab) in collaboration with Recursion Pharmaceuticals. Released in June 2025, it extends Boltz-1 by combining biomolecular structure prediction with direct binding affinity estimation for small-molecule complexes — making it the first openly available model to tackle both tasks within a single unified framework. This represents a meaningful shift in computational drug discovery, where structure prediction and affinity scoring have historically been handled by separate, disconnected pipelines.

The central innovation of Boltz-2 is its ability to predict the 3D binding pose of a small molecule within a protein active site while simultaneously estimating the relative binding free energy of that complex. On standard benchmarking datasets, Boltz-2 achieves a Spearman correlation of approximately 0.6 with FEP+ (free energy perturbation plus), which is considered the gold-standard computational method for binding affinity ranking. Critically, it delivers these predictions in seconds on a single GPU, compared to the hours or days typically required by physics-based free energy approaches running on high-performance computing clusters.

By releasing model weights, training code, and inference utilities under permissive open-source licensing, Boltz-2 lowers the barrier to high-quality affinity prediction for academic labs and smaller biotech organizations that lack access to commercial FEP software or large compute budgets.

Key Features

Joint Structure and Affinity Prediction: Simultaneously predicts the 3D binding pose of a small molecule within a protein active site and estimates relative binding free energy, unifying what previously required two separate modeling pipelines.
FEP+ Correlation: Achieves approximately 0.6 Spearman rank correlation with FEP+ on standard relative binding free energy benchmarks, approaching physics-based accuracy from a machine learning model.
Single-GPU Inference: Completes predictions in seconds per complex on a single consumer or workstation GPU, enabling rapid virtual screening without access to HPC infrastructure.
Open Weights and Code: Released under permissive open-source licensing with publicly available model weights, training code, and inference utilities via the jwohlwend/boltz GitHub repository.
Boltz-1 Foundation: Inherits Boltz-1's competitive structure prediction capabilities for proteins, nucleic acids, and small molecules, with affinity prediction added as an additional output head.

Technical Details

Boltz-2 extends the Boltz-1 diffusion-based architecture with a binding affinity prediction module. The core structure prediction backbone uses a denoising diffusion process to generate 3D atomic coordinates for protein-ligand complexes. A pairwise representation module encodes inter-atomic relationships between protein residues and ligand atoms, providing the geometric context required for both accurate pose prediction and affinity estimation. A learned regression head then operates on these complex-level representations to estimate relative binding free energies without requiring explicit thermodynamic cycle calculations.

Training is multi-task: the model is supervised simultaneously on structure prediction objectives using Protein Data Bank co-crystal structures and on affinity prediction objectives using experimental binding data (IC50, Kd, Ki values) drawn from ChEMBL and related databases. FEP benchmark datasets of curated congeneric series are used for evaluation. On structure prediction, Boltz-2 retains competitive performance on the PoseBusters benchmark for protein-ligand pose quality. On affinity prediction, the approximately 0.6 Spearman correlation with FEP+ is notable given that it is achieved at a tiny fraction of the computational cost; the practical ceiling for FEP+ itself on well-parameterized systems is around 0.8 Spearman correlation.

Applications

Boltz-2 is positioned primarily for structure-based drug discovery workflows. In lead optimization, it can rapidly rank congeneric compound series by predicted binding affinity to guide synthesis prioritization, replacing or augmenting expensive FEP+ campaigns in early discovery. It is also well-suited to virtual screening, where large libraries of candidate compounds are scored against a target in a single pass combining pose prediction and affinity estimation. Medicinal chemists can use Boltz-2 as a fast pre-filter before committing to full relative binding free energy (RBFE) calculations, substantially reducing the number of expensive simulations required. The ability to visualize predicted binding poses alongside affinity estimates also supports structure-guided design decisions about which scaffold positions to modify.

Impact

Boltz-2 represents a significant step toward democratizing binding affinity prediction, a capability that has historically been gated behind expensive commercial software (Schrodinger FEP+) and substantial computational resources. By making a model approaching FEP+ accuracy openly available, it enables academic labs and resource-constrained teams to conduct quantitative affinity-driven lead optimization. Key limitations remain: performance is strongest for congeneric series sharing a common scaffold, absolute affinity prediction and ranking across chemically diverse sets are more challenging, and the model predicts a single low-energy conformation without fully accounting for protein flexibility or allosteric effects. As of its release, the technical paper remains a preprint and has not yet undergone formal peer review. Nonetheless, Boltz-2 continues the trajectory established by Boltz-1 of making state-of-the-art computational biology tools openly accessible, and its unified structure-plus-affinity framework is likely to influence subsequent model designs in the field.

Citation

Boltz-2: Towards Accurate and Efficient Binding Affinity Prediction

Preprint

Passaro, S., et al. (2025) Boltz-2: Towards Accurate and Efficient Binding Affinity Prediction. bioRxiv.

DOI: 10.1101/2025.06.14.659707

Recent citations

Papers that recently cited this model.

From data to scent: validating an ensemble AI model that predicts novel insect odourant interactions
Leroy Bird, Brady Owen, Alaigne Maré, et al.
Journal of Molecular Modeling · Jul 2026
0
Peptide ligand discovery of G protein-coupled receptors
J. Hermes, Marin Matic, Ho Yan Yeung, et al.
Nature Reviews Methods Primers · Jul 2026
0
Characterising AlphaFold 3’s ability to predict T cell antigen specificity
Benjamin McMaster, Ali El Moselhy, Ilija Ilievski, et al.
bioRxiv · Jul 2026
0

Top citations

The most-cited papers that cite this model.

Have protein-ligand cofolding methods moved beyond memorisation?
Peter Škrinjar, Jérôme Eberhardt, G. Tauriello, et al.
bioRxiv · Aug 2025
71
What does AlphaFold3 learn about antibody and nanobody docking, and what remains unsolved?
Fatima N. Hitawala, Jeffrey J. Gray
mAbs · Aug 2025
37
Democratizing AI scientists using ToolUniverse
Shanghua Gao, Richard Zhu, Pengwei Sui, et al.
arXiv.org · Sep 2025
27
STELLA: Towards a Biomedical World Model with Self-Evolving Multimodal Agents
Ruofan Jin, Mingyang Xu, F. Meng, et al.
bioRxiv · Jan 2026
26
Benchmarking all-atom biomolecular structure prediction with FoldBench
Sheng Xu, Qiantai Feng, Lifeng Qiao, et al.
Nature Communications · Dec 2025
22

Citations

Total Citations495

Influential84

References0

GitHub

Stars4.1K

Forks863

Open Issues159

Contributors63

Last Push1mo ago

LanguagePython

LicenseMIT

Fields of citing research

Biology61%
Computer Science53%
Medicine50%
Chemistry34%
Environmental Science6%
Engineering5%
Physics3%
Materials Science3%

Share of papers citing this model.

Openness

bio.rodeo opennessOpen weights · open weights, closed recipe

71Open

Usability — can I run it?91

Reproducibility — can I retrain it?40

Model Openness Framework

Unclassified

No formal model card / data card

Resources

GitHub Repository Research Paper Official Website

Key Features

Joint Structure and Affinity Prediction: Simultaneously predicts the 3D binding pose of a small molecule within a protein active site and estimates relative binding free energy, unifying what previously required two separate modeling pipelines.

FEP+ Correlation: Achieves approximately 0.6 Spearman rank correlation with FEP+ on standard relative binding free energy benchmarks, approaching physics-based accuracy from a machine learning model.

Single-GPU Inference: Completes predictions in seconds per complex on a single consumer or workstation GPU, enabling rapid virtual screening without access to HPC infrastructure.

Open Weights and Code: Released under permissive open-source licensing with publicly available model weights, training code, and inference utilities via the jwohlwend/boltz GitHub repository.

Boltz-1 Foundation: Inherits Boltz-1's competitive structure prediction capabilities for proteins, nucleic acids, and small molecules, with affinity prediction added as an additional output head.

Technical Details

Applications

Impact

Boltz-2

#Key Features

#Technical Details

#Applications

#Impact

Citation

Boltz-2: Towards Accurate and Efficient Binding Affinity Prediction

Recent citations

Top citations

Related models

Citations

GitHub

Fields of citing research

Openness

Tags

Resources

Boltz-2

#Key Features

#Technical Details

#Applications

#Impact

Citation

Boltz-2: Towards Accurate and Efficient Binding Affinity Prediction

Recent citations

Top citations

Related models

Citations

GitHub

Fields of citing research

Openness

Tags

Resources

Key Features

Technical Details

Applications

Impact

Key Features

Technical Details

Applications

Impact