All-atom generative model for de novo protein and peptide binder design against diverse biomolecular targets, wet-lab validated across 26 targets.
BoltzGen is an all-atom generative model for de novo binder design, developed by researchers in the Barzilay and Jaakkola labs at MIT and released as a preprint in November 2025. Rather than predicting the structure of an existing complex, BoltzGen designs entirely new protein and peptide binders against a specified target, generating both backbone geometry and sequence. The goal stated by its authors is "universal" binder design: a single model that can produce binders across a wide range of target classes rather than a method tuned for one interaction type.
It is important not to confuse BoltzGen with the earlier Boltz-1 and Boltz-2 models from the same broader research community. Those are structure prediction models — Boltz-1 predicts biomolecular complex structures and Boltz-2 adds binding affinity estimation. BoltzGen is a fundamentally different tool: a generative design model whose output is a novel binder candidate, not a predicted structure of a known complex. In practice BoltzGen relies on a structure predictor (Boltz-2) downstream to evaluate and filter its generated designs, but the design step itself is the model's defining contribution.
What distinguishes BoltzGen is the breadth of targets it addresses within one framework and the scale of experimental validation reported. The authors describe wet-lab testing across 26 targets spanning 8 distinct design campaigns, recovering nanomolar binders for roughly 66% of the targets attempted — a level of cross-target generality that has historically been difficult for binder design methods to achieve.
BoltzGen uses a diffusion-based backbone generation model paired with an inverse folding model that predicts amino acid sequences for the generated backbones. The default configuration ships two design checkpoints — one optimized for diversity and one for adherence to design constraints — and uses a separate Boltz-2 model for downstream structure prediction and scoring. The model operates on full all-atom representations and uses canonical mmCIF residue indexing, with support for explicit constraints such as disulfide bonds and chemical cross-links. Pretrained weights total roughly 6 GB and download automatically on first use. The primary training source is Protein Data Bank structural data; an additional distillation dataset used for the larger model variant had not been publicly released at the time of writing. A typical design run generates on the order of 10,000–60,000 intermediate candidates that are then filtered to a small final set, and the workflow is GPU-based (tested on A100 hardware). The headline experimental result is the recovery of nanomolar binders for approximately 66% of the 26 wet-lab targets across 8 campaigns.
BoltzGen is aimed at researchers who need novel binding proteins or peptides for targets that lack existing binders, including therapeutic antibody and nanobody engineering, peptide therapeutics, biosensor and diagnostic reagent development, and tool-binder generation for basic research. Because it operates at the all-atom level and supports small-molecule targets, it is also applicable to designing proteins that recognize specific ligands or chemically modified epitopes. Its constraint system makes it useful for engineering tasks such as introducing stabilizing disulfides or designing symmetric assemblies. The open-source release — pip-installable with pretrained weights, training, and inference code under an MIT license — lowers the barrier for academic and biotech groups to incorporate de novo binder design into their pipelines.
BoltzGen extends the open Boltz ecosystem from structure and affinity prediction into generative design, and its central claim — competitive binder design across many target classes from a single model, backed by validation on 26 experimental targets — addresses one of the most practically valuable and difficult problems in protein engineering. The breadth of reported wet-lab success distinguishes it from binder design methods that demonstrate efficacy on only a handful of targets. As a preprint, its results await peer review, and some assets (notably the distillation dataset behind the larger variant) were not yet released, while a weights-specific license was not explicitly stated even though the code carries an MIT license. Even so, the rapid community uptake of the codebase and its integration with the widely used Boltz structure predictors position BoltzGen as an influential entry in the fast-moving field of computational binder design.
Stark, H., et al. (2025) BoltzGen: Toward Universal Binder Design. bioRxiv.
DOI: 10.1101/2025.11.20.689494Papers that recently cited this model.
Ahmed Sadek, Nolwen L. Rey, Antonin Kunka, et al.
bioRxiv · Jun 2026
Dipayan Sarkar, Chiranjib Sarkar
bioRxiv · Jun 2026
Laurent Ludwig, Michael Chungyoun, Jeffrey J. Gray
bioRxiv · Jun 2026
The most-cited papers that cite this model.
Protenix Team, Milong Ren, Jinyuan Sun, et al.
bioRxiv · Dec 2025
Yuxuan Zhang, Chengyue Gong, Hanyu Zhang, et al.
bioRxiv · Feb 2026
James P. Roney, Chenxi Ou, Sergey Ovchinnikov
bioRxiv · Mar 2026
Alexus A. Smith, Edmund Wong, R. Donovan, et al.
bioRxiv · Feb 2026
Jacques Boitreaud, Robert Chen, Jack Dent, et al.
bioRxiv · Dec 2025
Share of papers citing this model.