AlphaFold-Multimer

Protein complex structure prediction model extending AlphaFold 2 with paired MSA processing and ipTM scoring for multi-chain, multimeric assemblies.

Released: October 2021

AlphaFold-Multimer is an extension of AlphaFold 2 developed by Richard Evans, Michael O'Neill, Alexander Pritzel, Natasha Antropova, Andrew Senior, Tim Green, Augustin Zidek, Russ Bates, Sam Blackwell, Jason Yim, and colleagues at Google DeepMind, with a preprint deposited on bioRxiv in October 2021. Where AlphaFold 2 was trained and optimized for single-chain protein structure prediction, AlphaFold-Multimer specifically addresses the challenge of predicting how multiple protein chains come together to form functional complexes. Protein complexes — from simple homodimers to large hetero-oligomeric assemblies — underlie virtually all of molecular biology, and accurately predicting their structures poses distinct challenges beyond what single-chain folding requires.

The core problem AlphaFold-Multimer addresses is that the original AlphaFold 2 architecture, while it can be applied to multi-chain inputs in an ad hoc fashion, was not designed to encode cross-chain evolutionary information or to reason about the symmetry and permutational equivalence of identical subunits. Protein complexes have coevolved interfaces: residues on one chain that contact a partner chain tend to vary in correlated ways across evolution, and this co-evolutionary signal encodes the geometry of the interface. Extracting this signal requires carefully pairing sequences from the same organism across the chains of a heteromeric complex in the MSA — a non-trivial alignment task that the standard AlphaFold pipeline does not perform.

AlphaFold-Multimer was released as part of the same open-source AlphaFold GitHub repository used for AlphaFold 2, making it immediately accessible to the structural biology community with the same infrastructure. It went on to become one of the most widely used tools for protein complex structure prediction, serving as the baseline that subsequent multi-chain prediction methods including AF2-Multimer v2 and v3 improvements, AFsample, and AlphaFold 3 were benchmarked against and built upon.

Key Features

Paired MSA construction: Constructs paired multiple sequence alignments between chains by matching sequences from the same organism using species annotation and disambiguating by genetic distance (for prokaryotes) or similarity to target sequences (for eukaryotes), capturing cross-chain co-evolutionary information that encodes interface geometry.
Permutation symmetry handling: Modifies the loss function to account for permutation symmetry among identical chains in homomeric and hetero-symmetric complexes, enabling the model to correctly handle assemblies where chains are interchangeable.
Interface-specific confidence metric (ipTM): Introduces the interface predicted TM-score (ipTM), which estimates the accuracy of predicted inter-chain contacts, complementing the per-chain pTM score. The combined ranking score (ipTM × 0.8 + pTM × 0.2) correlates well with actual complex accuracy and guides model selection.
Multimer-specific training data: Trained on multi-chain entries from the Protein Data Bank, with dedicated procedures for selecting and cropping residue subsets to ensure the training distribution covers diverse interface types and stoichiometries.
High homomeric and heteromeric accuracy: Successfully predicts interfaces (DockQ ≥ 0.23) in approximately 70-72% of homomeric and heteromeric benchmark cases, with high-accuracy predictions (DockQ ≥ 0.8) in 26-36% of cases depending on complex type.
Direct integration with AlphaFold 2 codebase: Available as part of the same open-source AlphaFold repository, sharing infrastructure, weights, and tooling, and enabling straightforward comparison with single-chain predictions for the same protein.

Technical Details

AlphaFold-Multimer shares the same fundamental architecture as AlphaFold 2 — a 93-million-parameter network built around the 48-block Evoformer stack and the Invariant Point Attention-based Structure Module — but introduces several targeted modifications for multi-chain inputs. The most critical is MSA pairing: for heteromeric complexes, sequences from the MSAs of individual chains are paired by matching organisms using UniProt species annotation. When multiple sequences from the same species exist for a chain, candidates are ranked by similarity to the respective target sequence, and pairs of equal rank are concatenated into cross-chain rows of the MSA. Both paired (cross-chain) and unpaired (single-chain) MSA rows are used at training and inference, allowing the model to draw on both intra-chain evolutionary information and cross-chain co-evolution signals simultaneously.

The model also modifies residue cropping during training to preferentially sample interface residues, ensuring that interface contacts are well-represented in training batches rather than diluted by large internal regions. Loss function modifications include terms for inter-chain distances and a permutation-equivariant formulation that allows the model to correctly score symmetric assemblies without being sensitive to arbitrary chain ordering in the input. At inference, the model outputs five candidate complex structures per submission (sampled from different random seeds), ranked by the composite ipTM + pTM confidence score. On benchmark datasets of heterodimers without templates, AlphaFold-Multimer achieves at least medium accuracy (DockQ ≥ 0.49) on approximately 70% of cases and high accuracy (DockQ ≥ 0.8) on approximately 26%. Performance is generally stronger for homomers than heteromers, consistent with the greater availability of co-evolutionary signal within a single chain's MSA for homomeric interfaces.

Applications

AlphaFold-Multimer is broadly used in any context where the structure of a protein complex is needed and experimental data are unavailable or insufficient. Drug discovery teams use it to model drug target proteins in their biologically relevant oligomeric state — a membrane receptor dimer, a protease-inhibitor complex, or an enzyme-cofactor assembly — where monomeric models may be misleading. Structural biologists use AlphaFold-Multimer predictions as molecular replacement models for phasing X-ray crystallography data collected on multi-chain assemblies and as initial models for fitting cryo-EM density maps of large complexes. Biologists studying protein-protein interaction networks use it to generate structural hypotheses for binary interactions identified in pull-down or two-hybrid experiments, prioritizing pairs for follow-up experimental validation. In immunology, AlphaFold-Multimer is widely applied to predict antibody-antigen complexes, MHC-peptide-TCR ternary complexes, and cytokine-receptor assemblies. Systems biologists model entire pathway-relevant assemblies — signaling complexes, scaffolding protein networks — to understand how mutations at one chain's interface alter the geometry of adjacent chains.

Impact

AlphaFold-Multimer substantially expanded the scope of accurate computational structural biology from individual proteins to multi-chain assemblies, enabling structural hypotheses about protein-protein interactions at proteome scale. Its release alongside the AlphaFold 2 open-source codebase ensured immediate, broad adoption with minimal infrastructure barrier. The ipTM confidence metric introduced by AlphaFold-Multimer has become a standard for evaluating predicted complex quality and is routinely reported alongside multimer predictions in the literature. The model served as the primary reference for CASP15's multimer prediction challenge (2022), where teams built upon and compared against it extensively. Subsequent improvements — AlphaFold-Multimer v2 (2022) and v3 updates incorporated into the AlphaFold GitHub releases — further improved interface accuracy, particularly for difficult cases with sparse MSAs. AlphaFold 3 eventually superseded AlphaFold-Multimer for many use cases by adopting a diffusion-based architecture with broader molecular coverage, but AlphaFold-Multimer remains the most widely cited and deployed tool for protein complex prediction due to its established open-source availability, well-characterized performance, and direct integration with the AlphaFold ecosystem. Key limitations include reduced accuracy for complexes with transient or flexible interfaces, for very large assemblies where chain cropping at training limits coverage of distant inter-chain contacts, and for complexes between proteins with sparse MSAs where the co-evolutionary signal that drives interface prediction is weak.

Citation

Protein complex prediction with AlphaFold-Multimer

Preprint

Evans, R., et al. (2021) Protein complex prediction with AlphaFold-Multimer. bioRxiv.

DOI: 10.1101/2021.10.04.463034

Recent citations

Papers that recently cited this model.

ConformationLab studio: local AlphaFold2-based protein structure prediction on macOS using ColabFold
Spike Murphy Müller
SoftwareX · Sep 2026
0
Peptide ligand discovery of G protein-coupled receptors
J. Hermes, Marin Matic, Ho Yan Yeung, et al.
Nature Reviews Methods Primers · Jul 2026
0
Minimal Data · Maximal Insight (MDMI): A Structure-guided Pipeline for Discovering Functional Alternatives in Peptide-Protein Interfaces
P. Bayat, Spencer J. Perkins, Sebastian Clancy, et al.
bioRxiv · Jul 2026
0

Top citations

The most-cited papers that cite this model.

Accurate structure prediction of biomolecular interactions with AlphaFold 3
Josh Abramson, Jonas Adler, Jack Dunger, et al.
Nature · May 2024
11.2KInfluential
ColabFold: making protein folding accessible to all
M. Mirdita, S. Ovchinnikov, Martin Steinegger
Nature Methods · May 2022
7.5K
Evolutionary-scale prediction of atomic level protein structure with a language model
Zeming Lin, Halil Akin, Roshan Rao, et al.
bioRxiv · Dec 2022
4.7K
Improved prediction of protein-protein interactions using AlphaFold2
P. Bryant, G. Pozzati, A. Elofsson
Nature Communications · Oct 2021
906
Generalized Biomolecular Modeling and Design with RoseTTAFold All-Atom
Rohith Krishna, Jue Wang, Woody Ahern, et al.
bioRxiv · Oct 2023
880

Citations

Total Citations3.2K

Influential518

References46

GitHub

Stars14.8K

Forks2.9K

Open Issues307

Contributors21

Last Push3mo ago

LanguagePython

LicenseApache-2.0

Fields of citing research

Biology11%
Medicine10%
Computer Science4%
Chemistry4%
Environmental Science1%
Materials Science1%
Engineering0%
Physics0%

Share of papers citing this model.

Openness

bio.rodeo opennessOpen weights · open weights, closed recipe

59Partial

Usability — can I run it?100

Reproducibility — can I retrain it?16

open weights, closed recipe

Model Openness Framework

Class III

Open Model

Resources

GitHub Repository Research Paper Official Website Documentation

Key Features

Paired MSA construction: Constructs paired multiple sequence alignments between chains by matching sequences from the same organism using species annotation and disambiguating by genetic distance (for prokaryotes) or similarity to target sequences (for eukaryotes), capturing cross-chain co-evolutionary information that encodes interface geometry.

Permutation symmetry handling: Modifies the loss function to account for permutation symmetry among identical chains in homomeric and hetero-symmetric complexes, enabling the model to correctly handle assemblies where chains are interchangeable.

Interface-specific confidence metric (ipTM): Introduces the interface predicted TM-score (ipTM), which estimates the accuracy of predicted inter-chain contacts, complementing the per-chain pTM score. The combined ranking score (ipTM × 0.8 + pTM × 0.2) correlates well with actual complex accuracy and guides model selection.

Multimer-specific training data: Trained on multi-chain entries from the Protein Data Bank, with dedicated procedures for selecting and cropping residue subsets to ensure the training distribution covers diverse interface types and stoichiometries.

High homomeric and heteromeric accuracy: Successfully predicts interfaces (DockQ ≥ 0.23) in approximately 70-72% of homomeric and heteromeric benchmark cases, with high-accuracy predictions (DockQ ≥ 0.8) in 26-36% of cases depending on complex type.

Direct integration with AlphaFold 2 codebase: Available as part of the same open-source AlphaFold repository, sharing infrastructure, weights, and tooling, and enabling straightforward comparison with single-chain predictions for the same protein.

Technical Details

Applications

Impact

AlphaFold-Multimer

#Key Features

#Technical Details

#Applications

#Impact

Citation

Protein complex prediction with AlphaFold-Multimer

Recent citations

Top citations

Related models

Citations

GitHub

Fields of citing research

Openness

Tags

Resources

AlphaFold-Multimer

#Key Features

#Technical Details

#Applications

#Impact

Citation

Protein complex prediction with AlphaFold-Multimer

Recent citations

Top citations

Related models

Citations

GitHub

Fields of citing research

Openness

Tags

Resources

Key Features

Technical Details

Applications

Impact

Key Features

Technical Details

Applications

Impact