CALM-1.0

Contrastive antibody language model predicting antibody-antigen binding specificity from sequence with a dual-encoder, cross-attentive architecture.

Released: February 2026

CALM-1.0 (Contrastive Antibody Language Model) is a machine-learning framework for predicting antibody–antigen binding specificity directly from amino acid sequences. Determining which antibody binds which antigen is a central problem in immunology and therapeutic antibody discovery, and traditional approaches rely on experimental screening or structural modeling. CALM-1.0 instead frames antibody–antigen recognition as a sequence-to-sequence "molecular translation" problem, learning to align cognate pairs in a shared representation space.

Developed by the Reddy lab at ETH Zurich and posted as a bioRxiv preprint in February 2026, CALM-1.0 couples a dual-encoder design—separate encoders for antibody and antigen sequences—with a cross-attentive decoder. Contrastive learning pulls true binding pairs together and pushes non-binders apart in the embedding space, allowing the model to score and retrieve likely partners in either direction (antibody-to-antigen or antigen-to-antibody).

Note that CALM-1.0 is distinct from the similarly named codon language model "CaLM"; here the acronym refers to a contrastive model of antibody–antigen specificity rather than codon-level representation learning.

Key Features

Sequence-to-specificity prediction: Predicts antibody–antigen binding from amino acid sequence alone, without requiring experimentally determined structures.
Dual-encoder, cross-attentive decoder: Encodes antibody and antigen separately and couples them through cross-attention, treating recognition as molecular translation.
Contrastive alignment: Uses a contrastive objective to embed binding pairs in a shared space, enabling retrieval in both prediction directions.
Bidirectional retrieval: Supports querying for likely antigen partners given an antibody, or likely antibodies given an antigen.

Technical Details

CALM-1.0 is built on a dual-encoder plus cross-attentive decoder architecture trained with a contrastive learning objective that aligns cognate antibody–antigen pairs in a shared embedding space. The authors report training on 4,138 curated antibody–antigen pairs assembled from structural databases, and evaluate the model by retrieval on held-out sequences, reporting a mean top-1 retrieval rate of roughly 7% with performance demonstrated in both directions of prediction. The relatively small curated training set reflects the limited availability of paired antibody–antigen specificity data, which the contrastive formulation is designed to use efficiently.

Applications

CALM-1.0 is aimed at computational immunology and therapeutic antibody discovery, where identifying or prioritizing antibody–antigen pairs from sequence can accelerate candidate selection. By scoring and retrieving likely binding partners, it can help triage antibody repertoires against targets of interest, support epitope/partner hypothesis generation, and feed into broader antibody design pipelines—particularly in settings where structural data are unavailable.

Impact

CALM-1.0 contributes to the growing body of sequence-based antibody models by casting antibody–antigen specificity as a contrastive molecular-translation task with bidirectional retrieval. Its main current limitation is the scale of available paired training data: with a few thousand curated pairs and a modest top-1 retrieval rate, performance is an early proof of concept rather than a production tool. As a recent preprint, its results await peer review and independent benchmarking, and the framework would likely benefit substantially from larger curated specificity datasets.

Citation

Contrastive learning for antibody-antigen sequence-to-specificity prediction

Lee, H., et al. (2026) Contrastive learning for antibody-antigen sequence-to-specificity prediction. bioRxiv.

DOI: 10.64898/2026.02.25.707916

Recent citations

Papers that recently cited this model.

Generative Drug Design in a Loop with dtSFM
Sai T. Reddy
bioRxiv · Jun 2026
0
A Drug–Target Specificity Foundation Model for Off-target Prediction, Repurposing, and Generative Design
Sai T. Reddy
bioRxiv · Jun 2026
0
Vibe Coding Specificity Foundation Models
Sai T. Reddy
bioRxiv · Jun 2026
0

Top citations

The most-cited papers that cite this model.

A Drug–Target Specificity Foundation Model for Off-target Prediction, Repurposing, and Generative Design
Sai T. Reddy
bioRxiv · Jun 2026
0
Vibe Coding Specificity Foundation Models
Sai T. Reddy
bioRxiv · Jun 2026
0
Generative Drug Design in a Loop with dtSFM
Sai T. Reddy
bioRxiv · Jun 2026
0

Citations

Total Citations3

Influential0

References33

Fields of citing research

Biology100%
Computer Science100%
Medicine67%
Chemistry33%

Share of papers citing this model.

Openness

bio.rodeo opennessClosed · low usability and reproducibility

10Closed

Usability — can I run it?7

Reproducibility — can I retrain it?12

Model Openness Framework

Unclassified

Restrictive license on core components

Resources

Research Paper

Key Features

Sequence-to-specificity prediction: Predicts antibody–antigen binding from amino acid sequence alone, without requiring experimentally determined structures.

Dual-encoder, cross-attentive decoder: Encodes antibody and antigen separately and couples them through cross-attention, treating recognition as molecular translation.

Contrastive alignment: Uses a contrastive objective to embed binding pairs in a shared space, enabling retrieval in both prediction directions.

Bidirectional retrieval: Supports querying for likely antigen partners given an antibody, or likely antibodies given an antigen.

Technical Details

Applications

Impact

CALM-1.0

Key Features

Technical Details

Applications

Impact

Citation

Contrastive learning for antibody-antigen sequence-to-specificity prediction

Recent citations

Generative Drug Design in a Loop with dtSFM

A Drug–Target Specificity Foundation Model for Off-target Prediction, Repurposing, and Generative Design

Vibe Coding Specificity Foundation Models

Top citations

A Drug–Target Specificity Foundation Model for Off-target Prediction, Repurposing, and Generative Design

Vibe Coding Specificity Foundation Models

Generative Drug Design in a Loop with dtSFM

Citations

Fields of citing research

Openness

Tags

Resources

CALM-1.0

Key Features

Technical Details

Applications

Impact

Citation

Contrastive learning for antibody-antigen sequence-to-specificity prediction

Recent citations

Generative Drug Design in a Loop with dtSFM

A Drug–Target Specificity Foundation Model for Off-target Prediction, Repurposing, and Generative Design

Vibe Coding Specificity Foundation Models

Top citations

A Drug–Target Specificity Foundation Model for Off-target Prediction, Repurposing, and Generative Design

Vibe Coding Specificity Foundation Models

Generative Drug Design in a Loop with dtSFM

Citations

Fields of citing research

Openness

Tags

Resources

CALM-1.0

#Key Features

#Technical Details

#Applications

#Impact

Citation

Contrastive learning for antibody-antigen sequence-to-specificity prediction

Recent citations

Top citations

Related models

Citations

Fields of citing research

Openness

Tags

Resources

CALM-1.0

#Key Features

#Technical Details

#Applications

#Impact

Citation

Contrastive learning for antibody-antigen sequence-to-specificity prediction

Recent citations

Top citations

Related models

Citations

Fields of citing research

Openness

Tags

Resources

Key Features

Technical Details

Applications

Impact

Key Features

Technical Details

Applications

Impact