SaDiT

Protein backbone generator running a diffusion transformer over SaProt structural tokens, with an IPA token cache to speed up de novo design.

Released: February 2026

SaDiT (SaProt-tokenized Diffusion Transformer) is a generative framework for de novo protein backbone design that aims to make backbone diffusion both faster and more structurally reliable. Many leading backbone generators, such as RFdiffusion and Proteina, operate directly in continuous 3D coordinate space, which couples each denoising step to relatively expensive geometric computation. SaDiT instead represents protein geometry in a discrete latent space using structural tokenization derived from SaProt, then applies a diffusion transformer (DiT) over those tokens. The design is intended to reduce the cost of each generation step while preserving SE(3) equivariance.
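As a rough illustration of what "discrete structural tokens" means, the sketch below assigns each residue's feature vector to the index of its nearest entry in a small codebook. Everything here is an assumption made for illustration: the 2-D features, the four-entry codebook, and the function names `tokenize` and `detokenize` are hypothetical, and this is not SaProt's tokenizer or the encoder used in the preprint.

```python
import numpy as np

def tokenize(features: np.ndarray, codebook: np.ndarray) -> np.ndarray:
    """Return, for each row of `features`, the index of the nearest codebook row."""
    # Squared Euclidean distance between every residue (rows) and every code (columns).
    diffs = features[:, None, :] - codebook[None, :, :]
    dists = np.sum(diffs ** 2, axis=-1)
    return np.argmin(dists, axis=1)

def detokenize(tokens: np.ndarray, codebook: np.ndarray) -> np.ndarray:
    """Map token ids back to their codebook vectors (a lossy reconstruction)."""
    return codebook[tokens]

# Hypothetical codebook of four structural "states" in a 2-D feature space.
codebook = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])

# Hypothetical per-residue geometric features for a five-residue fragment.
features = np.array([
    [0.10, -0.05],
    [0.90, 0.10],
    [0.20, 0.80],
    [0.95, 1.10],
    [0.05, 0.90],
])

tokens = tokenize(features, codebook)
print(tokens)  # [0 1 2 3 2]
```

A generative model that works on `tokens` predicts one of a small number of categories per residue rather than continuous coordinates, which is the simplification the paragraph above describes.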

The method was introduced in February 2026 by Shentong Mo and Lanqing Li as an arXiv preprint. Its central engineering contribution is an IPA Token Cache mechanism that optimizes the Invariant Point Attention (IPA) layers by reusing computed token states across iterative sampling steps, cutting redundant computation during generation. Together, the discrete tokenization and cached IPA are intended to deliver state-of-the-art speed without sacrificing the structural viability of generated backbones.

SaDiT is a recent preprint. It reports comparisons against established baselines on standard backbone-generation tasks, but no public weights or code have been released, so its results should be read as preprint-stage claims pending independent reproduction.

Key Features

Discrete structural tokenization: Protein geometry is encoded into a discrete latent space via SaProt-style structural tokens, simplifying the generative target relative to continuous coordinate diffusion.
Diffusion transformer backbone: A DiT architecture denoises over structural tokens, bringing scalable transformer-based diffusion to backbone generation.
SE(3) equivariance: The framework is designed to maintain theoretical SE(3) equivariance so generated structures respect rotational and translational symmetry.
IPA Token Cache: A caching mechanism reuses computed Invariant Point Attention token states across sampling steps to accelerate iterative generation.
Conditional generation: The model supports both unconditional and fold-class conditional backbone generation.
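The SE(3) property listed above can be checked numerically on any candidate representation. The sketch below is a generic test, not part of SaDiT: it applies a random rotation and translation to toy coordinates and confirms that pairwise distances are unchanged (invariance) while the centroid moves with the transform (equivariance). The helper names and the 10-point toy structure are assumptions for illustration.

```python
import numpy as np

def random_rotation(rng: np.random.Generator) -> np.ndarray:
    """Draw a 3x3 rotation matrix (orthogonal, determinant +1)."""
    # QR of a Gaussian matrix yields an orthogonal Q; normalise column signs.
    q, r = np.linalg.qr(rng.normal(size=(3, 3)))
    q = q * np.sign(np.diag(r))
    if np.linalg.det(q) < 0:
        q[:, 0] = -q[:, 0]  # flip one axis to turn a reflection into a rotation
    return q

def pairwise_distances(coords: np.ndarray) -> np.ndarray:
    """All-against-all Euclidean distances for an (n, 3) coordinate array."""
    diffs = coords[:, None, :] - coords[None, :, :]
    return np.linalg.norm(diffs, axis=-1)

rng = np.random.default_rng(0)
coords = rng.normal(size=(10, 3))   # toy stand-in for C-alpha positions
R = random_rotation(rng)
t = rng.normal(size=3)
moved = coords @ R.T + t            # rigid-body transform of every point

# Invariant quantity: distances do not change under rotation + translation.
invariant_ok = np.allclose(pairwise_distances(coords), pairwise_distances(moved))

# Equivariant quantity: the centroid transforms exactly like the points do.
equivariant_ok = np.allclose(moved.mean(axis=0), coords.mean(axis=0) @ R.T + t)

print(invariant_ok, equivariant_ok)  # True True
```

Tokens computed only from invariant quantities stay the same however the input structure is posed, which is one common route to a symmetry-respecting pipeline.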

Technical Details

SaDiT couples a SaProt-derived structural tokenizer with a diffusion transformer that denoises in the discrete token space rather than over raw 3D coordinates. The reported headline contribution is efficiency: the IPA Token Cache reuses Invariant Point Attention states during iterative sampling, and the discrete formulation reduces per-step cost. The authors report that SaDiT outperforms state-of-the-art models including RFdiffusion and Proteina in both computational speed and structural viability across unconditional and fold-class conditional generation, with particular strength in capturing complex topological features. The preprint does not disclose a parameter count and, at the time of writing, no public weights or code accompany it, so benchmark numbers reflect the authors' own evaluation.
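The general idea of reusing per-token states across sampling steps can be sketched as follows. This is a toy under stated assumptions, not the preprint's IPA Token Cache: the class `TokenStateCache` is hypothetical, the reuse rule (recompute only positions whose token changed since the previous step) is an assumption, and the "expensive" function is a context-free embedding lookup, so caching is exact here. Real attention states depend on the other tokens, so a cache of this kind would be an approximation.

```python
import numpy as np

class TokenStateCache:
    """Recompute a per-position state only where the token changed since the last call."""

    def __init__(self, compute_state):
        self.compute_state = compute_state  # stand-in for an expensive layer
        self.tokens = None                  # tokens seen on the previous step
        self.states = None                  # cached per-position states
        self.recomputed = 0                 # positions actually recomputed

    def __call__(self, tokens: np.ndarray) -> np.ndarray:
        tokens = np.asarray(tokens)
        if self.tokens is None or self.tokens.shape != tokens.shape:
            # First step (or length change): nothing to reuse.
            self.states = self.compute_state(tokens)
            self.recomputed += tokens.size
        else:
            stale = tokens != self.tokens
            if stale.any():
                self.states[stale] = self.compute_state(tokens[stale])
                self.recomputed += int(stale.sum())
        self.tokens = tokens.copy()
        return self.states

rng = np.random.default_rng(0)
embedding = rng.normal(size=(20, 4))        # hypothetical per-token state table
cache = TokenStateCache(lambda ids: embedding[ids])

# Hypothetical token sequences at four successive sampling steps.
steps = [
    np.array([5, 5, 5, 5, 5, 5]),
    np.array([5, 2, 5, 5, 9, 5]),
    np.array([5, 2, 5, 5, 9, 5]),
    np.array([1, 2, 5, 5, 9, 5]),
]
for tokens in steps:
    states = cache(tokens)

naive = sum(t.size for t in steps)
print(cache.recomputed, naive)  # 9 24
```

In this toy run only 9 of 24 per-position computations are performed, because most tokens stay fixed between consecutive steps. Whether and how much such reuse helps in practice depends on how quickly tokens settle during sampling.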

Applications

SaDiT targets de novo protein backbone design, the first stage of many computational protein-engineering pipelines, where a generated backbone is subsequently sequence-designed (e.g., with an inverse-folding model) and validated. Faster, fold-conditioned backbone sampling is useful for scaffold generation, exploring topological space, and producing diverse candidate structures for downstream design campaigns in research settings.

Impact

SaDiT contributes to a growing line of work that moves protein backbone diffusion from continuous coordinate space into learned discrete structural-token spaces, trading some geometric directness for transformer scalability and sampling speed. If its reported speed and viability gains over RFdiffusion and Proteina hold up under independent evaluation, the IPA Token Cache and tokenized diffusion design could inform future efficient generators. Because SaDiT is a February 2026 preprint without released weights, its real-world adoption and reproducibility remain open questions.

Citation

SaDiT: Efficient Protein Backbone Design via Latent Structural Tokenization and Diffusion Transformers

Preprint

Mo, S. & Li, L. (2026) SaDiT: Efficient Protein Backbone Design via Latent Structural Tokenization and Diffusion Transformers. arXiv.org.

DOI: 10.48550/arXiv.2602.06706

Recent citations

Papers that recently cited this model.

Be Your Own Teacher: Steering Protein Language Models via Unsupervised Reward Optimization
Lanqing Li, Shentong Mo, Yang Yu, et al.
Jun 2026
0

Top citations

The most-cited papers that cite this model.

Be Your Own Teacher: Steering Protein Language Models via Unsupervised Reward Optimization
Lanqing Li, Shentong Mo, Yang Yu, et al.
Jun 2026
0

Citations

Total Citations: 1

Influential: 0

References: 13

Fields of citing research

Biology: 100%
Computer Science: 100%

Share of papers citing this model.

Openness

bio.rodeo openness: Closed · low usability and reproducibility

5 · Closed

Usability (can I run it?): 7

Reproducibility (can I retrain it?): 0 (not reproducible)

Model Openness Framework

Unclassified

Restrictive license on core components

Resources

Research Paper
