RNA foundation models
RNA

RNA Models

RNA structure, function, and expression modeling

52 models in this category

What RNA foundation models do

RNA foundation models learn the sequence, structural, and functional diversity of the transcriptome, tackling tasks that range from secondary structure prediction and splicing code modeling to non-coding RNA classification and mRNA design optimization. Unlike DNA, RNA folds into complex functional structures — hairpins, pseudoknots, ribozyme active sites — meaning that structure-aware representations are often essential rather than optional. Models like RNA-FM are pretrained on tens of millions of RNA sequences spanning multiple RNA families, giving them broad coverage of the transcriptome's functional landscape.

Applications: splicing, structure, and therapeutic design

Splicing prediction is one of the highest-demand applications: models trained on pre-mRNA sequences can score cryptic splice site activation by variants, a key step in interpreting pathogenic mutations. Secondary structure prediction benchmarks like bpRNA and ArchiveII are standard evaluation sets for RNA structure models. On the therapeutic side, RNA foundation models are increasingly applied to mRNA vaccine design — optimizing codon usage, secondary structure, and untranslated region elements to improve stability and expression.

Notable Models

Top-rated rna models from our evaluations

RNA-FM

AI for Science (PKU)

Released August 6, 2022

239378

A BERT-based RNA foundation model trained on 23.7 million non-coding RNA sequences, producing embeddings for structure prediction, functional annotation, and RNA design.

RNA
62Openness

RNABERT

Keio University

Released February 22, 2022

1267.3K56

A BERT-based model for RNA base embeddings that captures sequence context and secondary structure, enabling fast structural alignment and clustering.

RNA
34Openness

gRNAde

MRC Laboratory of Molecular Biology +1 other

Released December 1, 2025

Generative RNA inverse-design model that produces sequences conditioned on a target 3D backbone, secondary structure, and partial sequence constraints.

RNA
98Openness

RhoFold+

ml4bio

Released November 1, 2024

197143K237

End-to-end RNA 3D structure prediction combining the RNA-FM language model with Invariant Point Attention, achieving SOTA on RNA-Puzzles and CASP15.

RNA
65Openness

ERNIE-RNA

Tsinghua University

Released March 17, 2024

325.1K43

A structure-enhanced RNA language model that incorporates base-pairing constraints into self-attention, achieving state-of-the-art RNA structure and function prediction.

RNA
46Openness

A fully open structure-guided RNA foundation model pretrained on ~21M RNA sequences paired with secondary structures, enabling robust structural and functional inference.

RNA
92Openness

Frequently asked questions

What is an RNA foundation model?

An RNA foundation model is a large neural network pretrained on collections of RNA sequences — coding and non-coding — to learn representations of RNA sequence, structure, and function. These representations support downstream tasks including secondary structure prediction, splicing classification, and functional annotation of non-coding RNAs. RNA-FM is a well-known example trained on diverse RNA families.

How do RNA foundation models handle secondary structure?

Some RNA models, like those benchmarked on bpRNA, are trained with explicit base-pair annotations and learn to predict contact maps or dot-bracket structures directly. Others learn structural priors implicitly through sequence co-evolution patterns, similar to how protein language models capture contact information. The best approaches for therapeutic design often combine a pretrained sequence encoder with a structure-prediction decoder fine-tuned on experimentally determined structures.

Can RNA foundation models help with mRNA vaccine design?

Yes, and this has become a commercially relevant application. Optimizing codon usage, 5' and 3' UTR sequences, and mRNA secondary structure for stability and translation efficiency are all tasks where foundation model embeddings and generative models have shown promise. Several groups have reported that model-guided design outperforms simple codon optimization heuristics on in vitro stability benchmarks, though large-scale clinical validation is still limited.

What distinguishes RNA models from DNA models?

RNA models must account for the fact that RNA folds into functional three-dimensional structures with strong sequence-structure coupling — a constraint less central to most DNA modeling tasks. RNA models also often span diverse functional classes (mRNA, tRNA, rRNA, lncRNA, miRNA) that have very different sequence statistics, making broad pretraining corpora and multi-task objectives more important than in narrower DNA regulatory sequence models.