All Competitors
Every biological foundation model, evaluated and ranked by the bio.rodeo team
Showing 1–22 of 22 filtered models
PepForge
4——A hierarchical three-stage cascade that generates chemically modified and macrocyclic peptides in HELM notation, supporting de novo design and constrained infilling.
ProteinSmall molecule94OpennessDamageFormer
1——Multimodal deep-learning framework that detects and localizes DNA lesions directly from native nanopore sequencing, built on the damage-aware LesionBERT foundation model.
DNA & Gene45OpennessA 110M-parameter multimodal RNA language model that designs RNA sequences from secondary structure, consensus, and Gene Ontology constraints via discrete diffusion.
RNA48OpennessPeptideCLM-2
8——SMILES-based chemical language models pretrained on 100M+ molecules to natively represent therapeutic peptide chemistry, including non-canonical residues.
Small moleculeProtein79OpennessOncoBERT
———BERT-style language model that learns contextual representations of somatic mutations from clinical sequencing of >210,000 patients for cancer subtyping and therapy response.
DNA & Gene7OpennessNUWA
———An mRNA language foundation model trained on ~115M coding sequences across the tree of life for unified mRNA sequence perception and generation.
RNADNA & Gene16OpennessMetagenBERT
———A pipeline that builds whole-metagenome embeddings directly from raw DNA reads using genomic language models and FAISS k-means clustering, without taxonomic or functional annotation.
DNA & Gene22OpennessISTS
———Pan-cancer multi-omic BERT-like foundation model that jointly encodes CpG-island DNA methylation and RNA-seq for zero-shot cancer classification and mutation prediction.
Single-cellDNA & Gene20OpennessstructRFM
353210University of Science and Technology of ChinaAugust 7, 2025bertfoundation_modelhomology_classification+9A fully open structure-guided RNA foundation model pretrained on ~21M RNA sequences paired with secondary structures, enabling robust structural and functional inference.
RNA92OpennessZebraformer
—1—A BERT-style transformer language model built on the Geneformer framework and trained on zebrafish single-cell transcriptomics to produce gene and cell embeddings for developmental analysis.
Single-cell46OpennessBERT-based model pretrained on 15-state ROADMAP chromatin annotations across 127 human cell types to uncover chromatin-state motifs and predict gene expression.
DNA & Gene86OpennessMammo-CLIP
94—17Vision-language foundation model pre-trained on screening mammogram-report pairs to improve data efficiency and robustness in breast cancer detection.
ImagingPathology27OpennessCaLM
5443—Codon-level BERT model that captures genomic signals invisible to amino acid models, outperforming billion-parameter PLMs with just 86M parameters.
Protein66OpennessCXR-CLIP
121133—Large-scale chest X-ray vision-language pretraining model that learns image-report alignment for zero-shot and few-shot radiograph classification.
Imaging18OpennessSpecies-Aware DNA LM
2952183DNA language model trained on over 800 species spanning 500 million years of evolution to capture conserved regulatory elements and their evolution beyond sequence alignment.
DNA & Gene76OpennessSpliceBERT
551—A BERT-based RNA language model pre-trained on 2M+ pre-mRNA sequences from 72 vertebrate species for splicing prediction and variant effect analysis.
RNA77OpennessGeneBERT
—27—Multi-modal self-supervised model pre-trained on regulatory genome sequences and transcription factor binding matrices for cell-type-specific regulatory prediction.
DNA & Gene18Openness