All Competitors
Every biological foundation model, evaluated and ranked by the bio.rodeo team
Showing 1–24 of 68 filtered models
LDARNet
—1—A 120M-parameter genomic foundation model that learns adaptive DNA token boundaries via H-Net-style dynamic chunking instead of fixed k-mer or byte-pair tokenization.
DNA & Gene26OpennessDanioDecima
———A zebrafish DNA sequence-to-function model predicting cell-type-specific single-cell expression across 85 cell-type x developmental-timepoint combinations during embryogenesis.
DNA & GeneSingle-cell22OpennessDamageFormer
1——Multimodal deep-learning framework that detects and localizes DNA lesions directly from native nanopore sequencing, built on the damage-aware LesionBERT foundation model.
DNA & Gene45OpennessBio-BLIP
———A multimodal Q-former that fuses DNA sequence, gene context, protein function, and text into a prefix for a frozen LLM, enabling zero-shot genetic variant interpretation.
DNA & GeneLanguage model23OpennessWisteria
———A pretrained DNA language model combining Mamba state-space layers, gated dilated convolutions, and Fourier-based attention to capture multi-scale genomic regulatory patterns.
DNA & Gene10OpennessCarbon
193—7.4KAn open autoregressive genomic foundation model (0.5B–8B params) with a 6-mer DNA tokenizer, matching Evo2-7B win rates at far higher throughput.
DNA & Gene93OpennessMIMIC
30——Generative multimodal foundation model that jointly models DNA, RNA, protein, and cellular context across six biological modalities, with SOTA splicing prediction.
RNAProteinDNA & Gene16OpennessGenNA
———An autoregressive nucleotide-and-text foundation model pretrained on ~416B characters from 2,221 eukaryotic species for natural-language-guided conditional generation of DNA and RNA sequences.
DNA & GeneRNA16OpennessOneGenome-Rice
19—135A 1.25B-parameter Mixture-of-Experts genomic foundation model for rice, pretrained on 422 Oryza genomes with a 1 Mbp context window.
DNA & Gene90OpennessDeep-Plant
1——A supervised, chromatin-informed foundation model that predicts regulatory activity directly from plant genomic sequence in Arabidopsis and rice.
DNA & Gene87OpennessGenoJEPA
———Beijing University of Posts and TelecommunicationsApril 6, 2026foundation_modelgenomicsrepresentation_learning+4A genomic foundation model that learns DNA representations through joint-embedding prediction in latent space rather than nucleotide reconstruction.
DNA & Gene22OpennessPlantCAD2
894—A long-context plant DNA language model (676M params, Mamba2) pretrained on 65 angiosperm genomes for cross-species functional annotation.
DNA & Gene69OpennessGenerative discrete-diffusion model that designs regulatory DNA with tunable activity and learns activity-predictive representations rivaling genomic language models.
DNA & Gene49OpennessPatchDNA
—2—A DNA language model that replaces fixed tokenization with conservation-guided patching, letting models up to 10x smaller match or beat state-of-the-art genomic benchmarks.
DNA & Gene33OpennessFishMamba-1
——9Institute of Hydrobiology, Chinese Academy of SciencesMarch 9, 2026dnafoundation_modelgenome_annotation+4Genomic foundation model for Cypriniformes fish, built on a Mamba-2 state space model with a 32 kb context window for long-range genome modeling.
DNA & Gene50OpennessPopformer
———Self-supervised transformer for population genetics, pretrained on 1000 Genomes data, that learns selection signatures via site- and haplotype-wise attention.
DNA & Gene19OpennessD3LM
—1—A bidirectional masked discrete diffusion language model over DNA, initialized from Nucleotide Transformer v2, that unifies DNA understanding and generation.
DNA & Gene58OpennessresLens
—1—George Washington UniversityFebruary 16, 2026antibiotic_resistance_gene_detectiongenomicslanguage_model+4A family of genomic language models fine-tuned from a pretrained DNA language model to detect and classify antibiotic resistance genes beyond reference-database limits.
DNA & Gene11OpennessdnaHNet
—1—A tokenizer-free, hierarchical autoregressive genomic foundation model that adaptively chunks raw nucleotides, enabling efficient long-context learning and zero-shot variant and gene predictions.
DNA & Gene12OpennessAntigenLM
———A structure-aware generative DNA language model pretrained on influenza genomes that forecasts future antigenic variants across regions and subtypes.
DNA & Gene5OpennessOpticalDNA
———An OCR-inspired vision-language model that renders DNA as visual layouts to analyze long genomic sequences with far fewer tokens than sequential tokenizers.
DNA & Gene16OpennessevoRate
———A genome language model that adds evolutionary-rate prediction as a pretraining task, improving representations and variant effect prediction over sequence-only training.
DNA & Gene14Openness