All Competitors

Every biological foundation model, evaluated and ranked by the bio.rodeo team

Showing 119 of 19 filtered models

  • ProtSent

    6
    Hebrew University of Jerusalem +1 otherMay 7, 2026contrastive_learningembeddingsproteomics+5

    Contrastively fine-tuned ESM-2 (35M and 150M) protein language models that produce general-purpose sequence embeddings where biological similarity maps to embedding proximity.

    Protein
    87Openness
  • GATSBI

    13
    Stanford UniversityApril 3, 2026embeddingsfunction_predictiongraph_attention_network+4

    A graph-attention model producing context-aware protein embeddings from protein-protein interaction, co-expression, and tissue networks, with biologically motivated data splits.

    Protein
    94Openness
  • Duke UniversityMarch 8, 2026embeddingsknowledge_distillationproteomics+3

    Reverse-distilled ESM-2 checkpoints (up to 15B) producing Matryoshka-style nested embeddings that scale consistently and reach state of the art on ProteinGym.

    Protein
    58Openness
  • ProtAlign

    Lawrence Livermore National LaboratoryMarch 6, 2026contrastive_learningcross_modal_retrievalembeddings+4

    A contrastive cross-modal encoder that aligns protein sequence (ESM-2) and structure (ProteinMPNN) representations into a shared embedding space for cross-modal retrieval.

    Protein
    35Openness
  • EnzPlacer

    Iowa State UniversityFebruary 23, 2026contrastive_learningec_number_predictionembeddings+6

    A contrastive-learning model that predicts the first three Enzyme Commission (EC) digits for enzymes whose exact (fourth-level) function was never seen during training.

    Protein
    59Openness
  • EVA

    101
    Scienta LabFebruary 10, 2026embeddingsfoundation_modelgene_expression+9

    Cross-species, multimodal foundation model of immunology and inflammation that harmonizes transcriptomics and histology into unified patient-level representations.

    Single-cellRNAPathology
    27Openness
  • TM-Vec 2

    1
    Arizona State UniversityFebruary 5, 2026embeddingshomology_detectionproteomics+3

    A distilled deep learning model that predicts structural similarity between proteins directly from sequence, reaching up to 258x speedups for large-scale homology search.

    Protein
    4Openness
  • MetagenBERT

    MetagenBERT AuthorsJanuary 5, 2026bertdnaembeddings+5

    A pipeline that builds whole-metagenome embeddings directly from raw DNA reads using genomic language models and FAISS k-means clustering, without taxonomic or functional annotation.

    DNA & Gene
    22Openness
  • MicroGenomer

    BGI ResearchDecember 29, 2025embeddingsfoundation_modelgenomics+6

    A 470M-parameter microbial genome foundation model trained hierarchically on 234.5B bp for multi-scale genomic understanding and ecophysiological trait prediction.

    DNA & Gene
    44Openness
  • vir2vec

    University of FloridaDecember 12, 2025embeddingsfoundation_modelgenomics+6

    A 422M-parameter pan-viral genomic language model that produces fixed genome-level embeddings reused across viral classification tasks without re-training.

    DNA & Gene
    53Openness
  • MNCN-CSICJuly 9, 2025bertembeddingsfoundation_model+4

    A BERT-style transformer language model built on the Geneformer framework and trained on zebrafish single-cell transcriptomics to produce gene and cell embeddings for developmental analysis.

    Single-cell
    46Openness
  • Microsoft ResearchOctober 31, 2024co_evolutionembeddingsfoundation_model

    A transformer protein language model using integrative co-evolutionary pre-training to capture both short-range and long-range residue interactions from sequence alone.

    Protein
    10Openness
  • DNABERT-S

    1304611.7K
    MAGICS LabFebruary 13, 2024contrastive_learningdnaembeddings+3

    Species-aware DNA embedding model built on DNABERT-2, using contrastive learning to cluster and differentiate genomic sequences by species without labeled data.

    DNA & Gene
    53Openness
  • Ankh

    248696.7K
    Technical University of MunichJanuary 16, 2023efficient_inferenceembeddingsfoundation_model+1

    Optimized protein language model that surpasses state-of-the-art performance with fewer than 10% of the parameters of comparable models.

    Protein
    24Openness
  • CARP

    259
    Microsoft ResearchMay 19, 2022embeddingsfoundation_modelvariant_effect_prediction

    CNN-based protein language model series showing convolutions match transformer performance on sequence pretraining while scaling linearly with sequence length.

    Protein
    81Openness
  • ProtTrans

    1.3K1.3K
    RostlabAugust 1, 2021embeddingsfoundation_modelself_supervised+1

    A suite of six protein language models — including ProtBERT and ProtT5 — trained on up to 393 billion amino acids using large-scale HPC infrastructure.

    Protein
    71Openness
  • ESM-1b

    4.1K17K
    Meta AIApril 5, 2021embeddingsfoundation_modelvariant_effect_prediction

    Transformer protein language model trained on 250 million protein sequences that learns structural and functional representations without supervision.

    Protein
    71Openness
  • TAPE

    7391K
    UC BerkeleyJune 19, 2019benchmarkembeddingsevaluation+1

    Benchmark suite of five biologically relevant tasks for evaluating protein sequence representation learning, covering structure, homology, and engineering.

    Protein
    89Openness
  • UniRep

    3651.1K
    Church LabJanuary 1, 2019embeddingsfoundation_model

    A multiplicative LSTM protein language model trained on 24M sequences to produce fixed-length embeddings for protein engineering and function prediction.

    Protein
    49Openness