All Competitors

Every biological foundation model, evaluated and ranked by the bio.rodeo team

Showing 119 of 19 filtered models

  • Merlin

    41912712.5K
    Stanford UniversityJanuary 1, 2026cnncontrastive_learningct+8

    A 3D vision-language foundation model for abdominal CT that pretrains on paired scans, radiology reports, and structured EHR codes for zero-shot interpretation.

    ImagingLanguage model
    54Openness
  • NeuroVFM

    204
    University of Michigan +1 otherNovember 23, 2025ctfoundation_modeljoint_embedding_predictive_architecture+8

    A generalist neuroimaging vision foundation model pretrained on 5.24M clinical MRI and CT volumes for radiologic diagnosis and report generation.

    Imaging
    57Openness
  • Lingshu

    31554.1K
    DAMO Academy +1 otherJune 8, 2025clinical_reasoningfoundation_modelhistology+7

    A generalist medical multimodal LLM built on Qwen2.5-VL for unified medical image understanding, visual question answering, report generation, and clinical reasoning across 12+ imaging modalities.

    ImagingLanguage model
    70Openness
  • UniBiomed

    619215
    Hong Kong University of Science and Technology +2 othersApril 30, 2025foundation_modelhistologymultimodal+6

    Universal foundation model that jointly generates diagnostic text and segments the corresponding targets across ten biomedical imaging modalities.

    ImagingLanguage model
    64Openness
  • LLaVA-Rad

    58631.2K
    Microsoft ResearchFebruary 20, 2025chest_x_rayfoundation_modelimage_text_retrieval+5

    Lightweight 7B vision-language foundation model from Microsoft Research, released research-only under the Microsoft Research License, that generates radiology findings from chest X-rays.

    ImagingLanguage model
    35Openness
  • MINIM

    158127
    Peking University +2 othersFebruary 1, 2025data_augmentationdiffusionfoundation_model+9

    A self-improving text-to-image diffusion foundation model that generates synthetic medical images across multiple modalities and organs to augment downstream clinical AI tasks.

    Imaging
    41Openness
  • Brainfound

    3
    Tsinghua University +2 othersJanuary 10, 2025contrastive_learningcross_modality_translationdiffusion+9

    A multimodal vision-text foundation model for brain CT and MRI, pretrained on ~10M paired images and reports to act as a clinical copilot across seven imaging tasks.

    ImagingLanguage model
    7Openness
  • BiMediX2

    731620
    Mohamed bin Zayed University of Artificial IntelligenceDecember 10, 2024histologyinstruction_tuninglanguage_model+7

    A bilingual (Arabic-English) bio-medical large multimodal model built on Llama 3.1 for medical image understanding and clinical text conversation.

    Language modelImagingPathology
    11Openness
  • MedRegA

    452613
    Hong Kong University of Science and Technology +1 otherOctober 24, 2024histologyimage_classificationinstruction_tuning+8

    Region-aware bilingual (Chinese-English) medical multimodal LLM that handles image- and region-level vision-language tasks across eight imaging modalities.

    PathologyLanguage model
    65Openness
  • PULSE

    6427999
    The Ohio State University +1 otherOctober 21, 2024cardiologyecgecg_interpretation+6

    A multimodal LLM fine-tuned to interpret electrocardiogram images, trained on the >1M-sample ECGInstruct dataset and evaluated on the ECGBench benchmark.

    BiosignalsImaging
    84Openness
  • ECG-Chat

    8044
    China University of Geosciences +2 othersAugust 16, 2024cardiologycontrastive_learningdisease_classification+6

    A multimodal ECG-language model that aligns 12-lead ECG waveforms with clinical text for conversational cardiac diagnosis and automated report generation.

    BiosignalsLanguage model
    27Openness
  • LLaVA-Tri

    409843
    UC Santa Cruz +3 othersAugust 6, 2024foundation_modelhistologymultimodal+5

    A medical multimodal large language model pretrained on the 25M-image MedTrinity-25M dataset, achieving state-of-the-art accuracy on biomedical visual question answering.

    Language modelPathology
    30Openness
  • MedDr

    982655
    Hong Kong University of Science and TechnologyApril 23, 2024foundation_modelhistologyinstruction_tuning+7

    A 40B-parameter generalist medical vision-language foundation model spanning radiology, pathology, dermatology, retinography, and endoscopy.

    ImagingLanguage model
    69Openness
  • M3D

    442159873
    Beijing Academy of Artificial IntelligenceMarch 31, 2024ctimage_text_retrievalinstruction_tuning+9

    A multimodal large language model for 3D medical imaging, handling retrieval, report generation, VQA, positioning, and segmentation on CT volumes.

    ImagingLanguage model
    77Openness
  • CheXagent

    226711.3K
    Stanford UniversityJanuary 22, 2024chest_x_rayfoundation_modelimage_classification+7

    An instruction-tuned vision-language foundation model from Stanford for interpreting and summarizing chest X-rays across eight clinical task types.

    ImagingLanguage model
    32Openness
  • UniBrain

    39
    Shanghai Jiao Tong University +3 othersSeptember 13, 2023brain_mricnncontrastive_learning+6

    Hierarchical knowledge-enhanced vision-language pre-training model for universal brain MRI diagnosis across 10+ diseases from multi-modal scans and reports.

    Imaging
    35Openness
  • RadFM

    553227
    Shanghai Jiao Tong University +1 otherAugust 4, 2023disease_diagnosisfoundation_modelgenerative+7

    A generalist radiology foundation model that handles interleaved 2D and 3D medical scans with text for diagnosis, VQA, and report generation.

    ImagingLanguage model
    84Openness
  • PathAsst

    13395
    Westlake University +3 othersMay 24, 2023cytologyfoundation_modelhistology+7

    A multimodal generative AI assistant for pathology, pairing the PathCLIP vision encoder with a Vicuna-13B LLM and a toolkit of eight pathology-specific models.

    PathologyLanguage model
    17Openness
  • PTUnifier

    7852
    Chinese University of Hong Kong, Shenzhen +2 othersFebruary 17, 2023chest_x_rayfoundation_modelimage_text_retrieval+8

    Prompt-based medical vision-language pretraining that unifies fusion-encoder and dual-encoder architectures, handling image-only, text-only, and image-text inputs in one model.

    PathologyLanguage model
    56Openness