All Competitors

Every biological foundation model, evaluated and ranked by the bio.rodeo team

Showing 18 of 8 filtered models

  • LLaVA-Rad

    58631.2K
    Microsoft ResearchFebruary 20, 2025chest_x_rayfoundation_modelimage_text_retrieval+5

    Lightweight 7B vision-language foundation model from Microsoft Research, released research-only under the Microsoft Research License, that generates radiology findings from chest X-rays.

    ImagingLanguage model
    35Openness
  • M3D

    442159873
    Beijing Academy of Artificial IntelligenceMarch 31, 2024ctimage_text_retrievalinstruction_tuning+9

    A multimodal large language model for 3D medical imaging, handling retrieval, report generation, VQA, positioning, and segmentation on CT volumes.

    ImagingLanguage model
    77Openness
  • CXR-CLIP

    121133
    Kakao BrainOctober 20, 2023bertchest_x_raycnn+8

    Large-scale chest X-ray vision-language pretraining model that learns image-report alignment for zero-shot and few-shot radiograph classification.

    Imaging
    18Openness
  • PMC-CLIP

    240
    Shanghai Jiao Tong UniversityMarch 13, 2023cnncontrastive_learninghistology+7

    A biomedical vision-language model trained with contrastive learning on 1.6M image-caption pairs (PMC-OA) mined from PubMed Central open-access articles.

    PathologyImaging
    63Openness
  • PTUnifier

    7852
    Chinese University of Hong Kong, Shenzhen +2 othersFebruary 17, 2023chest_x_rayfoundation_modelimage_text_retrieval+8

    Prompt-based medical vision-language pretraining that unifies fusion-encoder and dual-encoder architectures, handling image-only, text-only, and image-text inputs in one model.

    PathologyLanguage model
    56Openness
  • M3AE

    132183
    Shenzhen Research Institute of Big Data +2 othersSeptember 15, 2022autoencoderimage_text_retrievalmultimodal+5

    Self-supervised medical vision-and-language pretraining via multi-modal masked autoencoders that reconstruct masked image patches and text tokens.

    PathologyLanguage model
    29Openness
  • Shenzhen Research Institute of Big Data +2 othersSeptember 15, 2022chest_x_rayfoundation_modelimage_text_retrieval+7

    Knowledge-enhanced medical vision-and-language pre-training framework that aligns, reasons over, and learns from structured medical knowledge for radiology image-text tasks.

    ImagingLanguage model
    29Openness
  • PubMedCLIP

    18329414K
    Hasso Plattner InstituteDecember 27, 2021cnncontrastive_learninghistology+8

    A CLIP model fine-tuned on ROCO medical image-caption pairs to provide a medical-domain visual encoder for tasks such as medical visual question answering.

    PathologyLanguage model
    75Openness