All Competitors
Every biological foundation model, evaluated and ranked by the bio.rodeo team
Showing 1–11 of 11 filtered models
LLaVA-Rad
58631.2KLightweight 7B vision-language foundation model from Microsoft Research, released research-only under the Microsoft Research License, that generates radiology findings from chest X-rays.
ImagingLanguage model35OpennessM3FM
1936—Multimodal, multidomain, multilingual medical foundation model that performs zero-shot clinical diagnosis and report generation from chest X-ray and CT images across English and Chinese.
ImagingLanguage model60OpennessMaCo
1272—A masked contrastive chest X-ray foundation model that aligns radiograph patches with report text for zero-shot and fine-grained diagnosis.
Imaging74OpennessMAIRA-2
—1393.3KMicrosoft Research multimodal LLM for grounded chest X-ray report generation, localizing each described finding with bounding boxes on the image.
ImagingLanguage model35OpennessCheXagent
226711.3KAn instruction-tuned vision-language foundation model from Stanford for interpreting and summarizing chest X-rays across eight clinical task types.
ImagingLanguage model32OpennessMAIRA-1
—87—Microsoft Research multimodal LLM that generates the findings section of a chest X-ray report from a single frontal image using a CXR-specific vision encoder and Vicuna-7B.
ImagingLanguage model6OpennessCXR-LLaVA
5432363Seoul National University +1 otherOctober 22, 2023abnormality_classificationchest_x_rayinstruction_tuning+6A publicly available vision-language model that interprets chest X-rays and generates radiology reports, built on a CXR-specific image encoder and LLaMA-2 (non-commercial license).
ImagingLanguage model27OpennessCXR-CLIP
121133—Large-scale chest X-ray vision-language pretraining model that learns image-report alignment for zero-shot and few-shot radiograph classification.
Imaging18OpennessPTUnifier
7852—Chinese University of Hong Kong, Shenzhen +2 othersFebruary 17, 2023chest_x_rayfoundation_modelimage_text_retrieval+8Prompt-based medical vision-language pretraining that unifies fusion-encoder and dual-encoder architectures, handling image-only, text-only, and image-text inputs in one model.
PathologyLanguage model56OpennessPCRLv2
10082—Self-supervised pre-training framework for medical image analysis that unifies pixel restoration with contrastive feature comparison across 2D and 3D modalities.
Imaging71Openness- Shenzhen Research Institute of Big Data +2 othersSeptember 15, 2022chest_x_rayfoundation_modelimage_text_retrieval+7
Knowledge-enhanced medical vision-and-language pre-training framework that aligns, reasons over, and learns from structured medical knowledge for radiology image-text tasks.
ImagingLanguage model29Openness