All Competitors
Every biological foundation model, evaluated and ranked by the bio.rodeo team
Showing 1–19 of 19 filtered models
Merlin
41912712.5KA 3D vision-language foundation model for abdominal CT that pretrains on paired scans, radiology reports, and structured EHR codes for zero-shot interpretation.
ImagingLanguage model54OpennessNeuroVFM
204—University of Michigan +1 otherNovember 23, 2025ctfoundation_modeljoint_embedding_predictive_architecture+8A generalist neuroimaging vision foundation model pretrained on 5.24M clinical MRI and CT volumes for radiologic diagnosis and report generation.
Imaging57OpennessLingshu
31554.1KA generalist medical multimodal LLM built on Qwen2.5-VL for unified medical image understanding, visual question answering, report generation, and clinical reasoning across 12+ imaging modalities.
ImagingLanguage model70OpennessUniBiomed
619215Hong Kong University of Science and Technology +2 othersApril 30, 2025foundation_modelhistologymultimodal+6Universal foundation model that jointly generates diagnostic text and segments the corresponding targets across ten biomedical imaging modalities.
ImagingLanguage model64OpennessLLaVA-Rad
58631.2KLightweight 7B vision-language foundation model from Microsoft Research, released research-only under the Microsoft Research License, that generates radiology findings from chest X-rays.
ImagingLanguage model35OpennessMINIM
158127—A self-improving text-to-image diffusion foundation model that generates synthetic medical images across multiple modalities and organs to augment downstream clinical AI tasks.
Imaging41OpennessBrainfound
—3—Tsinghua University +2 othersJanuary 10, 2025contrastive_learningcross_modality_translationdiffusion+9A multimodal vision-text foundation model for brain CT and MRI, pretrained on ~10M paired images and reports to act as a clinical copilot across seven imaging tasks.
ImagingLanguage model7OpennessBiMediX2
731620Mohamed bin Zayed University of Artificial IntelligenceDecember 10, 2024histologyinstruction_tuninglanguage_model+7A bilingual (Arabic-English) bio-medical large multimodal model built on Llama 3.1 for medical image understanding and clinical text conversation.
Language modelImagingPathology11OpennessMedRegA
452613Hong Kong University of Science and Technology +1 otherOctober 24, 2024histologyimage_classificationinstruction_tuning+8Region-aware bilingual (Chinese-English) medical multimodal LLM that handles image- and region-level vision-language tasks across eight imaging modalities.
PathologyLanguage model65OpennessPULSE
6427999A multimodal LLM fine-tuned to interpret electrocardiogram images, trained on the >1M-sample ECGInstruct dataset and evaluated on the ECGBench benchmark.
BiosignalsImaging84OpennessECG-Chat
8044—China University of Geosciences +2 othersAugust 16, 2024cardiologycontrastive_learningdisease_classification+6A multimodal ECG-language model that aligns 12-lead ECG waveforms with clinical text for conversational cardiac diagnosis and automated report generation.
BiosignalsLanguage model27OpennessLLaVA-Tri
409843A medical multimodal large language model pretrained on the 25M-image MedTrinity-25M dataset, achieving state-of-the-art accuracy on biomedical visual question answering.
Language modelPathology30OpennessMedDr
982655Hong Kong University of Science and TechnologyApril 23, 2024foundation_modelhistologyinstruction_tuning+7A 40B-parameter generalist medical vision-language foundation model spanning radiology, pathology, dermatology, retinography, and endoscopy.
ImagingLanguage model69OpennessM3D
442159873A multimodal large language model for 3D medical imaging, handling retrieval, report generation, VQA, positioning, and segmentation on CT volumes.
ImagingLanguage model77OpennessCheXagent
226711.3KAn instruction-tuned vision-language foundation model from Stanford for interpreting and summarizing chest X-rays across eight clinical task types.
ImagingLanguage model32OpennessUniBrain
39——Hierarchical knowledge-enhanced vision-language pre-training model for universal brain MRI diagnosis across 10+ diseases from multi-modal scans and reports.
Imaging35OpennessRadFM
553227—A generalist radiology foundation model that handles interleaved 2D and 3D medical scans with text for diagnosis, VQA, and report generation.
ImagingLanguage model84OpennessPathAsst
13395—A multimodal generative AI assistant for pathology, pairing the PathCLIP vision encoder with a Vicuna-13B LLM and a toolkit of eight pathology-specific models.
PathologyLanguage model17OpennessPTUnifier
7852—Chinese University of Hong Kong, Shenzhen +2 othersFebruary 17, 2023chest_x_rayfoundation_modelimage_text_retrieval+8Prompt-based medical vision-language pretraining that unifies fusion-encoder and dual-encoder architectures, handling image-only, text-only, and image-text inputs in one model.
PathologyLanguage model56Openness