Microsoft Research

Part of Microsoft

The research division of Microsoft, spanning AI, systems, and quantum computing across global labs, with sustained work in biomedicine and health.

Location: Redmond, WA

17 models (8 Protein, 7 Imaging, 5 Language model, 2 Pathology)

Models (17)

Vermeer

Microsoft Research / Broad Institute / Harvard University

Released June 1, 2026

— / — / 3

Generative microscopy foundation model that synthesizes in-silico fluorescence images of protein subcellular localization from amino-acid sequence.

Imaging, Protein

CoLiPRI

Microsoft Research / German Cancer Research Center (DKFZ) / University of Cambridge / Heidelberg University / Mayo Clinic

Released October 20, 2025

155.7K—

Vision-language encoders for chest CT that align 3D volumes with radiology reports using contrastive, report-generation, and masked-image objectives.

Imaging

Dayhoff Atlas

Microsoft Research

Released July 21, 2025

11 / — / 99

Protein language models trained on billions of natural and synthetic sequences for de novo design and zero-shot mutation-effect prediction.

Protein

LLaVA-Rad

Microsoft Research

Released February 20, 2025

7352458

Chest X-ray vision-language model that drafts the findings section of a radiology report; at 7B parameters, it is small enough to run on a single GPU.

Imaging, Language model

BiomedParse

Microsoft Research

Released November 18, 2024

169539686

Biomedical imaging foundation model that segments, detects, and recognizes structures across nine modalities from natural language prompts.

Imaging

SFM-Protein

Microsoft Research

Released October 31, 2024

3 / — / —

3B-parameter protein language model that captures short- and long-range residue co-evolution through a dual pre-training objective.

Protein

AlphaFlow-Lit

Microsoft Research

Released July 8, 2024

13 / — / —

Lightweight AlphaFlow variant that fine-tunes only AlphaFold's structure module, keeping the Evoformer frozen to cut conformational sampling cost.

Protein

MAIRA-2

Microsoft Research

Released June 6, 2024

1534.4K—

Multimodal LLM for grounded chest X-ray report generation that localizes each described finding with bounding boxes on the image.

Imaging, Language model

Prov-GigaPath

Microsoft Research

Released May 22, 2024

940 / 106.2K / 626

Whole-slide histopathology foundation model pretrained on 1.3 billion image tiles from 171,189 clinical slides spanning 31 tissue types.

Pathology

Distributional Graphormer

Microsoft Research

Released May 13, 2024

158 / — / 2.5K

Deep learning framework predicting equilibrium distributions of molecular systems, enabling efficient ensemble generation and conformation sampling.

Protein

MAIRA-1

Microsoft Research

Released November 22, 2023

92 / — / —

Radiology-specific multimodal LLM that generates the findings section of a chest X-ray report from a frontal image, pairing RAD-DINO with Vicuna-7B.

Imaging, Language model

EvoDiff

Microsoft Research

Released September 12, 2023

226 / — / 675

Discrete diffusion model for protein sequence and MSA generation, enabling controllable de novo design directly in sequence space without structure.

Protein

ABGNN

Huazhong University of Science and Technology / Microsoft Research

Released August 6, 2023

26 / — / 55

Antibody CDR design framework pairing a pretrained antibody language model with a hierarchical graph neural network for one-shot CDR generation.

Protein

LLaVA-Med

Microsoft Research

Released June 1, 2023

1.9K / 12.2K / 2.2K

Biomedical vision-language assistant for question answering on radiology and pathology images, adapted from LLaVA on PubMed Central captions.

Pathology, Language model

BiomedCLIP

Microsoft Research

Released March 1, 2023

665 / 869.7K / 127

Biomedical vision-language model trained contrastively on 15M PubMed Central figure-caption pairs for zero-shot classification, retrieval, and VQA.

Imaging

BioGPT

Microsoft Research Asia / Microsoft Research

Released October 19, 2022

1.5K / 103.4K / 4.5K

Generative transformer pretrained on PubMed abstracts for biomedical text generation and mining, including relation extraction and question answering.

Language model

CARP

Microsoft Research

Released May 19, 2022

— / — / 259

Protein language model family built on CNNs rather than transformers, matching transformer quality while scaling linearly with sequence length.

Protein