bio.rodeo
ModelsOrganizationsLeaderboardAbout
bio.rodeo

The authoritative source for evaluating biological foundation models. No hype, just honest analysis.

AboutFAQSubmit a modelContact
© 2026 Pulsatance. All rights reserved. ~
Built by Pulsatance
Language model foundation models
Language modelSmall molecule

NetMedGPT

University of Hamburg / Brigham and Women's Hospital

A transformer foundation model trained on a biomedical knowledge graph for zero-shot drug-discovery inference across indications, targets, ADRs, contraindications, and off-label uses.

Released: January 2026

Drug discovery depends on reasoning over a dense web of relationships — which drugs hit which targets, which conditions they treat, which adverse effects they cause, and which patients they must not be given to. Network medicine encodes this web as a biomedical knowledge graph, but most predictive tools built on such graphs are trained one task at a time. NetMedGPT instead treats the knowledge graph itself as the substrate for a single foundation model that can answer many drug-discovery questions at once.

NetMedGPT is a transformer-based foundation model pretrained on a large-scale biomedical knowledge graph via masked-token prediction — the same self-supervised objective used by language models, applied here to the entities and relations of network medicine rather than to natural-language text. It was developed by the Baumbach group at the University of Hamburg, with network medicine pioneer Joseph Loscalzo (Harvard / Brigham and Women's Hospital) among the authors, and released as a bioRxiv preprint in January 2026.

By learning the structure of the biomedical graph, NetMedGPT performs zero-shot inference across multiple downstream tasks without task-specific retraining, distinguishing it from sequence-based biological foundation models. It is accompanied by a hosted, interactive chat-style interface that lets users query the model directly.

#Key Features

  • Knowledge-graph foundation model: Pretrained on a large biomedical knowledge graph using masked-token prediction, learning drug–target–disease relationships rather than molecular sequences.
  • Zero-shot across five tasks: Infers drug indications, targets, adverse drug reactions (ADRs), contraindications, and off-label uses from a single pretrained model without per-task fine-tuning.
  • Strong gains over specialized baselines: Reports AUPR improvements of roughly 2.2% to 26% over task-specific baselines across the evaluated drug-discovery tasks.
  • Generative subnetwork construction: Can generate relevant subnetworks around a query, supporting mechanistic interpretation in a network-medicine framework.
  • Hosted interactive interface: A public chat-style demo (ChatNetMedGPT) makes the model directly queryable by researchers and clinicians.

#Technical Details

NetMedGPT is a transformer pretrained with a masked-token-prediction objective over a biomedical knowledge graph, so that the model learns to fill in missing entities and relations — the network-medicine analogue of masked language modeling. This self-supervised pretraining yields representations that transfer zero-shot to five distinct drug-discovery prediction tasks: indications, targets, ADRs, contraindications, and off-label uses. Across these tasks the authors report area-under-the-precision-recall-curve (AUPR) gains of approximately 2.2–26% over specialized baseline methods, and validate predictions against external evidence including ClinicalTrials.gov. Unlike sequence-based foundation models for proteins or genomes, NetMedGPT operates entirely on graph structure. The preprint is released under a permissive CC BY license and is paired with a hosted interactive interface; at the time of writing no downloadable weights or code repository is linked.

#Applications

NetMedGPT is aimed at translational researchers, pharmacologists, and clinicians working on drug repurposing, safety assessment, and target identification. Because a single pretrained model answers questions about indications, targets, adverse reactions, contraindications, and off-label use, it can serve as a general-purpose reasoning layer over biomedical knowledge — for example proposing repurposing candidates, flagging potential adverse reactions or contraindications, or surfacing plausible off-label uses for further investigation. The generative subnetwork feature and the hosted chat interface make these predictions explorable in a network-medicine context.

#Impact

NetMedGPT demonstrates that the foundation-model recipe — large-scale self-supervised pretraining followed by zero-shot transfer — can be applied to biomedical knowledge graphs, not just to molecular sequences, and that doing so can outperform task-specific models across several drug-discovery problems. The involvement of a founder of network medicine and the validation against ClinicalTrials.gov lend credibility, and the permissive license and public interface support uptake. As a 2026 preprint its results await peer review, and the absence of a downloadable model or code is a current limitation for reproduction, though the hosted interface lets others probe the model's behavior.

Openness

bio.rodeo opennessClosed · low usability and reproducibility
24Closed
Usability — can I run it?17
Reproducibility — can I retrain it?18
Model Openness Framework
Unclassified
Missing required components

Tags

drug_discoverydrug_repurposinglink_predictiontransformerfoundation_modelself_supervisedzero_shotknowledge_graphnetwork_medicine

Resources

Research PaperDemo